EPITOME: Enhanced Phylogenetic Inference Through Optimized Mapping Efficiency

EPITOME condenses a diverse set of DNA sequences (species-level or closer) into discrete, composite sequences that represent the overarching diversity of the dataset. In other words, EPITOME creates sequences that are the epitome of the dataset diversity. This is accomplished by clustering the input based on pairwise genetic distances and then selecting the most common nucleotide at each genomic position (ties selected at random). When the genetic distance is based on read mapping efficiency, EPITOME creates a set of reference genomes for consensus-based assembly pipelines, like VAPER or viralrecon.

See the wiki for more information.

Quick Start

Step 1. Create your samplesheet

Note: Nextflow requires absolute paths in samplesheets Create a samplesheet containing the taxa name, genome segment, path to a multi-fasta file of sequences for the taxa, and the expected sequence length (within 25%). samplesheet.csv:

taxa,segment,assembly,length
Influenza_A,HA,flu-a_HA_NCBI_2024-4-1.fasta,1950
Influenza_A,NA,flu-a_NA_NCBI_2024-4-1.fasta,1400
Measles,wg,measles_NCBI_2024-4-1.fasta,16000

Step 2. Run EPITOME

Run EPITOME using the command below.

Note: See the wiki for how to assign references with existing subtype classifications (e.g., H1-H9) using the --seeds parameter.

nextflow run DOH-JDJ0303/epitome \
    -r main \
    -profile singularity \
    --input samplesheet.csv \
    --outdir results

Name		Name	Last commit message	Last commit date
Latest commit History 84 Commits
.devcontainer		.devcontainer
.github		.github
assets		assets
bin		bin
conf		conf
dockerfiles		dockerfiles
docs		docs
lib		lib
modules		modules
subworkflows/local		subworkflows/local
workflows		workflows
.editorconfig		.editorconfig
.gitattributes		.gitattributes
.gitignore		.gitignore
.gitpod.yml		.gitpod.yml
.nf-core.yml		.nf-core.yml
.pre-commit-config.yaml		.pre-commit-config.yaml
.prettierignore		.prettierignore
.prettierrc.yml		.prettierrc.yml
CHANGELOG.md		CHANGELOG.md
CITATIONS.md		CITATIONS.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
README.md		README.md
main.nf		main.nf
modules.json		modules.json
nextflow.config		nextflow.config
nextflow_schema.json		nextflow_schema.json
pyproject.toml		pyproject.toml
tower.yml		tower.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

EPITOME: Enhanced Phylogenetic Inference Through Optimized Mapping Efficiency

Quick Start

Step 1. Create your samplesheet

Step 2. Run EPITOME

About

Releases 6

Packages

Contributors 3

Languages

License

DOH-JDJ0303/epitome

Folders and files

Latest commit

History

Repository files navigation

EPITOME: Enhanced Phylogenetic Inference Through Optimized Mapping Efficiency

Quick Start

Step 1. Create your samplesheet

Step 2. Run EPITOME

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases 6

Packages 0

Contributors 3

Languages

Packages