It is a similar workflow as Nullarbor, but it only runs isolate specific analysis. No phylogenetic tree and pangenome analysis will be generated. A snakemake workflow that process fastq files of bacterial pathogens to produce:
- Sequencing yield (fq),
- Species identification (Kraken2),
- de novo assemblies (shovill) and their qc (fa)
- Subtyping (MLST)
- Antimicrobial resistance profile (abricate)
It is practically a snakemake workflow with all other softwares from Nullarbor.
- Nullarbor
It is best to install Nullarbor using conda.
$ conda create -n eyre nullarbor
- Snakemake
The snakemake software should be added to the Nullarbor conda environment
$ conda activate eyre
$ conda install snakemake
- Git clone the eyre repository
$ git clone https://github.com/lexleong/eyre.git
- Download Kraken2 database
Modification on the Snakefile is required to direct the pipeline script to the directory path containing kraken2 database files (hash.k2d, taxo.k2d, and opts.k2d).
-
Modify the sequencing submission sheet as per the SampleSheet.csv template
-
Run the bcl2eyre.sh script
$ bcl2eyre.sh [dir]/SampleSheet.csv
The Nullarbor is a huge treeless plain that spans the area between South Australia and Western Australia. If one were to travel from Adelaide to Nullarbor, one will have to go through Eyre Peninsula. So a South Australian public health microbiologist has to perform the Eyre pipeline prior to the Nullarbor pipeline for their bacterial WGS.
This workflow is specific to working in slurm and conda environments. It is used mainly by SA Pathology MID PHL for downstream processing of bacterial whole genome sequencing output.