This snakemake pipeline can conduct cov-19 virus classification, de novo assembly, coverage assessment and variant calling.
The pipeline is built according to https://github.com/BGI-IORI/nCoV_Meta (preprint: https://doi.org/10.1101/2020.03.16.993584)
Differences between mgi-ncov19-snakemake and nCoV_Meta:
- low complexity reads removal were implemented with fastp (bgi: prinSEQ)
- kraken2 was employed in this snakemake pipeline (bgi: kraken1)
- SOAPnuke v2 (bgi: SOAPnuke v1) (better to change to SOAPnuke v1)
- not yet finished with the alignment and variant calling steps.
updated: 2020-04-14
git clone [email protected]:huyue87/mgi-ncov19-snakemake
# 2.1 load input files (paired-end raw reads)
cd mgi-ncov19-snakemake/input
ln -s Sample_{1,2}.fq.gz .
# 2.2 run de novo assembly and generating sam files
cd mgi-ncov19-snakemake
snakemake --use-conda -n
snakemake --use-conda