BIO550_Bioinformatics_Project

Semester Final Project for BIO 550 - Bioinformatics. All information and relevant code is included in the PDF, and with time I would be more than happy to walk through the report in more detail.

In summary, though, the goal of the project was to assemble and identify an Escherichea coli genome present in provided read data. The primary steps were as follows:

Visualizing read data from Nanopore and Illumina sources and comparing major differences
Appropriate quality and read length filtering
Assembling contigs and comparing the different assemblers
Running and parsing the results of a pre-written taxonomized BLAST script, as prepared by the professor
Using QUAST to better compare assembler results after BLAST filtering
Running & comparing the results of genome annotation programs PROKKA and DFAST
Using samtools and analyzing the results of mapping our original reads to our now-assembled reference genome

In the end, we were able to assemble and annotate a genome of Escherichea coli using the methods and steps listed above.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
Bioinformatics_Final_Project.pdf		Bioinformatics_Final_Project.pdf
README.md		README.md
runParse.pl		runParse.pl

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BIO550_Bioinformatics_Project

About

Releases

Packages

Languages

matren395/BIO550_Bioinformatics_Project

Folders and files

Latest commit

History

Repository files navigation

BIO550_Bioinformatics_Project

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages