-
Notifications
You must be signed in to change notification settings - Fork 80
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Sourmash ONT taxonomy questions #3480
Comments
Thank you! Flattery will get you many places :)
We do not, but would love to hear back! Please also see: contig level gather, #3095, which is being worked on (albeit back burner),
Mmmh, I do not have good intuition here. I could see it going either way:
@bluegenes thoughts?
It is not so far away - sourmash-bio/sourmash_plugin_branchwater#262 and sourmash-bio/sourmash_plugin_branchwater#520 - so if we had a good reason it would be straightforward to add. Such a reason might be you finding that it works well in small test circumstances and now you want to scale up... :) |
I really think this should help, but haven't tried it yet. I also expect newer Nanopore datasets will work better b/c of accuracy improvements over time.
What specificity are you looking for? From testing a while back, protein classification for microbes seemed to work well at the genus level, but may not be able to properly find the best species match within there.
|
Thanks a lot for your replies! I have unfortunately not had the time to test this yet, but I'll let let you know when I do:) |
First, thanks for such a great tool! Sourmash (especially with the branchwater plugin) is incredibly impressive.
Firstly, I'm interested in using sourmash for ONT metagenome taxonomic assignment. From the paper "Evaluation of taxonomic classification and profiling methods for long-read shotgun metagenomic sequencing datasets", sourmash seems to struggle a bit with ONT.
If the root issue here is low read accuracy, does anyone have any experience with pre-processing using read error correction tools such as herro or dechat?
Secondly, does anyone have experience with using translated long reads with sourmash-sketched protein databases for taxonomic assignment? Do you expect this to improve taxonomic assignment?
In relation to this, I see that base sourmash currently supports translated sketching. Is this a feature you are considering adding to the branchwater plugin?
Thanks for any answers or comments!
The text was updated successfully, but these errors were encountered: