-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Consensus Sequence #3
Comments
@j23414 Thanks for the suggestion. It's a fine idea and I think falls within the intended scope of |
I can now build a consensus table that counts the characters in each column of the alignment. If we want to reduce these to a single consensus string, then we need to decide how to resolve ties. ``` >seq_1 AAATATAT >seq_2 AAAATTAT >seq_3 AAATATAA ``` ``` A T 3 0 3 0 3 0 1 2 2 1 0 3 3 0 1 2 ```
I've partially implemented a consensus function over on the dev branch. You can check out the comment in the commit message. The current output is a table of character counts across the columns in the alignment. But if we want to go from this table to a single consensus string, we will need to decide how to resolve ties. |
For ties, would it make sense to output both options (both consensus sequences)? Although I could see that becoming many consensus sequences (ties in more than 1 position)... |
I can't do that in FASTA format. Perhaps if there is a near tie, I could use a wildcard, like |
As a standard input for LOGO plots would be acceptable. By multiple consensuses I'm talking about:
I'm looking at influenza virus sequence which group as H1.alpha, H1.beta, H1.gamma, etc... This might explain why a vaccine works for alpha (targets certain amino acids), but not for gamma, etc... No worries either way, I can work with the table of character counts. : ) |
Hmm, I'd worry about a combinatorial explosion. |
@j23414 You can check out the lastest version (2.13.0) of Is good? |
Good! : ) Consensus and table look good.
|
Any thoughts on
smof
being able to output a consensus sequence when given an alignment file?input fasta
output fasta
The text was updated successfully, but these errors were encountered: