-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Updated output documentation for yak triobin #1
Comments
|
Thanks for the quick answer! That's what I guessed, but wanted to make sure before proceeding. Thanks for the tool! |
Forgot to say that you can ignore most of other columns. Those are mostly for debugging purpose. |
Dear @lh3, We are testing out trio binning and it looks like our binned assemblies are more fragmented than the non-binned assemblies. Both haplotypes have good coverage. Is there a way to adjust the triobinning step to be more specific? I.E. require more p/m markers? What is the meaning of these options?:
Do you have any suggestions for improving binning at the counting stage? |
By default, if a k-mer occurs 5 times or more in mother but occurs twice or less in father, the k-mer is considered to be a mother-specific k-mer. The label on the 2nd column is determined by the rest of columns under complex rules coded in function It is hard to get perfect trio binning. Hifiasm effectively uses the HiFi assembly graph to fix binning errors. Without doing that, hifiasm would only get ~10Mb N50, comparable to trio HiCanu. |
For a simple way to increase specificity:
|
Hi @lh3, Thank you for sharing these ideas. Just confirming, you think triobinning isn't as effective as just assembling and phasing in a single genome? That has been my experience, at least using yak and HifiASM/IPA. |
Yes, when HiFi phasing and trio phasing are inconsistent, HiFi phasing is often the correct one. |
In early days, we tried hicanu trio binning. I manually inspected many differences between hicanu and yak binning. I think yak is generally more accurate. Nonetheless, the assembly with hicanu binning is similar to the assembly with yak binning. |
Also, hifiasm applies trio binning to error corrected reads. This noticeably improves the binning accuracy: there are much fewer inconsistencies between trio phasing and hifi read phasing. |
yak/triobin.c
Line 176 in 6de3aff
Do you have any updated documentation for the output of
yak triobin
? I'm looking at the output of verison r43, which has 13 columns as opposed to the 10 columns documented in the help text. I'm especially trying to understand column 2, which has valuesm
,p
,a
, and0
.The text was updated successfully, but these errors were encountered: