-
Notifications
You must be signed in to change notification settings - Fork 105
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support for the BED12 format #108
Comments
Do you have something specific in mind? |
I'm not familiar with c codes, so I don't know if it is possible. For example, the Interval object only contains information of first 6 fields (chrom, stat, end, name, score, strand), and it could not efficiently parse other fields like cds_start, cds_end, exon_starts and exon_ends in the BED12 format. Another issue is that in bedtools, there is options like -split in getfasta to support BED12 format, but in pybedtools, this option was omitted. |
Ah, I see what you mean with respect to However, any field of an block_starts = x[11] And then handle the string yourself, maybe like: block_starts = [int(i) for i in x[11].split(',')] In general, pybedtools doesn't do anything special with BED files over 3 fields. It just splits on the tab characters and provides aliases to each position. For example, I think what you might want is some sort of object representing a transcript model, which would parse a BED12 line and make available all the exons, CDSs, and introns. Something like this might be better suited for gffutils, where @yarden has already proposed something similar at daler/gffutils#21. Alternatively, you could write a class that accepts an As for your second issue regarding |
Thanks for your detailed explanation. In past, I always parsed BED12 files using raw python codes, and its performance is good enough. The reason why I open this issue is just to find whether pybedtools has any more convenient way to solve this problems. |
I read the document, and found only a few functions support the BED12 format. Is it possible to add more supports for the BED12 format? Thanks!
The text was updated successfully, but these errors were encountered: