You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am currently applying what I learned from RNA-seq by example to my own dataset and have run into what seems to be a common issue when passing an NCBI annotation file (GFF format) to featureCounts.
The feature.gff in the RNA-seq by example demo has this format:
gene_name=AAA-750000-UP-4; gene_id=AAA-750000-UP-4; transcript_id=AAA-750000-UP-4-T; exon_number=1;
But NCBI datasets genome.gff has this format:
ID=geneLOC100125545;Dbxref=GeneID:100125545;Name=LOC100125545;gbkey=Gene;gene=LOC100125545;gene_biotype=protein_coding
But the fixes are really a nuisance for someone who is new to programming. Many are trying to parse the GFF files, to create SAF (or other weird) formats.
Just a suggestion that maybe it would be helpful to point out to the reader that a GTF format as provided by ensembl will work without any complex editing of the reference annotation files.
Love the book. BIG help.
The text was updated successfully, but these errors were encountered:
Hi,
I am currently applying what I learned from RNA-seq by example to my own dataset and have run into what seems to be a common issue when passing an NCBI annotation file (GFF format) to featureCounts.
The feature.gff in the RNA-seq by example demo has this format:
gene_name=AAA-750000-UP-4; gene_id=AAA-750000-UP-4; transcript_id=AAA-750000-UP-4-T; exon_number=1;
But NCBI datasets genome.gff has this format:
ID=geneLOC100125545;Dbxref=GeneID:100125545;Name=LOC100125545;gbkey=Gene;gene=LOC100125545;gene_biotype=protein_coding
The error caused is pretty common
But the fixes are really a nuisance for someone who is new to programming. Many are trying to parse the GFF files, to create SAF (or other weird) formats.
Just a suggestion that maybe it would be helpful to point out to the reader that a GTF format as provided by ensembl will work without any complex editing of the reference annotation files.
Love the book. BIG help.
The text was updated successfully, but these errors were encountered: