Consensus on naming scheme #19

GallVp · 2024-04-18T04:22:58Z

Current scheme: AGAT

Ross's Perl script (gene.t1, gene.t2, etc.)
mRNA or transcript (GFF3 requires mRNA)
liftoffID is acceptable (Ross)
For a gene whose isoforms have different descriptions: description=differing%20isoform%20descriptions

GallVp · 2024-04-18T20:55:39Z

Summary of a call with @rosscrowhurst

NCBI requires that the text in the 'product' attribute adheres to a set of rules. These rules keep changing, and we do not know of an automated validation tool.
JBrowse2 picks the 'description' or 'note' attribute from the gene feature to display as the annotation text. This capability is a high priority for us because many fairGenomes/JBrowse2 users have requested it. Therefore, we should populate the 'description' attribute for the gene features.
We use eggnogmapper to obtain functional annotations for transcripts. These annotations should be stored both at the transcript and the gene level under the 'description' attribute.
An experimental feature of pangene is to support multiple isoforms. This might be dropped later. If a gene has multiple isoforms and they have different functional annotations, this indicates a likely problem with gene prediction. In such a case, the gene-level 'description' will be 'differing isoform descriptions'.
To avoid pesky formatting failures, we should use URL encoding. Thus, the above description will be stored as: description=differing%20isoform%20descriptions
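
A minimal Python sketch of the rule above, not the pangene implementation. The function names are hypothetical, and skipping empty descriptions is an assumption the call did not cover. It chooses the gene-level text from the isoform descriptions and URL-encodes it with the standard library's `urllib.parse.quote`.

```python
from urllib.parse import quote

# Placeholder text agreed on the call for genes whose isoforms disagree.
DIFFERING = "differing isoform descriptions"


def gene_description(isoform_descriptions):
    """Return the URL-encoded gene-level description, or None if there is none.

    Assumption: empty or missing isoform descriptions are ignored.
    """
    unique = {d for d in isoform_descriptions if d}
    if not unique:
        return None
    # One shared description is copied to the gene; otherwise use the placeholder.
    text = unique.pop() if len(unique) == 1 else DIFFERING
    # safe="" makes quote() percent-encode every reserved character,
    # including ';', '=', ',' and '&', which are special in GFF3 column 9.
    return quote(text, safe="")


def description_attribute(isoform_descriptions):
    """Format the result as a GFF3 attribute, or '' when nothing is available."""
    encoded = gene_description(isoform_descriptions)
    return f"description={encoded}" if encoded is not None else ""


print(description_attribute(["Protein kinase", "Zinc finger protein"]))
```

With `safe=""`, characters such as `;` inside an eggnogmapper description are also escaped, so they cannot be mistaken for attribute separators.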

GallVp · 2024-04-29T02:25:06Z

Notes:

@jasonshiller, @rosscrowhurst, @CeciliaDeng Global transcript numbers are confusing. BRAKER uses t1, t2 and that's what we should use. Convention: geneXX.tYY
@rosscrowhurst A single naming scheme might not work for every case and every user
Provide transformation tables/web apps for pan-genome
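
A rough Python sketch of the geneXX.tYY convention, assuming transcripts arrive as (gene ID, old transcript ID) pairs in file order. The function name and the input IDs are hypothetical. The returned mapping can also serve as a simple transformation table from old to new IDs.

```python
from collections import defaultdict


def per_gene_transcript_ids(transcripts):
    """Map each old transcript ID to '<gene_id>.t<N>', with N restarting at 1 per gene.

    `transcripts` is an iterable of (gene_id, old_transcript_id) pairs in file order.
    """
    counters = defaultdict(int)  # next transcript number for each gene
    mapping = {}
    for gene_id, old_id in transcripts:
        counters[gene_id] += 1
        mapping[old_id] = f"{gene_id}.t{counters[gene_id]}"
    return mapping


# Hypothetical input using global transcript numbers.
table = per_gene_transcript_ids([
    ("gene1", "mRNA_17"),
    ("gene1", "mRNA_18"),
    ("gene2", "mRNA_19"),
])
for old_id, new_id in table.items():
    print(old_id, new_id, sep="\t")
```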

GallVp · 2024-10-06T21:37:50Z

GallVp added the "discussion needed" label Apr 18, 2024

GallVp added this to the 0.4 milestone Apr 23, 2024

GallVp self-assigned this Jun 20, 2024

GallVp added the "done on dev" label Jun 20, 2024

GallVp closed this as completed Oct 6, 2024
