-
Notifications
You must be signed in to change notification settings - Fork 37
Consider displaying or documenting more detail about association sources #1604
Comments
I'll bring this up on one of the GO calls. The overlap between GO and
phenotype annotations is and has been a long standing issues. MGI is
particularly unusual because they have two completely separate annotation
groups for this respective functions. Seems to me it would be extremely odd
not to consider this related to a heart development phenotype. And if GO is
not used we (Monarch) would lose a tremendous amount of valuable
information for insight into disease manifestations. This is a much larger
discussion.
|
Perhaps this is something that I can bring up internally to the MGI team and suggest possible ways of improvement. Can we talk about this at some upcoming meeting?
|
I have described this issue as far back as ~2007 perhaps ;-). I do think the provenance description as Matt suggests is a good idea. At ZFIN they have some general curation guidelines for when to make one annotation and not another, perhaps a survey of MODs to assess this information at this time would be helpful. |
My concern goes deeper than provenance. MGI deliberately did not go down this path of asserting phenotype annotations from GO IMP annotations. It is often different in mouse from other organisms that cellular based phenotype assertions frequently do not translate accurately to whole mouse phenotype. A couple of existing examples of this in your data that I found quickly include:
I can get you more examples... I think you will get better accuracy if there is a mouse mutant allele ID in the annotation assertion that the data is inferred from - this indicates that an actual mouse was the source of the GO annotation. Matt's Bmpr1a example above was not necessarily egregious or wrong, but it duplicates existing phenotype data that has more detailed information (more specific phenotype term, specific mouse mutation, mouse strain background, etc...). I'm not sure what value the derived annotation from the GO is adding in this case. Happy to discuss further. |
Sorry, reading the entire thread now I may have misunderstood initially. It would be great to discuss this with Cindy as to how to best represent this information! |
@cindyJax this is very informative. My understanding is that the go-phenotype annotations are for normal phenotypes, which fall under the Biological process phenotype sub tree in the uber pheno ontology. Could you link to the sp9 and Fam126a? I'm not able to track them down my self. The provenance for these won't be obvious from examining the graph alone, but I agree we need to document why this choice is made and make this available for these annotations. |
@selewis I appreciate that you have concerns about the separate streams for GO and phenotype annotation. There are simpler ways to solve the disconnect problem than having MGI upend our entire curation workflows. In fact, we had a QC process that was set up to tag papers with GO IMP annotations for high-priority evaluation by the phenotypes group. I had a closer look at this and the process is clearly broken and has been for quite some time. We will take care of this. @kshefchek Where did you want me to link to ? |
@cindyJax thanks for your comments. I think the phenotypes derived from IMPs should be switched off (and more generally improve the provenance reporting, but in this case it's not an issue if we simply switch these off) |
@cindyJax I was referring to your statement:
and was hoping you could point me to these examples in our data (were they on our site, flat files?) @cmungall my understanding is that this is required by our R24 - monarch-initiative/dipper#410 (comment) |
MGI was surprised (displeased?) to find themselves referenced as a source of certain phenotype annotations on genes that they did not assert. e.g. for Bmpr1a
This annotation is derived/inferred by Monarch a GO:heart development' annotation with an IMP evidence code. MGI curators make this GO annotation and report it on their site, but MGI does not make or report the assertion that Bmpr1a is associated with a heart development phenotype.
they raised this concern in a recent meeting - i.e. that it looked on our site like this assertion came from MGI, and that they many not want credit for this.
It may be worth considering how to describe provenance at a more granular level, to be clear about what source made the actual assertion, vs what sources provided information that was used in some way to support the assertion. In this example, it would be nice to know that this phenotype association was made by Monarch, as inferred from an IMP-based GO annotation made by MGI that Bmpr1a is involved in GO:heart development.
The text was updated successfully, but these errors were encountered: