-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Transcriptomics metadata template #75
Comments
Key transcriptomics related entities for FAIR and some ontologies include Key searching ontologies
Key searching entities (not ontologies)
|
Define own minimal set of metadata, recommendations. Selection criteria for ontologies used. |
For disease, I would use MONDO (possibly supplemented with NCIt for cancers) as it is currently the most actively developed, so most likely to respond quickly to any change requests. I definitely wouldn't use MeSH. Agreed on all the other ontologies. I'd also add
Searching entities |
Bioschema's may be an appropriate approach here to define a minimal metadata record that would be searchable on the web. |
I tried to compile a potential starting point for a recipe. Hope it makes sense to you. Really looking forward to your thoughts. Maybe we can flesh this out. Task
Define competency questions
Defining Minimal Set Of Metadata (MSOM) according to these questions
Introducing semantics into the template
Reality check
|
Link to recipe |
I think this would benefit from some structure for an actual study that involves transcriptomics data. Apart from general metadata (who did it, where, where was it stored and so on), this should have a description of the study (which includes what other measurements were done in the same study), this should follow the ISA principles. How samples were created and how the actual measurements were performed. Next, it should also link (and have an ontological description) of 1) parallel measurements (like did you also do proteomics and where do I find that info). 2) phenotypic outcome data. Like under the treatment in the study the data that was measured was blood pressure and so on, and again where you would store that. Note that, ideally, in a public study, the ISA types of data would go into Biosamples, and the other measurements would be in Biostudies, or (for other comics data) be linked from there. So our choices should ideally align with how these repositories (and of course Arrayexpress and GEO) work. (Sorry if all that was already in the cookbook) |
We had some discussion about whether this could not better be part of the catalogue model. Of course, the catalog needs to align with how data is collected. But we need to also make sure of our recipes align with a "FAIR at source" approach where people can start to collect the relevant data when they design, perform and evaluate the actual study. |
Determine which ontologies to use for transcriptomics data (meta data templates)
The text was updated successfully, but these errors were encountered: