-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
dwc:GeologicalContext: Chronostratigraphy vocabulary - curation before uploading first vocabulary version #121
Comments
|
I will setup all the verbatim field tabs tomorrow and let you know when it is ready. |
I have now set up 10 tabs for each field related to chronostratigraphy. Duplicate/identical values have been removed, although the same value may appear in e.g. "", () or similar - please map these to concepts as well although they appear to be duplicates. If a value does not belong to any of the concepts, please leave it unmapped. You may also want to take a look at the suggested definitions tab where you can fill out definitions and descriptions for the concepts (including time period) according to authoritative sources. @ekrimmel - I have heard you also have a Slack channel assigned for this work. Feel free to add me if you find it useful for me to be part of it. |
Following meetings with the Paleo Working Group in CPH this week, we have decided that we want one search term for stratigraphy (all 10 dwc fields), 1 search term for lithistratigraphy (combining 4 dwc fields) and 1 field for biostratigraphy (combining two dwc fields). So we will reduce 16 dwc fields to 3 in searches - see this issue: gbif/gbif-web#497. Now, how should I set up the vocabular(y/ies) on the vocabulary server for this?
|
Again, I strongly support this. Question, does "Range" refer to text or numeric values? Numeric values are more precise, but a moving target. If using IUGS values, use only the ratified values and not numbers (or text) harvested from issues of "Episodes" where values are not finalized. I've seen some wild ones recently. |
The plan is to use the numerical age from the most recent ICS source: https://stratigraphy.org/ICSchart/ChronostratChart2023-09.pdf. I do not see any mention of IUGS values, but I do see this specification:
Would you then advice GBIF not to use the uncertain ages (~)? @ekrimmel and others, we did not discuss this, but you may want to chime in? Just to be clear - the numerical ages would be used to structure data in the back end to enable more dynamic searches on paleo data. What users would see and search for would most likely be the concepts themselves. |
The vocabulary concepts are now uploaded to UAT and PROD. @MortenHofft this was what you needed for the hosted portal, right? Now we just need to add the hidden value mappings when they are ready. |
We are working on this again! Sorry for the long delays between action :) |
No worries - thank you for dealing with the mappings and let me know if you have any questions for the rest of them. |
We now have the potential flags and issues included. They still require proper documentation. |
Originally posted by @CecSve in #120 (comment)
A Chronostratigraphy vocabulary would cover concepts across multiple terms in the dwc:GeologicalContext category (gbif/pipelines#400 (comment)):
The vocabulary follows the vocabulary published by the CGI Geoscience Terminology Working Group hosted by the International Commission on Stratigraphy (ICS) (https://vocabs.ardc.edu.au/viewById/196, gbif/pipelines#400 (comment), https://github.com/CSIRO-enviro-informatics/interactive-geological-timescale/blob/master/src/assets/timeline_data.json, https://stratigraphy.org/timescale/).
Here is a file to edit:
https://docs.google.com/spreadsheets/d/1k3YpAeRT3HxR9DBnkh0jkZZl12jimkHU3_H_pCPOUHc/edit?usp=sharinghttps://docs.google.com/spreadsheets/d/1aHqhhtO93nooQ0o4AAVcSBVpyb-IGUXu9dZVTiN77TY/edit#gid=694447980 (updated version that supports numerical ranges for the time scales - version to be implemented)
It contains:
a list of the values already mapped to the concepts (they are all in the Hidden sheet/tab for now)the GBIF verbatim values for this field that appear more than 10,000 times or in 5 or more datasetsthis tab contains all verbatim values for the relevant terms from here: https://github.com/tdwg/dwc-qa/tree/master/data/GBIFDistinctValues/2022-03-08Please check instructions here: #70
The text was updated successfully, but these errors were encountered: