Reworking how unknown information is handled #147
paulmillar
started this conversation in
Metadata Schema Suggestions
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
What is the problem that your suggestion solves?
Information may be unavailable for any number of reasons.
Appendix 3 provides a mechanism that aims to address this problem specifically when information is mandatory. It does this by providing a list of place-holder values that should be used under different circumstances.
There are a number of potential problems with this approach:
<title>:unap</title>
calls into question whytitle
is a mandatory metadata property: either title is mandatory (and the record shouldn't be accepted, or perhaps the definition of title needs adjusting) or a title "makes no sense" so title shouldn't be mandatory.<title>:unkn</title>
would indicate that the title is "known to be unknown", but would<title> :unkn</title>
(space before the:
) or<title> :unkn </title>
(space after then
) or<title> :unknown</title>
(extra characters after then
) also carry the same meaning. Is there a way to represent a property value with one of these standard values? More generally, are metadata property values that start with a:
now reserved?What solution might meet your needs?
An alternative approach might be to use XML attributes to carry this information.
For example:
The values of this
status
attribute would be a controlled vocabulary, similar to Appendix 3.This
status
attribute could be applied to any XML element, allowing its inclusion for metadata property of any type (string, integer, date, ...) without being confused as a valid value and without having to extend validation rules.For metadata properties with multiple cardinality (e.g,.
Creator
), thestatus
attribute could be applied to the collection element, indicating that this information is currently not available.It could also apply to individual items within the collection; e.g.,
Another alternative would be for DataCite to drop support for place-holder values.
Are they really used in practice? If so, doesn't this suggest that certain mandatory metadata property would be better as a recommended property? The publisher could simply omit the information and update the record when (and if) information becomes available. If metadata properties should stay mandatory then a publisher would need to delay obtaining a DOI until after all the mandatory metadata property values are available.
Your name
Paul Millar
Your organization
DESY
What alternatives have you tried or considered?
No response
Is there anything else you would like to share?
No response
What group(s) would benefit from your suggestion?
If other group(s), please describe.
No response
Beta Was this translation helpful? Give feedback.
All reactions