The NDE Requirements for datasets describes how a dataset description should be defined. The guideline promotes the use of a part of the Schema.org vocabulary (sdo
) for this purpose.
With this vocabulary aspects like ownership, creators, license, titles, data locations etc. are all described using a simple conceptual model of 6 key elements: DataCatalog, Dataset, Distribution (or DataDownload), Organization and Person (not shown):
The Linked Data Sets module translates the various parts of the conceptual model to Omeka S resources (Items) with the appropiate properties and lists, so that Omeka-S can be used to create the descriptions of a datacatalog, dataset and distribution.
The table below shows how the elements from the guideline correspond to the resource templates which are added by this module.
Concept in guideline | Usage | Type | Resource template |
---|---|---|---|
DataCatalog | Top element, has collection of Datasets | sdo:DataCatalog | LDS DataCatalog |
Dataset | Has links to zero or more Distributions *) | sdo:Dataset | LDS Dataset |
Distribution | Element that references a datadump (file) or data endpoint (API) *) | sdo:DataDownload | LDS Distribution |
Organization | Element used to represent an organization (for publisher, creator and/or funder) | sdo:Organization | LDS Organization |
Person | Element used to represent a person (for publisher, creator and/or funder) | sdo:Person | LDS Person |
*) When a Dataset has an sdo:isBasedOn
property which has one (or more) Itemsets as value, the module can crawl this dataset and output the RDF in the (in the Distribution) defined property sdo:encodingFormat
. See section 2 of the Usage.