Add dataset: WWI_documents_dataset #52

Open
1 task done
Skorkmaz88 opened this issue Jul 13, 2022 · 2 comments
Labels
dataset Dataset to be added

Comments

@Skorkmaz88

A URL for this dataset

https://rdf.muninn-project.org/

Dataset description

This dataset covers WWI archives, specifically the documents subcategory of the store linked above. The data is in linked-data format. I am currently prototyping a converter to tabular format that consumes the archive's SPARQL endpoint. The final output will contain, for each entry: name, label, primary topic of the document, scanned images of first_page and last_page (I plan to omit any other pages, even if they are available), access rights, and country of origin.
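
As a rough illustration of the linked-data-to-tabular step described above, here is a minimal sketch that flattens a response in the standard SPARQL 1.1 JSON results format into rows and writes them as CSV. It is not the actual converter: the column names are assumptions derived from the field list above, the sample response is made up, and no query against the real endpoint is shown.

```python
import csv
import io

# Hypothetical column names, based on the fields listed in the description.
COLUMNS = [
    "name", "label", "primary_topic", "first_page",
    "last_page", "access_rights", "origin_country",
]


def bindings_to_rows(sparql_json, columns=COLUMNS):
    """Flatten a SPARQL 1.1 JSON result into a list of plain dicts."""
    rows = []
    for binding in sparql_json["results"]["bindings"]:
        # Unbound (e.g. OPTIONAL) variables are absent from a binding,
        # so fall back to an empty string for those columns.
        rows.append(
            {col: binding.get(col, {}).get("value", "") for col in columns}
        )
    return rows


def rows_to_csv(rows, columns=COLUMNS):
    """Serialise the flattened rows as CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=columns)
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Made-up sample response, for illustration only (not real archive data).
SAMPLE = {
    "head": {"vars": COLUMNS},
    "results": {
        "bindings": [
            {
                "name": {"type": "literal", "value": "doc-001"},
                "label": {"type": "literal", "value": "Attestation paper"},
                "primary_topic": {"type": "uri", "value": "http://example.org/person/1"},
                "first_page": {"type": "uri", "value": "http://example.org/img/1-first.jpg"},
                "access_rights": {"type": "literal", "value": "open"},
                "origin_country": {"type": "literal", "value": "Canada"},
            }
        ]
    },
}

if __name__ == "__main__":
    print(rows_to_csv(bindings_to_rows(SAMPLE)))
```

Missing values (here `last_page`) end up as empty cells, which keeps every row the same width regardless of which fields an entry actually has.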

The dataset could be used for classifying historical, WWI-era documents, for example looking at a document and classifying it as an attestation paper of Canadian origin.

Dataset modality

Mixed

Dataset licence

Other license

Other licence

Each item will have a licence associated with it

How can you access this data

Via an open API

Confirm the dataset has an open licence

  • To the best of my knowledge, this dataset is accessible via an open licence

Contact details for data custodian

No response

@Skorkmaz88 Skorkmaz88 added the candidate-dataset Proposed dataset to be added label Jul 13, 2022
@davanstrien
Collaborator

This sounds great, thanks for suggesting it! If you also want to work on adding this feel free to use the #self-assign command to assign yourself to work on this.

@Skorkmaz88
Author

#self-assign

@davanstrien davanstrien changed the title Add dataset: [WWI_documents_dataset] Add dataset: WWI_documents_dataset Jul 28, 2022