-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add dataset: odeuropa_smell_objects #71
Comments
Happy to help anyone who wants to work on this. I have a WIP loading script for another COCO formatted dataset: https://huggingface.co/datasets/biglam/nls_chapbook_illustrations |
Also, I really want to call this dataset |
I'd love to work on this! Will be a good change from the text datasets so far. |
#self-assign |
Awesome, and don't worry if you can't finish this before you go away. It can wait until you're back too 🙂 |
Hopefully, I should be able to get it done. From the Zenodo page:
Should the dataset just contain the links to the images then? |
Yes I think that would be best for this one. We can provide example code for downloading the images in the datacard. |
@davanstrien This dataset has a lot of associated metadata
Should they all be included in the dataset? Most of them are missing, from a cursory glance at the data. |
My own feeling would be to include as much as possible. One option if things are often missing would be to put some of this metadata in an additional metadata column as a dictionary? This way it doesn't get lost but also is slightly less distracting than having a lot of columns with mostly missing data? |
Yeah, I was building out the features as follows:
I'll probably get back to this in about two weeks, after I come back from vacation |
Have a great vacation! |
@davanstrien I'm back to working on this dataset, but it seems like the URLs aren't accessible. Even the download script provided in the dataset gives the following error: |
@shamikbose hey, hope you had a good break! I'll try and take a look at this too but also tagging @kiymetakdemir who works on this project and might be able to help with this. |
@davanstrien I did! It was a much needed break |
Hi @shamikbose, can you check it again? Now I tried to download the images with the given script but I haven't encountered any error, it downloaded successfully. |
@kiymetakdemir I was able to download them today. Thanks! |
@kiymetakdemir I get an error for this URL (http://134.76.24.240/download/07876601/flc0596164z_p?Expires=1610722060&Signature=SX15SE0B~KbZ7yvkTJtis1rsKysZddvhsxJzZSZ7oZoxqd~NNsKp22iYZGBQViGXMy7zwTDCYxu-Qan2O0aq2QxizENey~CF4WIV5-~bHwEZZjrmCoBdWDEeS0Y6XNajZ6DYzWQolxkiGWoqLs~Bw0j4GSrQef7QvgQciIWDlTE_&Key-Pair-Id=APKAJGHHKKX2FHRP63AQ) It's not accessible |
A URL for this dataset
https://doi.org/10.5281/zenodo.6367776
Dataset description
From the Zenodo page:
Object detection datasets are time consuming to collect and there are relativlely few datasets for object detection that use LAM data. Those that do exist often use the output of one of the various YOLO models which may be of some interest but often includes categories which are unlikely to be particularly useful for research/curation of LAM collections. This dataset, in contrast, includes categories related to smell: a topic of interest to both art historians and social historians. As a result, this dataset offers a much richer exploration of the possibilities of using object detection with historical paintings.
Dataset modality
Image
Dataset licence
Creative Commons Attribution 4.0 International
Other licence
No response
How can you access this data
Other
Confirm the dataset has an open licence
Contact details for data custodian
No response
The text was updated successfully, but these errors were encountered: