Add eligibility criteria to datasets.toml #115

J535D165 · 2024-12-02T18:02:15Z

@Rensvandeschoot and @EmilyWes extracted the eligibility criteria from the accompanying publication for each SYNERGY dataset. This PR adds the quoted sections to the TOML metadata for the SYNERGY datasets.

These sections are especially relevant to machine learning applications based on chatbot/LLM technology. For more use cases, please take a look at issue #94.

In another PR, I will update the release pipeline and add the criteria to the released dataset (in the meantime, feel free to use them directly from the TOML file).

datasets.toml

EmilyWes

I think all criteria should start and end with a quotation mark, resulting in triple quotes in the file. This makes clear that they are block quotes taken from the papers.
Currently this is not yet the case for: Appenzeller-Herzog, Bos, Brouwer, Donners, Leenaars_2020, Moran, Oud, Sep and Wassenaar.

Criteria are missing for two datasets (their names changed over time):
van_de_Schoot and (van_der_)Valk

J535D165 · 2024-12-03T14:47:40Z

Good catch on the Valk and Schoot datasets. I will open another PR.

README.md

J535D165 · 2024-12-04T08:58:31Z

Thanks for the reviews; they helped.

@EmilyWes, I didn't incorporate your first suggestion because of the TOML specification https://toml.io/en/v1.0.0#string.
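For anyone who wants to read the criteria straight from the TOML file, here is a minimal sketch. It assumes Python 3.11+ (for the standard-library `tomllib`, with the third-party `tomli` as a fallback) and a hypothetical table layout: the `[[datasets]]` array, the `key` field, the `Example_2020` entry, and the criteria text are made up for illustration, and only the `eligibility_criteria` key name comes from this PR. It also shows that the `"""` delimiters of a multi-line basic string are TOML syntax and do not end up in the parsed value.

```python
try:
    import tomllib  # standard library from Python 3.11
except ModuleNotFoundError:
    import tomli as tomllib  # third-party backport with the same API

# Hypothetical excerpt; the real datasets.toml layout may differ.
TOML_SNIPPET = '''
[[datasets]]
key = "Example_2020"
eligibility_criteria = """
Studies were included if they reported original data.
Reviews were excluded."""
'''


def load_criteria(toml_text):
    """Map each dataset key to its eligibility criteria text."""
    data = tomllib.loads(toml_text)
    return {d["key"]: d["eligibility_criteria"] for d in data["datasets"]}


criteria = load_criteria(TOML_SNIPPET)
print(criteria["Example_2020"])
```

The newline directly after the opening `"""` is trimmed by the parser, so the value begins with the first word of the quoted passage.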

Add eligibility criteria to datasets.toml

da7f8c1

J535D165 added the enhancement New feature or request label Dec 2, 2024

J535D165 requested review from Rensvandeschoot and EmilyWes December 2, 2024 18:02

J535D165 commented Dec 2, 2024

datasets.toml (outdated, resolved)

J535D165 commented Dec 2, 2024

datasets.toml (outdated, resolved)

Add statement that eligibility_criteria are blocked quotes

05386aa

EmilyWes suggested changes Dec 3, 2024

J535D165 added 2 commits December 4, 2024 09:45

Add van_de_Schoot and van_der_Valk

39daa30

Merge remote-tracking branch 'origin/add-criteria' into add-criteria

8809feb

J535D165 commented Dec 4, 2024

README.md (outdated, resolved)

J535D165 added 2 commits December 4, 2024 09:47

unicode

836c133

Update README.md

623b54e

J535D165 commented Dec 4, 2024

README.md (outdated, resolved)

Update README.md

379ccec

J535D165 merged commit ca8cb9e into master Dec 4, 2024

J535D165 deleted the add-criteria branch December 4, 2024 08:56
