Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

chemcaption smarts #456

Merged
merged 3 commits into from
Oct 29, 2023
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
41 changes: 41 additions & 0 deletions data/tabular/chem_caption_smarts/meta.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,41 @@
---
name: chem_caption_smarts
description: |-
This dataset contains the count of substructures in molecules
targets:
- id: smarts
type: text
description: substructure smarts
names:
- noun: SMARTS
- noun: SMiles ARbitrary Target Specification (SMARTS)
- id: completion
type: categorical
description: number of matches
- id: completion_labels
type: text
description: name of the substructure
identifiers:
- id: representation
type: text
description: representation
- id: representation_type
type: text
description: representation type
license: CC BY 4.0
links:
- url: https://github.com/lamalab-org/chem-caption
description: Original codebase used to generate this dataset
templates:
- |-
Question: {#How many times|How often!} does the {#molecule|chemical|compound|chemical structure!} with {representation_type#} {representation#} contain the substructure with the {smarts__names__noun} {#smarts#}?
Answer: {completion#}
- |-
Question: {#How many times|How often!} does the {#molecule|chemical|compound|chemical structure!} with {representation_type#} {representation#} contain a {completion#} substructure?
Answer: {smarts__names__noun} {#smarts#}
- |-
User: {#I want to|I have to|I must|I would like to!} know {#how many times|how often!} the {#molecule|chemical|compound|chemical structure!} with {representation_type#} {representation#} contains the substructure with the {smarts__names__noun} {#smarts#}.
Assistant: The {#molecule|chemical|compound|chemical structure!} with {representation_type#} {representation#} contains the substructure with the {smarts__names__noun} {#smarts#} {completion#} times.
- |-
User: {#I want to|I have to|I must|I would like to!} know how many times the {#molecule|chemical|compound|chemical structure!} with {representation_type#} {representation#} contains a {completion#} substructure.
Assistant: The {#molecule|chemical|compound|chemical structure!} contains the substructure with the {smarts__names__noun} {#smarts#} {completion#} times.
Loading
Loading