Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: drugchat dataset #341

Open
wants to merge 2 commits into
base: main
Choose a base branch
from
Open

feat: drugchat dataset #341

wants to merge 2 commits into from

Conversation

alxfgh
Copy link

@alxfgh alxfgh commented Jun 29, 2023

Dataset of SMILES, Questions & Answers from Liang, Zhang et al., 2023. This closes #293.

@MicPie MicPie self-requested a review July 24, 2023 15:31
Copy link
Contributor

@MicPie MicPie left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hi Alex,
I just changed two filenames so they are the same with the rest of the datasets.
As this dataset is slightly different we would need some text templates to properly embed the data for the model.
Please let me know if you have time/want to look into that! .-)
Best,
Michael

@MicPie MicPie changed the base branch from main to text_sampling July 24, 2023 16:25
@MicPie MicPie changed the base branch from text_sampling to main July 24, 2023 16:25
- id: Question
type: string
description: Question about SMILES
license: CC BY 4.0
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

are we sure about this license? The repo seems to be under a BSD license?

Comment on lines +3 to +4
PUBCHEM_DATASET = "alxfgh/PubChem_Drug_Instruction_Tuning"
CHEMBL_DATASET = "alxfgh/ChEMBL_Drug_Instruction_Tuning"
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

can we put the requests.get code her or make those paths customizable?

@kjappelbaum kjappelbaum mentioned this pull request Oct 13, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Add DrugChat data
3 participants