OpenBioML/datasets

bio-datasets

PubChem Compound Dataset

The code for processing and converting the PubChem Compound dataset can be found in datasets/pubchem. The process_data.py script downloads the SDF file, converts the canonical SMILES representation to SELFIES, and saves the result in a JSONL file.
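The conversion step might look roughly like the sketch below. It is not the code in process_data.py: the function names, the SDF tag names (`PUBCHEM_COMPOUND_CID`, `PUBCHEM_OPENEYE_CAN_SMILES`), and the output keys (`cid`, `smiles`, `selfies`) are assumptions made for illustration, and the download step is omitted. By default it relies on the third-party `selfies` package and its `selfies.encoder` function, but any SMILES-to-SELFIES callable can be passed in instead.

```python
import io
import json
from typing import Callable, Dict, Iterable, Iterator, Optional, TextIO


def iter_sdf_fields(lines: Iterable[str]) -> Iterator[Dict[str, str]]:
    """Yield one {tag: value} dict per SDF record (records end with '$$$$')."""
    fields: Dict[str, str] = {}
    tag: Optional[str] = None
    for raw in lines:
        line = raw.rstrip("\n")
        if line.startswith("$$$$"):
            yield fields
            fields, tag = {}, None
        elif line.startswith(">") and "<" in line and ">" in line[1:]:
            # Data header such as "> <PUBCHEM_COMPOUND_CID>"
            tag = line[line.index("<") + 1 : line.rindex(">")]
            fields[tag] = ""
        elif tag is not None:
            if line.strip() == "":
                tag = None  # a blank line ends the current data item
            else:
                fields[tag] = (fields[tag] + "\n" + line) if fields[tag] else line


def write_selfies_jsonl(
    records: Iterable[Dict[str, str]],
    out: TextIO,
    to_selfies: Optional[Callable[[str], str]] = None,
    smiles_tag: str = "PUBCHEM_OPENEYE_CAN_SMILES",  # assumed tag name
    id_tag: str = "PUBCHEM_COMPOUND_CID",  # assumed tag name
) -> int:
    """Write one JSON object per line; return the number of rows written."""
    if to_selfies is None:
        import selfies  # third-party: pip install selfies

        to_selfies = selfies.encoder
    written = 0
    for rec in records:
        smiles = rec.get(smiles_tag)
        if not smiles:
            continue  # record has no SMILES field
        try:
            encoded = to_selfies(smiles)
        except Exception:
            continue  # skip molecules the encoder cannot handle
        row = {"cid": rec.get(id_tag), "smiles": smiles, "selfies": encoded}
        out.write(json.dumps(row) + "\n")
        written += 1
    return written
```

A possible invocation, assuming a local, already decompressed file named `compounds.sdf` (a hypothetical name): open it for reading, open the output with `open("compounds.jsonl", "w")`, and call `write_selfies_jsonl(iter_sdf_fields(sdf_file), out_file)`. Streaming record by record keeps memory use flat, which matters because PubChem SDF dumps are large.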
