-
Notifications
You must be signed in to change notification settings - Fork 2
Molecular Data Provider
The Molecular Data Provider, known as MolePro, is part of the NCATS Biomedical Data Translator project. MolePro integrates diverse chemical biology data sources and provides a curated framework for chemical entities, enhancing our understanding of their effects on biological targets.
Key features of MolePro include:
-
Data Integration: MolePro consolidates information from various public chemical biology datasets, portals, and tools to offer comprehensive insights into compounds and their protein targets. This includes similarity-based connections.
-
Unbiased Approach: We adhere to a stringent policy to avoid expert bias, ensuring true innovation. Our process involves impartial data scouting, efficient architecture building, and meticulous data provenance capturing.
-
Unified Framework: The result is a cohesive framework for compounds and targets, facilitating the identification of new compounds or targets through "guilt-by-association" inferences from large datasets.
-
Chemical Biology Expertise: MolePro contributes to the Translator project with expertise with probes in drug-discovery, toxicity flagging, and small-molecule mechanism of action elucidation.
MolePro harnesses powerful and diverse datasets to support research and development in chemical biology, ultimately aiming to deepen our understanding and application of chemical entities in biomedical science.
Vlado Dancik (@vdancik)
MolePro was designed to provide the same molecular data knowledge in a choice of two APIs to cater to various user preferences: the MolePro API and the TRAPI.
We have made MolePro's knowledge graph accessible to the scientific community through the MolePro API hosted on two servers:
- NCATS Translator server: https://molepro.transltr.io/molecular_data_provider/api
- Broad Institute server: https://translator.broadinstitute.org/molecular_data_provider/api
MolePro can communicate biomedical questions and answers using the standardized Translator Reasoner API (TRAPI). Adhering to the TRAPI standard, MolePro integrates the query graph, knowledge graph, and results into a single HTTP message. Consequently, MolePro's knowledge graph is accessible to the scientific community through the TRAPI hosted on two servers:
- NCATS Translator server: https://molepro-trapi.transltr.io/molepro/trapi/v1.5/ui/
- Broad Institute server: https://translator.broadinstitute.org/molepro/trapi/v1.5/ui/
MolePro's knowledge graph is constructed through extensive data scouting and careful data wrangling, integrating information from over two dozen discovered knowledge sources, including:
- BiGG Models http://bigg.ucsd.edu/
- BigGIM http://biggim.ncats.io/api/
- BindingDB https://www.bindingdb.org/
- ChEBI https://www.ebi.ac.uk/chebi
- ChemBank http://chembank.broadinstitute.org/
- ChEMBL https://www.ebi.ac.uk/chembl/
- CMAP https://clue.io/
- CTD http://ctdbase.org/
- CTRP http://portals.broadinstitute.org/ctrp/
- DepMap https://depmap.org/portal/
- DGIdb http://dgidb.org/
- DrugBank https://www.drugbank.ca/
- DrugCentral http://drugcentral.org/privacy
- Drug Repurposing Hub https://clue.io/repurposing
- DSSTox https://www.epa.gov/comptox-tools/distributed-structure-searchable-toxicity-dsstox-database
- Gelinea https://github.com/broadinstitute/GeLiNEA
- GtoPdb https://www.guidetopharmacology.org/
- HGNC https://www.genenames.org
- HMDB https://www.hmdb.ca/
- Inxight:Drugs https://drugs.ncats.io/
- Kinomescan https://lincs.hms.harvard.edu/kinomescan/
- MSigDB https://www.gsea-msigdb.org/gsea/msigdb/index.jsp
- PharmGKB https://www.pharmgkb.org/
- Pharos https://pharos.nih.gov
- ProbeMiner https://probeminer.icr.ac.uk/#/
- PubChem https://pubchem.ncbi.nlm.nih.gov/
- Reactome https://reactome.org/
- RxNorm https://www.nlm.nih.gov/research/umls/rxnorm/index.html
- SIDER http://sideeffects.embl.de/
- STITCH http://stitch.embl.de
- STRING https://string-db.org/
For additional details about ingested data source see Catalog of MolePro knowledge sources (JSON formatted output).