The Disease Normalizer resolves ambiguous references and descriptions of human diseases to consistently-structured, normalized terms. For concepts extracted from NCIt, Mondo Disease Ontology, The Human Disease Ontology, OMIM, and OncoTree, it designates a CURIE, and provides additional metadata like aliases and cross-references.
Documentation · Installation · API reference
The Disease Normalizer is available via PyPI:
python3 -m pip install disease-normalizer
See installation instructions in the documentation for a description of installation options and data setup requirements.
Use the live service to programmatically normalize disease terms, as in the following truncated example:
$ curl -s 'https://normalize.cancervariants.org/disease/normalize?q=liver%20cancer' | python -m json.tool
{
"query": "liver cancer",
"warnings": null,
"match_type": 80,
"disease": {
"conceptType": "Disease",
"primaryCode": "ncit:C34803",
"id": "normalize.disease:liver%20cancer",
"label": "Primary Malignant Liver Neoplasm",
# ...
}
}
Or utilize the Python API for fast local access:
>>> from disease.query import QueryHandler
>>> from disease.database import create_db
>>> q = QueryHandler(create_db())
>>> result = q.normalize("NSCLC")
>>> result.disease.primaryCode.root
'ncit:C2926'
We welcome bug reports, feature requests, and code contributions from users and interested collaborators. The documentation contains guidance for submitting feedback and contributing new code.