ClinVar Protein HGVS not being returned correctly? #128

Open
kkrysiak opened this issue Jul 22, 2021 · 3 comments · Fixed by #130

@kkrysiak

While pulling data for CIViC, we noticed that the protein HGVS is missing from `clinvar.hgvs`, even though ClinVar definitely has it for this variant. At least, I thought it used to be available.

https://myvariant.info/v1/variant/chr7:g.140453136A%3ET?fields=clinvar.hgvs

https://www.ncbi.nlm.nih.gov/clinvar/variation/13961/
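
For anyone who wants to check other variants for the same gap, here is a minimal sketch. It only builds the query URL shown above and inspects an already-decoded JSON response. The key names `coding`, `genomic`, and `protein` under `clinvar.hgvs` are an assumption about the response layout, and both helper functions are hypothetical, not part of any MyVariant.info client.

```python
from urllib.parse import quote

BASE_URL = "https://myvariant.info/v1/variant/"


def build_query_url(hgvs_id, fields="clinvar.hgvs"):
    """Build a single-variant query URL, percent-encoding '>' in the HGVS id."""
    # Keep ':' literal so the result matches the URL form used in this issue.
    return f"{BASE_URL}{quote(hgvs_id, safe=':')}?fields={fields}"


def missing_hgvs_kinds(doc, expected=("coding", "genomic", "protein")):
    """Return the expected clinvar.hgvs keys that are absent from a response.

    The key names in `expected` are assumed, not confirmed by the API docs.
    """
    hgvs = doc.get("clinvar", {}).get("hgvs", {})
    return [kind for kind in expected if kind not in hgvs]


if __name__ == "__main__":
    print(build_query_url("chr7:g.140453136A>T"))
    # Placeholder document shaped like a response that lacks the protein entry.
    sample = {"clinvar": {"hgvs": {"coding": "<coding>", "genomic": "<genomic>"}}}
    print(missing_hgvs_kinds(sample))
```

Fetching the URL with any HTTP client and passing the decoded JSON to `missing_hgvs_kinds` would flag documents where the protein entry was dropped.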

@erikyao
Contributor

erikyao commented Aug 10, 2021

@kkrysiak, it's confirmed that there is a hidden bug in our ClinVar data parser. Will fix soon. Thank you for pointing it out!

@erikyao
Contributor

erikyao commented Aug 10, 2021

@newgene It's caused by the ill-timed break statement on clinvar_xml_parser.py#L156. Some attributes (not only proteins) will be discarded after the break statement is executed.

A simple fix is to delete this break, but to avoid such problems in the future, a bit of refactoring might be necessary.
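
To illustrate the failure mode, here is a rough sketch of how an early `break` can silently drop attributes. This is not the actual code at clinvar_xml_parser.py#L156; the function names, the `(type, value)` tuple layout, and the attribute type strings are all hypothetical placeholders.

```python
def collect_with_early_break(attributes):
    """Buggy pattern: leaves the loop after the first HGVS attribute."""
    collected = {}
    for attr_type, value in attributes:
        if attr_type.startswith("HGVS"):
            collected.setdefault(attr_type, []).append(value)
            break  # ill-timed: every attribute after this one is never visited
    return collected


def collect_all(attributes):
    """Same loop without the break: every HGVS attribute is kept."""
    collected = {}
    for attr_type, value in attributes:
        if attr_type.startswith("HGVS"):
            collected.setdefault(attr_type, []).append(value)
    return collected


if __name__ == "__main__":
    # Placeholder attribute list; the values are not real ClinVar data.
    attrs = [
        ("HGVS, coding", "coding-value"),
        ("HGVS, genomic", "genomic-value"),
        ("HGVS, protein", "protein-value"),
    ]
    print(sorted(collect_with_early_break(attrs)))
    print(sorted(collect_all(attrs)))
```

In this sketch the buggy variant keeps only whichever attribute happens to come first, which is consistent with protein (and other) entries going missing depending on their order in the record.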

@erikyao
Contributor

erikyao commented Aug 12, 2021

Fix merged to codebase; waiting for new data release.

erikyao reopened this Aug 12, 2021