-
Notifications
You must be signed in to change notification settings - Fork 21
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How DNAScope improves variant candidate detection? #10
Comments
We have a preprint describing DNAscope's methodology at, https://www.biorxiv.org/content/10.1101/2022.05.20.492556v1. DNAscope is very similar to Sentieon's Haplotyper (Sentieon's implementation of the GATK's HaplotypeCaller), but has three major improvements:
The improved local assembler allows DNAscope to call variants that cannot be called by Haplotyper due to limitations of the local assembler. The ML-based genotyping can also rescue incorrect genotype calls from the Bayesian statistical model, which can also help reduce FNs. Besides the assembly and genotyping improvements, DNAscope does perform a very sensitive first-pass of variant calling followed by a filtering step. This also helps to greatly reduce FNs. Best, |
Thanks for your kindly reply. I do have a follow-up question regarding the sevaral major improvements. Have you tested the performance of these components individually? Could you please share some of statistically results on that if possible? Best, |
We've studied the impact of the improved local assembler quite extensively. The new local assembler rescues some variants were not previously called due to errors in the assembly. However, the new assembler also calls some additional false-positive variants, mostly due to variants from chimeric/spurious alignments. The combined effect is that new assembler has a higher recall but lower precision (relative to the original assembler) when compared to the GIAB truthset. Combining the improved local assembler (improving sensitivity) with the ML model for genotyping/filtering (improving precision) are both necessary for obtaining the highest variant calling accuracy. I can share some slides with some specific variants and overall metrics relative to the GIAB truthset through email, if you'd like to reach out to me at [email protected]. |
That would be great if you could share slides about it, much appreciated! |
Slides are sent! |
Hi @DonFreed
I found out that there are less FNs in VCFs called by DNAScope, which is impressive. Runing GATK haplotypecaller with sensitive setting did not performs well as DNAScope in terms of FN variants.
So can I draw such a conclusion that DNAScope is not just a re-implemention of GATK haplotypecaller with parameters set sensitively?
The text was updated successfully, but these errors were encountered: