You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is your feature request related to a problem? Please describe.
Currently VCFs containing multi-allelic sites need to be decomposed, whereas these are supported by vep. This limitation makes direct benchmark comparisons more limited.
Describe the solution you'd like
mehari should support multi allelics and simply decompose these while processing. As the used vcf parsing library already fully supports parsing multi-allelics, only changes in mehari should be necessary.
Describe alternatives you've considered
Otherwise preprocessing using bcftools using e.g. bcftools norm -m- -a is necessary. As mehari also doesn't support directly reading from stdin, a write to disk is always necessary. This has significant impact on overall performance for non-normalized vcf.
Additional context
If possible performance penalty to normalized VCFs should be kept close to zero. This needs to be established when making changes.
The text was updated successfully, but these errors were encountered:
Is your feature request related to a problem? Please describe.
Currently VCFs containing multi-allelic sites need to be decomposed, whereas these are supported by
vep
. This limitation makes direct benchmark comparisons more limited.Describe the solution you'd like
mehari should support multi allelics and simply decompose these while processing. As the used vcf parsing library already fully supports parsing multi-allelics, only changes in mehari should be necessary.
Describe alternatives you've considered
Otherwise preprocessing using bcftools using e.g.
bcftools norm -m- -a
is necessary. As mehari also doesn't support directly reading from stdin, a write to disk is always necessary. This has significant impact on overall performance for non-normalized vcf.Additional context
If possible performance penalty to normalized VCFs should be kept close to zero. This needs to be established when making changes.
The text was updated successfully, but these errors were encountered: