Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

VCF deviates from the VCF v4.2 specification #387

Open
mhkc opened this issue Aug 19, 2024 · 2 comments
Open

VCF deviates from the VCF v4.2 specification #387

mhkc opened this issue Aug 19, 2024 · 2 comments

Comments

@mhkc
Copy link

mhkc commented Aug 19, 2024

Hi Jody,

The VCF produced by TbProfiler v6.3 violates the VCF v4.2 specification which might cause issues with parser and genome browsers. TbProfiler reports GQ values as Floats but the specification mandate that they should be Integers.

GQ (Integer): Conditional genotype quality, encoded as a phred quality −10log10 p(genotype call is wrong,
conditioned on the site’s being variant)

Abbreviated example VCF

##fileformat=VCFv4.2
...
##FORMAT=<ID=GQ,Number=1,Type=Float,Description="Genotype Quality, the Phred-scaled marginal (or unconditional) probability of the called genotype">
...
#CHROM  POS ID  REF ALT QUAL    FILTER  INFO    FORMAT  test1_240816_nb000000_0000_test
NC_000962.3 2784611 DEL00000028 T   <DEL>   5670    PASS    PRECISE;SVTYPE=DEL;SVMETHOD=EMBL.DELLYv1.2.6;END=2785969;PE=75;MAPQ=60;CT=3to5;CIPOS=-6,6;CIEND=-6,6;SRMAPQ=60;INSLEN=0;HOMLEN=5;SR=20;SRQ=1;CONSENSUS=CGGCGCGAATTGCTGGCCACCCGGAACTTGACGACCTCTTGATCACCGACTTTGCGGCGCTGCAAAT    CGTTGACGATGTGACCGACCACGGTCAGTGGCGTTTCGAACATTTGCTCATTCCTTTCCTAGTTGCGTTGGCACAGTTGCGTTGGCACCGGGTGATTCCGCGAACTGCCCACGCATATGC;CE=1.97451;CONSBP=92;AC=2;AN=2   GT:GL:GQ:FT:RCL:RC:RCR:RDCN:DR:DV:RR:RV 1/1:-395.943,-39.6786,0:10000:PASS:419:45:380:0:0:75:0:132
@jodyphelan
Copy link
Owner

Hi @mhkc

Thanks for letting me know. Tb-profiler itself does not create the VCFs. I've just checked the tools and it looks like the issue is with freebayes, so perhaps opening an issue there would be useful.

@jodyphelan
Copy link
Owner

On looking at freebayes a bit closer I noticed the --strict-vcf option which forces GQ to be an int. I'll add this into tb-profiler when the call to freebayes is made.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants