merging vcf files from the same sample before VQSR and CNV #6

Open
grendon opened this issue Jun 28, 2022 · 1 comment

Comments

grendon commented Jun 28, 2022

The samples were sequenced in two batches at different points in time. Each batch was analyzed separately with the Sentieon pipeline, and we now have two VCFs per sample. How can we merge the calls in the VCFs before running VQSR and CNV on each sample?

@DonFreed
Contributor

Merging information from the two VCFs is somewhat complex. There will be some discordant calls and it is not clear how the discordant calls should be handled.

Instead of merging the VCFs, you might pass the BAM files from both batches to the variant caller so that it can take advantage of the read information in both datasets to make the most accurate variant calls. If the BAM files from both batches have the same sample readgroup (RGSM), you can pass both BAM files directly to the variant caller:

sentieon driver -i <sample_batch1.bam> -i <sample_batch2.bam> -r <ref> ... \
  --algo Haplotyper ... sample_jointCalls.vcf.gz
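Before passing both BAM files to the variant caller, it may help to confirm that they carry the same sample name. The following is a minimal sketch, not part of the Sentieon tooling: `list_sm` is a hypothetical helper that prints the distinct `SM` values from the `@RG` header lines it reads on stdin, and the BAM file names in the usage comment are placeholders.

```shell
# Hypothetical helper: print the distinct SM values found in the
# @RG lines of a SAM header read from stdin.
list_sm() {
  awk -F'\t' '/^@RG/ {
    for (i = 2; i <= NF; i++)
      if ($i ~ /^SM:/) print substr($i, 4)   # drop the "SM:" prefix
  }' | sort -u
}

# Usage (requires samtools; file names are placeholders):
#   samtools view -H sample_batch1.bam | list_sm
#   samtools view -H sample_batch2.bam | list_sm
```

If both commands print the same single sample name, the two files should be usable together as shown above.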

If the BAM files from the two batches have different RGSM tags, you might use samtools reheader to replace the RGSM information in one of the BAM files.
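One way to do the replacement is sketched below, assuming `samtools` is on the PATH. The function names `set_sm` and `reheader_sm`, the sample name, and the file names are all hypothetical; the sketch edits the `SM` field of every `@RG` line in the header and streams the edited header into `samtools reheader`.

```shell
# Hypothetical helper: rewrite the SM field of every @RG line in a
# SAM header read from stdin. All other header lines pass through unchanged.
set_sm() {
  awk -v sm="$1" 'BEGIN { FS = OFS = "\t" }
    /^@RG/ { for (i = 2; i <= NF; i++) if ($i ~ /^SM:/) $i = "SM:" sm }
    { print }'
}

# Hypothetical wrapper: write a copy of a BAM whose header carries the
# new sample name. Arguments: new sample name, input BAM, output BAM.
reheader_sm() {
  samtools view -H "$2" | set_sm "$1" | samtools reheader - "$2" > "$3"
}

# Usage (placeholder names):
#   reheader_sm sample1 sample_batch2.bam sample_batch2.rgsm.bam
#   samtools index sample_batch2.rgsm.bam
```

Only the header changes, so the reads themselves are not modified; the output still needs its own index before it is passed to the variant caller.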

The result would be similar to starting from the FASTQ files for both batches and then processing the data with the multi-FASTQ.sh script.
