-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Assess Popgen48/scalepopgen #80
Comments
Requirement for input files.
vcf_input.csv: sample.map: Splitting the VCF file by chromosomes |
Downloaded supplementary data from https://doi.org/10.1093/molbev/msad207 and followed EurasianOtter_PopGen.html to obtain vcf.gz files and rename samples, and select only autosomes and bialleleic SNPs for analyses. Split the vcf file by chromosomes using bcftools. Ran "nextflow run scalepopgen -profile singularity -params-file /global/scratch/users/hangxue/otter/vcf_publication/jul4_parameters.yml -qs 10". See output graphs at https://docs.google.com/presentation/d/1O8vFmYImrJd6p4pvSLyzwiMsf9fTAZSTaG_FJGLz8t8/edit#slide=id.p |
Tested PCA, Admixture, Pairwise Fst and Treemix in scalepopgen. These can run successfully with little modifications. Scalepopgen can also do Tajimas_D and search for selective sweeps selection (Sweepfinder2), but plotting the these two results requires the type of the chromosome name being integer. Out of these, Sweepfinder2 takes the longest, ~7hr for the otter data, followed by admixture ~1hr.
|
I have doubled checked the label. I think the ones I am working with is labeled correctly. Yeah, I think the difference might be due to different softwares / parameters |
We need to review how much of our population genomics ideas Popgen48/scalepopgen can do to determine:
Links: poster
Summary
Next developments
Based on the tests above, to use scalepopgen, we would want to:
The text was updated successfully, but these errors were encountered: