Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Wrong scaffold coverage calculation when using coverage file #39

Open
adityabandla opened this issue Oct 11, 2019 · 5 comments
Open

Wrong scaffold coverage calculation when using coverage file #39

adityabandla opened this issue Oct 11, 2019 · 5 comments

Comments

@adityabandla
Copy link

I ran the refinem scaffold_stats & outliers command with a custom coverage file using the --coverage_file option. Some contigs got flagged as having zero coverage across all samples.

However, when I had a look at the coverage file, the contig did have coverage values across several samples. Second, I had a look at the scaffold coverage value in the outliers.tsv file for this contig and it did not correspond with the mean or median for that contig. I am a bit lost as to what is going on here

@donovan-h-parks
Copy link
Owner

Hi. If you can send me the relevant data via email (donovan.parks [at] gmail.com) I can take a look. Please indicate the exact command you are running and one or more contigs that look problematic.

@adityabandla
Copy link
Author

Sent!

@adityabandla
Copy link
Author

The issue with zero coverage goes away when the coverage file has column names. However, the calculation of scaffold_coverage in outliers.tsv still seems off

@donovan-h-parks
Copy link
Owner

Are all the column names unique? This is also a requirement.

@adityabandla
Copy link
Author

Yes, the column names are all unique. I am just trying to understand the math behind the column scaffold coverage

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants