Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

--cov_corr value #19

Open
palomo11 opened this issue Feb 26, 2018 · 1 comment
Open

--cov_corr value #19

palomo11 opened this issue Feb 26, 2018 · 1 comment

Comments

@palomo11
Copy link

palomo11 commented Feb 26, 2018

Hi,

I'm using RefineM with my genomes and 24 samples.

In the readme, it is suggested that:

"If you have more than 6 data point (i.e. BAM files) comprising your coverage profiles you may wish to consider using the coverage correlation criteria (--cov_corr) instead of or in addition to this absolute error criteria"

Which would be a recommended value to be used? 0.95?
Is it better to combine both or just use cov_corr?

In addition, could you explain a bit how exactly the cov_perc and the cov_corr are calculated?

Thank you very much in advance.

@donovan-h-parks
Copy link
Owner

Hello.

I'm not sure about the best threshold to use. I haven't had a chance to play with data where this filtering is relevant. It really depends on how conservative you want to be. My gut feeling is something a bit more lenient than 0.95 though. Perhaps 0.8???

The median coverage of a bin is the median across all contigs comprising a bin.

cov_corr: Pearson's r between the median coverage of a bin in each sample vs. the coverage of each individual contig.

cov_corr: This is the mean absolute error in coverage between a contig and the median coverage of a bin taken over all samples. For each sample this is given by: abs(coverage_contig - median_coverage_bin) * 100 / median_coverage_bin.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants