You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+92-12
Original file line number
Diff line number
Diff line change
@@ -1,11 +1,14 @@
1
1
# DAS Tool for genome resolved metagenomics
2
2
3
-

3
+

4
4
5
5
DAS Tool is an automated method that integrates the results of a flexible number of binning algorithms to calculate an optimized, non-redundant set of bins from a single assembly.
6
6
7
+
# Reference
8
+
9
+
Christian M. K. Sieber, Alexander J. Probst, Allison Sharrar, Brian C. Thomas, Matthias Hess, Susannah G. Tringe & Jillian F. Banfield (2018). [Recovery of genomes from metagenomes via a dereplication, aggregation and scoring strategy.](https://www.nature.com/articles/s41564-018-0171-1) Nature Microbiology. [https://doi.org/10.1038/s41564-018-0171-1.](https://doi.org/10.1038/s41564-018-0171-1)
- Summary of output bins including quality and completeness estimates (DASTool_summary.txt).
67
-
-Scaffold to bin file of output bins (DASTool_scaffolds2bin.txt).
72
+
-Scaffolds to bin file of output bins (DASTool_scaffolds2bin.txt).
68
73
- Quality and completeness estimates of input bin sets, if ```--write_bin_evals 1``` is set ([method].eval).
69
74
- Plots showing the amount of high quality bins and score distribution of bins per method, if ```--create_plots 1``` is set (DASTool_hqBins.pdf, DASTool_scores.pdf).
70
75
- Bins in fasta format if ```--write_bins 1``` is set (DASTool_bins).
Christian M. K. Sieber, Alexander J. Probst, Allison Sharrar, Brian C. Thomas, Matthias Hess, Susannah G. Tringe & Jillian F. Banfield (2018). [Recovery of genomes from metagenomes via a dereplication, aggregation and scoring strategy.](https://www.nature.com/articles/s41564-018-0171-1) Nature Microbiology. [https://doi.org/10.1038/s41564-018-0171-1.](https://doi.org/10.1038/s41564-018-0171-1)
140
+
# Installation of dependent R-packages
141
+
142
+
```
143
+
$ R
144
+
> repo='http://cran.us.r-project.org' #select a repository
Not all binning tools provide results in a tab separated file of scaffold-IDs and bin-IDs. A helper script can be used to convert a set of bins in fasta format to tabular scaffold2bin file, which can be used as input for DAS Tool: `src/Fasta_to_Scaffolds2Bin.sh -h`.
164
+
165
+
### Usage:
166
+
```
167
+
Fasta_to_Scaffolds2Bin: Converts genome bins in fasta format to scaffolds-to-bin table.
168
+
(DAS Tool helper script)
169
+
170
+
Usage: Fasta_to_Scaffolds2Bin.sh -e fasta > my_scaffolds2bin.tsv
171
+
172
+
-e, --extension Extension of fasta files. (default: fasta)
173
+
-i, --input_folder Folder with bins in fasta format. (default: ./)
174
+
-h, --help Show this message.
175
+
```
176
+
177
+
### Example: Converting MaxBin fasta output into tab separated scaffolds2bin file:
$ src/Fasta_to_Scaffolds2Bin.sh -i /maxbin/output/folder -e fasta > maxbin.scaffolds2bin.tsv
183
+
184
+
$ head gut_maxbin2_scaffolds2bin.tsv
185
+
NODE_10_length_127450_cov_375.783524 maxbin.001
186
+
NODE_27_length_95143_cov_427.155298 maxbin.001
187
+
NODE_51_length_78315_cov_504.322425 maxbin.001
188
+
NODE_84_length_66931_cov_376.684775 maxbin.001
189
+
NODE_87_length_65653_cov_460.202156 maxbin.001
190
+
```
191
+
192
+
Some binning tools (such as CONCOCT) provide a comma separated tabular output. To convert a comma separated file into a tab separated file a one liner can be used: `perl -pe "s/,/\t/g;" scaffolds2bin.csv > scaffolds2bin.tsv`.
0 commit comments