|
2 | 2 | Usage
|
3 | 3 | =====
|
4 | 4 |
|
5 |
| -CONCOCT uses several command line options to control the clustering, here is a complete documentation of these. These can also be viewed by typing ``concoct -h`` on the command line.:: |
6 |
| - |
7 |
| - usage: concoct [-h] [--coverage_file COVERAGE_FILE] |
8 |
| - [--composition_file COMPOSITION_FILE] [-c CLUSTERS] |
9 |
| - [-k KMER_LENGTH] [-l LENGTH_THRESHOLD] [-r READ_LENGTH] |
10 |
| - [--total_percentage_pca TOTAL_PERCENTAGE_PCA] [-b BASENAME] |
11 |
| - [-s SEED] [-i ITERATIONS] [-e EPSILON] [--no_cov_normalization] |
12 |
| - [--no_total_coverage] [-o] [-d] [-v] |
13 |
| - |
14 |
| - optional arguments: |
15 |
| - -h, --help show this help message and exit |
16 |
| - --coverage_file COVERAGE_FILE |
17 |
| - specify the coverage file, containing a table where |
18 |
| - each row correspond to a contig, and each column |
19 |
| - correspond to a sample. The values are the average |
20 |
| - coverage for this contig in that sample. All values |
21 |
| - are separated with tabs. |
22 |
| - --composition_file COMPOSITION_FILE |
23 |
| - specify the composition file, containing sequences in |
24 |
| - fasta format. It is named the composition file since |
25 |
| - it is used to calculate the kmer composition (the |
26 |
| - genomic signature) of each contig. |
27 |
| - -c CLUSTERS, --clusters CLUSTERS |
28 |
| - specify maximal number of clusters for VGMM, default |
29 |
| - 400. |
30 |
| - -k KMER_LENGTH, --kmer_length KMER_LENGTH |
31 |
| - specify kmer length, default 4. |
32 |
| - -l LENGTH_THRESHOLD, --length_threshold LENGTH_THRESHOLD |
33 |
| - specify the sequence length threshold, contigs shorter |
34 |
| - than this value will not be included. Defaults to |
35 |
| - 1000. |
36 |
| - -r READ_LENGTH, --read_length READ_LENGTH |
37 |
| - specify read length for coverage, default 100 |
38 |
| - --total_percentage_pca TOTAL_PERCENTAGE_PCA |
39 |
| - The percentage of variance explained by the principal |
40 |
| - components for the combined data. |
41 |
| - -b BASENAME, --basename BASENAME |
42 |
| - Specify the basename for files or directory where |
43 |
| - outputwill be placed. Path to existing directory or |
44 |
| - basenamewith a trailing '/' will be interpreted as a |
45 |
| - directory.If not provided, current directory will be |
46 |
| - used. |
47 |
| - -s SEED, --seed SEED Specify an integer to use as seed for clustering. 0 |
48 |
| - gives a random seed, 1 is the default seed and any |
49 |
| - other positive integer can be used. Other values give |
50 |
| - ArgumentTypeError. |
51 |
| - -i ITERATIONS, --iterations ITERATIONS |
52 |
| - Specify maximum number of iterations for the VBGMM. |
53 |
| - Default value is 500 |
54 |
| - -e EPSILON, --epsilon EPSILON |
55 |
| - Specify the epsilon for VBGMM. Default value is 1.0e-6 |
56 |
| - --no_cov_normalization |
57 |
| - By default the coverage is normalized with regards to |
58 |
| - samples, then normalized with regards of contigs and |
59 |
| - finally log transformed. By setting this flag you skip |
60 |
| - the normalization and only do log transorm of the |
61 |
| - coverage. |
62 |
| - --no_total_coverage By default, the total coverage is added as a new |
63 |
| - column in the coverage data matrix, independently of |
64 |
| - coverage normalization but previous to log |
65 |
| - transformation. Use this tag to escape this behaviour. |
66 |
| - -o, --converge_out Write convergence info to files. |
67 |
| - -d, --debug Debug parameters. |
68 |
| - -v, --version show program's version number and exit |
| 5 | +CONCOCT uses several command line options to control the clustering, here is a |
| 6 | +complete documentation of these. These can also be viewed by typing ``concoct |
| 7 | +-h`` on the command line: |
69 | 8 |
|
| 9 | +.. program-output:: (echo 'import conf'; cat ../../concoct/parser.py; echo 'args=arguments()') | python - --help |
| 10 | + :shell: |
0 commit comments