diff --git a/HISTORY.md b/HISTORY.md index 5a97c3c..f7b8561 100644 --- a/HISTORY.md +++ b/HISTORY.md @@ -1,6 +1,18 @@ History ======= +1.5.0 (2023-09-20) +------------------ + +* Adds support for `pyrodigal-gv` implementing `prodigal-gv` as a gene predictor ([pyrodigal-gv](https://github.com/althonos/pyrodigal-gv) and [prodigal-gv](https://github.com/apcamargo/prodigal-gv)). This can be specified with `-g prodigal-gv`. +* Thanks to @[althonos](https://github.com/althonos) and @[apcamargo](https://github.com/apcamargo) for making this possible, and to @[asierFernandezP](https://github.com/asierFernandezP) for raising this as an issue in the first place [here](https://github.com/gbouras13/pharokka/issues/290) in #290. +* Adds checks to determine if your input FASTA has duplicated contig headers from #293 [here](https://github.com/gbouras13/pharokka/issues/293). Thanks @[thauptfeld](https://github.com/thauptfeld) for raising this. +* `-g prodigal` and `-g prodigal-gv` should be much faster thanks to multithread support added by the inimitable @althonos. +* Genbank format output will be designated with PHG not VRL (following this issue https://github.com/RyanCook94/inphared/issues/22). +* The `_length_gc_cds_density.tsv` and `_cds_final_merged_output.tsv` files now contain the translation table/genetic code for each contig (usually 11 but now not always if you use `pyrodigal-gv`). +* `--skip_mash` flag added to skip finding the closest match for each contig in INPHARED using mash. +* `--skip_extra_annotations` flag added to skip running tRNA-scanSE, MinCED and Aragorn in case you only want CDS predictions and functional annotations. + 1.4.1 (2023-09-04) ------------------ diff --git a/README.md b/README.md index 2969e9c..69ebfd2 100644 --- a/README.md +++ b/README.md @@ -23,6 +23,35 @@ Extra special thanks to Ghais Houtak for making Pharokka's logo. 
If you are looking for rapid standardised annotation of bacterial genomes, please use [Bakta](https://github.com/oschwengers/bakta). [Prokka](https://github.com/tseemann/prokka), which inspired the creation and naming of `pharokka`, is another good option, but Bakta is [Prokka's worthy successor](https://twitter.com/torstenseemann/status/1565471892840259585). +# Table of Contents + +- [pharokka](#pharokka) + - [Fast Phage Annotation Tool](#fast-phage-annotation-tool) +- [Table of Contents](#table-of-contents) +- [Quick Start](#quick-start) +- [Documentation](#documentation) +- [Paper](#paper) +- [Pharokka with Galaxy Europe Webserver](#pharokka-with-galaxy-europe-webserver) +- [Brief Overview](#brief-overview) + - [Pharokka v 1.5.0 Update (20 September 2023)](#pharokka-v-150-update-20-september-2023) + - [Pharokka v 1.4.0 Update (27 August 2023)](#pharokka-v-140-update-27-august-2023) + - [Pharokka v 1.3.0 Update](#pharokka-v-130-update) +- [Installation](#installation) + - [Conda Installation](#conda-installation) + - [Pip](#pip) + - [Source](#source) +- [Database Installation](#database-installation) +- [Beginner Conda Installation](#beginner-conda-installation) +- [Usage](#usage) +- [Version Log](#version-log) +- [System](#system) +- [Time](#time) +- [Benchmarking v1.5.0](#benchmarking-v150) +- [Benchmarking v1.4.0](#benchmarking-v140) +- [Original Benchmarking (v1.1.0)](#original-benchmarking-v110) +- [Bugs and Suggestions](#bugs-and-suggestions) +- [Citation](#citation) + # Quick Start The easiest way to install `pharokka` is via conda: @@ -65,11 +94,22 @@ So if you can't get `pharokka` to install on your machine for whatever reason or pharokka Workflow

-`pharokka` uses [PHANOTATE](https://github.com/deprekate/PHANOTATE), the only gene prediction program tailored to bacteriophages, as the default program for gene prediction. [Prodigal](https://github.com/hyattpd/Prodigal) is also available as an alternative. Following this, functional annotations are assigned by matching each predicted coding sequence (CDS) to the [PHROGs](https://phrogs.lmge.uca.fr), [CARD](https://card.mcmaster.ca) and [VFDB](http://www.mgc.ac.cn/VFs/main.htm) databases using [MMseqs2](https://github.com/soedinglab/MMseqs2). As of v1.4.0, `pharokka` will also match each CDS to the PHROGs database using more sensitive Hidden Markov Models using [PyHMMER](https://github.com/althonos/pyhmmer). Pharokka's main output is a GFF file suitable for using in downstream pangenomic pipelines like [Roary](https://sanger-pathogens.github.io/Roary/). `pharokka` also generates a `cds_functions.tsv` file, which includes counts of CDSs, tRNAs, tmRNAs, CRISPRs and functions assigned to CDSs according to the PHROGs database. See the full [usage](#usage) and check out the full [documentation](https://pharokka.readthedocs.io) for more details. +`pharokka` uses [PHANOTATE](https://github.com/deprekate/PHANOTATE), the only gene prediction program tailored to bacteriophages, as the default program for gene prediction. [Prodigal](https://github.com/hyattpd/Prodigal) implemented with [pyrodigal](https://github.com/althonos/pyrodigal) and [Prodigal-gv](https://github.com/apcamargo/prodigal-gv) implemented with [pyrodigal-gv](https://github.com/althonos/pyrodigal-gv) are also available as alternatives. Following this, functional annotations are assigned by matching each predicted coding sequence (CDS) to the [PHROGs](https://phrogs.lmge.uca.fr), [CARD](https://card.mcmaster.ca) and [VFDB](http://www.mgc.ac.cn/VFs/main.htm) databases using [MMseqs2](https://github.com/soedinglab/MMseqs2). 
As of v1.4.0, `pharokka` will also match each CDS to the PHROGs database using more sensitive Hidden Markov Models using [PyHMMER](https://github.com/althonos/pyhmmer). Pharokka's main output is a GFF file suitable for using in downstream pangenomic pipelines like [Roary](https://sanger-pathogens.github.io/Roary/). `pharokka` also generates a `cds_functions.tsv` file, which includes counts of CDSs, tRNAs, tmRNAs, CRISPRs and functions assigned to CDSs according to the PHROGs database. See the full [usage](#usage) and check out the full [documentation](https://pharokka.readthedocs.io) for more details. + +## Pharokka v 1.5.0 Update (20 September 2023) + +* Adds support for `pyrodigal-gv` implementing `prodigal-gv` as a gene predictor for alternate genetic codes ([pyrodigal-gv](https://github.com/althonos/pyrodigal-gv) and [prodigal-gv](https://github.com/apcamargo/prodigal-gv)). This can be specified with `-g prodigal-gv` and is recommended for metagenomic input datasets. Thanks to @[althonos](https://github.com/althonos) and @[apcamargo](https://github.com/apcamargo) for making this possible, and to @[asierFernandezP](https://github.com/asierFernandezP) for raising this as an issue in the first place [here](https://github.com/gbouras13/pharokka/issues/290). +* `-g prodigal` and `-g prodigal-gv` should be much faster thanks to multithread support added by the inimitable @[althonos](https://github.com/althonos). +* Adds checks to determine if your input FASTA has duplicated [contig headers](https://github.com/gbouras13/pharokka/issues/293). Thanks @[thauptfeld](https://github.com/thauptfeld) for raising this. +* Genbank format output will be designated with PHG not VRL. +* The `_length_gc_cds_density.tsv` and `_cds_final_merged_output.tsv` files now contain the translation table/genetic code for each contig. +* `--skip_mash` flag added to skip finding the closest match for each contig in INPHARED using mash. 
+* `--skip_extra_annotations` flag added to skip running tRNA-scanSE, MinCED and Aragorn in case you only want CDS predictions and functional annotations. + ## Pharokka v 1.4.0 Update (27 August 2023) -Pharokka v1.4.0 is a large update implementing: +`pharokka` v1.4.0 is a large update implementing: * More sensitive search for PHROGs using Hidden Markov Models (HMMs) using the amazing [PyHMMER](https://github.com/althonos/pyhmmer). * By default, `pharokka` will now run searches using both MMseqs2 (PHROGs, CARD and VFDB) and HMMs (PHROGs). MMseqs2 was kept for PHROGs as it provides more information than the HMM results (e.g. sequence alignment identities & top hit PHROG protein) if it finds a hit. @@ -114,33 +154,6 @@ SAOMS1 phage (GenBank: MW460250.1) was isolated and sequenced by: Yerushalmy, O. Please see [plotting](docs/plotting.md) for details on all plotting parameter options. -# Table of Contents - -- [pharokka](#pharokka) - - [Fast Phage Annotation Tool](#fast-phage-annotation-tool) -- [Quick Start](#quick-start) -- [Documentation](#documentation) -- [Paper](#paper) -- [Pharokka with Galaxy Europe Webserver](#pharokka-with-galaxy-europe-webserver) -- [Brief Overview](#brief-overview) - - [Pharokka v 1.4.0 Update (27 August 2023)](#pharokka-v-140-update-27-august-2023) - - [Pharokka v 1.3.0 Update](#pharokka-v-130-update) -- [Table of Contents](#table-of-contents) -- [Installation](#installation) - - [Conda Installation](#conda-installation) - - [Pip](#pip) - - [Source](#source) -- [Database Installation](#database-installation) -- [Beginner Conda Installation](#beginner-conda-installation) -- [Usage](#usage) -- [Version Log](#version-log) -- [System](#system) -- [Time](#time) -- [Original Benchmarking (v1.1.0)](#original-benchmarking-v110) -- [Benchmarking v1.4.0](#benchmarking-v140) -- [Bugs and Suggestions](#bugs-and-suggestions) -- [Citation](#citation) - # Installation @@ -267,9 +280,9 @@ For a full explanation of all arguments, please see 
[usage](docs/run.md). pharokka defaults to 1 thread. ``` -usage: pharokka.py [-h] [-i INFILE] [-o OUTDIR] [-d DATABASE] [-t THREADS] [-f] [-p PREFIX] [-l LOCUSTAG] [-g GENE_PREDICTOR] [-m] [-s] [-c CODING_TABLE] - [-e EVALUE] [--fast] [--mmseqs2_only] [--meta_hmm] [--dnaapler] [--custom_hmm CUSTOM_HMM] [--genbank] [--terminase] - [--terminase_strand TERMINASE_STRAND] [--terminase_start TERMINASE_START] [-V] [--citation] +usage: pharokka.py [-h] [-i INFILE] [-o OUTDIR] [-d DATABASE] [-t THREADS] [-f] [-p PREFIX] [-l LOCUSTAG] [-g GENE_PREDICTOR] [-m] [-s] [-c CODING_TABLE] [-e EVALUE] [--fast] [--mmseqs2_only] + [--meta_hmm] [--dnaapler] [--custom_hmm CUSTOM_HMM] [--genbank] [--terminase] [--terminase_strand TERMINASE_STRAND] [--terminase_start TERMINASE_START] + [--skip_extra_annotations] [--skip_mash] [-V] [--citation] pharokka: fast phage annotation program @@ -289,13 +302,13 @@ options: -l LOCUSTAG, --locustag LOCUSTAG User specified locus tag for the gff/gbk files. This is not required. A random locus tag will be generated instead. -g GENE_PREDICTOR, --gene_predictor GENE_PREDICTOR - User specified gene predictor. Use "-g phanotate" or "-g prodigal". + User specified gene predictor. Use "-g phanotate" or "-g prodigal" or "-g prodigal-gv" or "-g genbank". Defaults to phanotate (not required unless prodigal is desired). -m, --meta meta mode for metavirome input samples -s, --split split mode for metavirome samples. -m must also be specified. Will output separate split FASTA, gff and genbank files for each input contig. -c CODING_TABLE, --coding_table CODING_TABLE - translation table for prodigal. Defaults to 11. Experimental only. + translation table for prodigal. Defaults to 11. -e EVALUE, --evalue EVALUE E-value threshold for MMseqs2 database PHROGs, VFDB and CARD and PyHMMER PHROGs database search. Defaults to 1E-05. --fast, --hmm_only Runs PyHMMER (HMMs) with PHROGs only, not MMseqs2 with PHROGs, CARD or VFDB. 
@@ -307,13 +320,17 @@ options: --custom_hmm CUSTOM_HMM Run pharokka with a custom HMM profile database suffixed .h3m. Please use create this with the create_custom_hmm.py script. - --genbank Flag denoting that -i/--input is a genbank file instead of the usual FASTA file + --genbank Flag denoting that -i/--input is a genbank file instead of the usual FASTA file. + The CDS calls in this file will be preserved and re-annotated. --terminase Runs terminase large subunit re-orientation mode. Single genome input only and requires --terminase_strand and --terminase_start to be specified. --terminase_strand TERMINASE_STRAND Strand of terminase large subunit. Must be "pos" or "neg". --terminase_start TERMINASE_START Start coordinate of the terminase large subunit. + --skip_extra_annotations + Skips tRNAscan-SE 2, MinCED and Aragorn. + --skip_mash Skips running mash to find the closest match for each contig in INPHARED. -V, --version Print pharokka Version --citation Print pharokka Citation ``` @@ -332,6 +349,53 @@ On a standard 16GB RAM laptop specifying 8 threads, `pharokka` should take betwe In `--fast` mode, it should take 45-75 seconds. +# Benchmarking v1.5.0 + +`pharokka v1.5.0` was run on the 673 crAss phage dataset to showcase the improved CDS prediction of `-g prodigal-gv` for metagenomic datasets where some phages likely have alternative genetic codes (i.e. not 11). + +All benchmarking was conducted on an Intel® Core™ i7-10700K CPU @ 3.80GHz on a machine running Ubuntu 20.04.6 LTS with 8 threads (`-t 8`). `pyrodigal-gv v0.1.0` and `pyrodigal v3.0.0` were used for `-g prodigal-gv` and `-g prodigal` respectively. 
+ +| 673 crAss-like genomes | `pharokka` v1.5.0 `-g prodigal-gv` | `pharokka` v1.5.0 `-g prodigal` | +|------------------------|------------------------------------|----------------------------------| +| Total CDS | **81730** | 91999 | +| Annotated Function CDS | **20344** | 17458 | +| Unknown Function CDS | 61386 | 74541 | +| Contigs with genetic code 15 | 229 | NA | +| Contigs with genetic code 4 | 38 | NA | +| Contigs with genetic code 11 | 406 | 673 | + +Fewer (larger) CDS were predicted more accurately, leading to an increase in the number of coding sequences with annotated functions. Approximately 40% of contigs in this dataset were predicted to use non-standard genetic codes according to `pyrodigal-gv`. + +# Benchmarking v1.4.0 + +`pharokka` v1.4.0 has also been run on phage SAOMS1 and also the same 673 crAss phage dataset to showcase: + +1. The improved sensitivity of gene annotation with PyHMMER and a demonstration of how `--fast` is slower for metagenomes. + * If you can deal with the compute cost (especially for large metagenomes), I highly recommend `--fast` or `--meta_hmm` for metagenomes given how much more sensitive HMM search is. +2. The large speed-up over v1.3.2 with `--fast` for phage isolates - with the proviso that no virulence factors or AMR genes will be detected. +3. The slight speed-up over v1.3.2 with `--mmseqs2_only`. + +All benchmarking was conducted on an Intel® Core™ i7-10700K CPU @ 3.80GHz on a machine running Ubuntu 20.04.6 LTS with 16 threads (`-t 16`). 
+ +SAOMS1 was run with Phanotate + +| Phage SAOMS1 | `pharokka` v1.4.0 `--fast` | `pharokka` v1.4.0 | `pharokka` v1.3.2 | +|------------------------|-----------------------------|-------------------|-----------------| +| Time (min) | 0.70 | 3.73 | 5.08 | +| CDS | 246 | 246 | 246 | +| Annotated Function CDS | 93 | 93 | 92 | +| Unknown Function CDS | 153 | 153 | 154 | + +The 673 crAss-like genomes were run with `-m` (defaults to `--mmseqs2_only` in v 1.4.0) and with `-g prodigal` (pyrodigal v2.1.0). + +| 673 crAss-like genomes | `pharokka` v1.4.0 `--fast` | `pharokka` v1.4.0 `--mmseqs2_only` | `pharokka` v1.3.2 | +|------------------------|---------------------------|----------------------------------|-----------------| +| Time (min) | 35.62 | 11.05 | 13.27 | +| CDS | 91999 | 91999 | 91999 | +| Annotated Function CDS | **16713** | 9150 | 9150 | +| Unknown Function CDS | 75286 | 82849 | 82849 | + + # Original Benchmarking (v1.1.0) `pharokka` (v1.1.0) has been benchmarked on an Intel Xeon CPU E5-4610 v2 @ 2.30 specifying 16 threads. Below is benchamarking comparing `pharokka` run with PHANOTATE and Prodigal against Prokka v1.14.6 run with PHROGs HMM profiles, as modified by Andrew Millard (https://millardlab.org/2021/11/21/phage-annotation-with-phrogs/). @@ -372,35 +436,6 @@ For the crAss-like phage genomes, `pharokka` meta mode `-m` was enabled. If you require fast annotations of extremely large datasets (i.e. thousands of input contigs), running `pharokka` with Prodigal (`-g prodigal`) is recommended. -# Benchmarking v1.4.0 - -`pharokka` v1.4.0 has also been run on phage SAOMS1 and also the same 673 crAss phage dataset to showcase: - -1. The improved sensitivity of gene annotation with PyHMMER and a demonstration of how `--fast` is slower for metagenomes. - * If you can deal with the compute cost (especially for large metagenomes), I highly recommend `--fast` or `--meta_hmm` for metagenomes given how much more sensitive HMM search is. -2. 
The large speed-up over v1.3.2 with `--fast` for phage isolates - with the proviso that no virulence factors or AMR genes will be detected. -3. The slight speed-up over v1.3.2 with `--mmseqs2_only`. - -All benchmarking was conducted on a Intel® Core™ i7-10700K CPU @ 3.80GHz on a machine running Ubuntu 20.04.6 LTS with 16 threads (`-t 16`). - -SAOMS1 was run with Phanotate - -| Phage SAOMS1 | `pharokka` v1.4.0 `--fast` | `pharokka` v1.4.0 | `pharokka` v1.3.2 | -|------------------------|-----------------------------|-------------------|-----------------| -| Time (min) | 0.70 | 3.73 | 5.08 | -| CDS | 246 | 246 | 246 | -| Annotated Function CDS | 93 | 93 | 92 | -| Unknown Function CDS | 153 | 153 | 154 | - -The 673 crAss-like genomes were run with `-m` (defaults to `--mmseqs2_only` in v 1.4.0) and with `-g prodigal` (pyrodigal v2.1.0). - -| 673 crAss-like genomes | `pharokka` v1.4.0 `--fast` | `pharokka` v1.4.0 `--mmseqs2_only` | `pharokka` v1.3.2 | -|------------------------|---------------------------|----------------------------------|-----------------| -| Time (min) | 35.62 | 11.05 | 13.27 | -| CDS | 91999 | 91999 | 91999 | -| Annotated Function CDS | **16713** | 9150 | 9150 | -| Unknown Function CDS | 75286 | 82849 | 82849 | - # Bugs and Suggestions @@ -414,7 +449,7 @@ If you use `pharokka`, I would recommend a citation in your manuscript along the * All phages were annotated with Pharokka v ___ (Bouras, et al. 2023). Specifically, coding sequences (CDS) were predicted with PHANOTATE (McNair, et al. 2019), tRNAs were predicted with tRNAscan-SE 2.0 (Chan, et al. 2021), tmRNAs were predicted with Aragorn (Laslett, et al. 2004) and CRISPRs were preducted with CRT (Bland, et al. 2007). Functional annotation was generated by matching each CDS to the PHROGs (Terzian, et al. 2021), VFDB (Chen, et al. 2005) and CARD (Alcock, et al. 2020) databases using MMseqs2 (Steinegger, et al. 2017) and PyHMMER (Larralde, et al. 2023). 
Contigs were matched to their closest hit in the INPHARED database (Cook, et al. 2021) using mash (Ondov, et al. 2016). Plots were created with pyCirclize (Shimoyama 2022). -With the following full citations for the constituent tools below: +With the following full citations for the constituent tools below where relevant: * Cook R, Brown N, Redgwell T, Rihtman B, Barnes M, Clokie M, Stekel DJ, Hobman JL, Jones MA, Millard A. INfrastructure for a PHAge REference Database: Identification of Large-Scale Biases in the Current Collection of Cultured Phage Genomes. PHAGE. 2021. Available from: http://doi.org/10.1089/phage.2021.0007. * McNair K., Zhou C., Dinsdale E.A., Souza B., Edwards R.A. (2019) "PHANOTATE: a novel approach to gene identification in phage genomes", Bioinformatics, https://doi.org/10.1093/bioinformatics/btz26. @@ -428,4 +463,5 @@ With the following full citations for the constituent tools below: * Alcock et al, "CARD 2020: antibiotic resistome surveillance with the comprehensive antibiotic resistance database." Nucleic Acids Research (2020) https://doi.org/10.1093/nar/gkz935. * Larralde, M., (2022). Pyrodigal: Python bindings and interface to Prodigal, an efficient method for gene prediction in prokaryotes. Journal of Open Source Software, 7(72), 4296. doi:10.21105/joss.04296. * Larralde M., Zeller G., (2023). PyHMMER: a Python library binding to HMMER for efficient sequence analysis, Bioinformatics, Volume 39, Issue 5, May 2023, btad214, https://doi.org/10.1093/bioinformatics/btad214. -* Shimoyama, Y. (2022). pyCirclize: Circular visualization in Python [Computer software]. https://github.com/moshi4/pyCirclize \ No newline at end of file +* Larralde M. and Camargo A., (2023) Pyrodigal-gv: A Pyrodigal extension to predict genes in giant viruses and viruses with alternative genetic code. https://github.com/althonos/pyrodigal-gv. +* Shimoyama, Y. (2022). pyCirclize: Circular visualization in Python [Computer software]. 
https://github.com/moshi4/pyCirclize. \ No newline at end of file diff --git a/bin/input_commands.py b/bin/input_commands.py index e836e0e..cd7cbe0 100644 --- a/bin/input_commands.py +++ b/bin/input_commands.py @@ -4,9 +4,10 @@ import subprocess as sp from argparse import RawTextHelpFormatter +import pyrodigal +import pyrodigal_gv from Bio import SeqIO from loguru import logger -from pyrodigal import __version__ from util import get_version @@ -60,7 +61,7 @@ def get_input(): "-g", "--gene_predictor", action="store", - help='User specified gene predictor. Use "-g phanotate" or "-g prodigal". \nDefaults to phanotate (not required unless prodigal is desired).', + help='User specified gene predictor. Use "-g phanotate" or "-g prodigal" or "-g prodigal-gv" or "-g genbank". \nDefaults to phanotate (not required unless prodigal is desired).', default="phanotate", ) parser.add_argument( @@ -78,7 +79,7 @@ def get_input(): parser.add_argument( "-c", "--coding_table", - help="translation table for prodigal. Defaults to 11. Experimental only.", + help="translation table for prodigal. Defaults to 11.", action="store", default="11", ) @@ -138,6 +139,16 @@ def get_input(): action="store", default="nothing", ) + parser.add_argument( + "--skip_extra_annotations", + help="Skips tRNAscan-SE 2, MinCED and Aragorn.", + action="store_true", + ) + parser.add_argument( + "--skip_mash", + help="Skips running mash to find the closest match for each contig in INPHARED.", + action="store_true", + ) parser.add_argument( "-V", "--version", @@ -197,23 +208,56 @@ def instantiate_dirs(output_dir, meta, force): def validate_fasta(filename): - if os.path.isfile(filename) == False: # if file doesnt exist - logger.error(f"Error: Input file {filename} does not exist. Please check your input.") + if os.path.isfile(filename) == False: # if file doesn't exist + logger.error( + f"Error: Input file {filename} does not exist. Please check your input." 
+ ) else: with open(filename, "r") as handle: fasta = SeqIO.parse(handle, "fasta") - logger.info("Checking Input FASTA.") + logger.info(f"Checking input {filename}.") if any(fasta): - logger.info("FASTA checked.") + logger.info(f"Input {filename} is in FASTA format.") else: logger.error("Error: Input file is not in the FASTA format.") + # check for duplicate headers + logger.info(f"Checking input {filename} for duplicate FASTA headers.") + check_duplicate_headers(filename) + logger.info(f"All headers in {filename} are unique.") + + +def check_duplicate_headers(fasta_file): + """ + checks whether the input FASTA contains duplicated headers + in response to Tina's issue + https://github.com/gbouras13/pharokka/issues/293 + """ + header_set = set() + + # Iterate through the FASTA file and check for duplicate headers + for record in SeqIO.parse(fasta_file, "fasta"): + header = record.description + if header in header_set: + logger.error( + f"Duplicate header found: {header}" + ) # errors if duplicate header found + else: + header_set.add(header) + # reaching this point means no duplicate headers were found + def validate_gene_predictor(gene_predictor, genbank_flag): if gene_predictor == "phanotate": logger.info("Phanotate will be used for gene prediction.") elif gene_predictor == "prodigal": - logger.info("Prodigal will be used for gene prediction.") + logger.info( + "Prodigal implemented with pyrodigal will be used for gene prediction." + ) + elif gene_predictor == "prodigal-gv": + logger.info( + "Prodigal-gv implemented with pyrodigal-gv will be used for gene prediction." + ) elif gene_predictor == "genbank": if genbank_flag is False: logger.error( @@ -221,7 +265,7 @@ def validate_gene_predictor(gene_predictor, genbank_flag): ) else: logger.error( - "Error: gene predictor was incorrectly specified. Please use 'phanotate' or 'prodigal'." + "Error: gene predictor was incorrectly specified. Please use 'phanotate', 'prodigal' or 'prodigal-gv'." 
) @@ -300,8 +344,9 @@ def validate_threads(threads): ####### -def check_dependencies(): +def check_dependencies(skip_mash): """Checks the dependencies and versions + skip_mash flag from args, won't check mash if --skip_mash is specified :return: """ ############# @@ -314,10 +359,10 @@ def check_dependencies(): except: logger.error("Phanotate not found. Please reinstall pharokka.") phan_out, _ = process.communicate() - phanotate_out = phan_out.decode().strip() - phanotate_major_version = int(phanotate_out.split(".")[0]) - phanotate_minor_version = int(phanotate_out.split(".")[1]) - phanotate_minorest_version = phanotate_out.split(".")[2] + phanotate_version = phan_out.decode().strip() + phanotate_major_version = int(phanotate_version.split(".")[0]) + phanotate_minor_version = int(phanotate_version.split(".")[1]) + phanotate_minorest_version = phanotate_version.split(".")[2] logger.info( f"Phanotate version found is v{phanotate_major_version}.{phanotate_minor_version}.{phanotate_minorest_version}" @@ -471,25 +516,26 @@ def check_dependencies(): ############# # mash ############# - try: - process = sp.Popen(["mash", "--version"], stdout=sp.PIPE, stderr=sp.STDOUT) - except: - logger.error("mash not found. Please reinstall pharokka.") + if skip_mash is False: + try: + process = sp.Popen(["mash", "--version"], stdout=sp.PIPE, stderr=sp.STDOUT) + except: + logger.error("mash not found. 
Please reinstall pharokka.") - mash_out, _ = process.communicate() - mash_out = mash_out.decode().strip() + mash_out, _ = process.communicate() + mash_out = mash_out.decode().strip() - mash_major_version = int(mash_out.split(".")[0]) - mash_minor_version = int(mash_out.split(".")[1]) + mash_major_version = int(mash_out.split(".")[0]) + mash_minor_version = int(mash_out.split(".")[1]) - logger.info(f"mash version found is v{mash_major_version}.{mash_minor_version}") + logger.info(f"mash version found is v{mash_major_version}.{mash_minor_version}") - if mash_major_version != 2: - logger.error("mash is the wrong version. Please re-install pharokka.") - if mash_minor_version < 2: - logger.error("mash is the wrong version. Please re-install pharokka.") + if mash_major_version != 2: + logger.error("mash is the wrong version. Please re-install pharokka.") + if mash_minor_version < 2: + logger.error("mash is the wrong version. Please re-install pharokka.") - logger.info("mash version is ok.") + logger.info("mash version is ok.") ############# # dnaapler @@ -521,14 +567,37 @@ def check_dependencies(): # pyrodigal ####### - pyrodigal_major_version = int(__version__.split(".")[0]) + pyrodigal_version = pyrodigal.__version__ + pyrodigal_major_version = int(pyrodigal_version.split(".")[0]) - if pyrodigal_major_version < 2: + if pyrodigal_major_version < 3: logger.error("Pyrodigal is the wrong version. Please re-install pharokka.") - logger.info(f"Pyrodigal version is v{__version__}") + logger.info(f"Pyrodigal version is v{pyrodigal_version}") logger.info(f"Pyrodigal version is ok.") + ####### + # pyrodigal gv + ####### + + pyrodigal_gv_version = pyrodigal_gv.__version__ + pyrodigal_major_version = int(pyrodigal_gv_version.split(".")[0]) + + if pyrodigal_major_version < 0: + logger.error("Pyrodigal_gv is the wrong version. 
Please re-install pharokka.") + + logger.info(f"Pyrodigal_gv version is v{pyrodigal_gv_version}") + logger.info(f"Pyrodigal_gv version is ok.") + + return ( + phanotate_version, + pyrodigal_version, + pyrodigal_gv_version, + trna_version, + aragorn_version, + minced_version, + ) + def instantiate_split_output(out_dir, split): """ diff --git a/bin/pharokka.py b/bin/pharokka.py index 699a235..6a843b4 100755 --- a/bin/pharokka.py +++ b/bin/pharokka.py @@ -22,8 +22,8 @@ run_dnaapler, run_mash_dist, run_mash_sketch, run_minced, run_mmseqs, run_phanotate, run_phanotate_fasta_meta, run_phanotate_txt_meta, - run_pyrodigal, run_trna_scan, run_trnascan_meta, - split_input_fasta, translate_fastas) + run_pyrodigal, run_pyrodigal_gv, run_trna_scan, + run_trnascan_meta, split_input_fasta, translate_fastas) from util import count_contigs, get_version @@ -106,7 +106,15 @@ def main(): # dependencies logger.info("Checking dependencies.") - check_dependencies() + # output versions + ( + phanotate_version, + pyrodigal_version, + pyrodigal_gv_version, + trna_version, + aragorn_version, + minced_version, + ) = check_dependencies(args.skip_mash) # instantiation/checking fasta and gene_predictor if args.genbank is True: @@ -275,24 +283,30 @@ def main(): run_phanotate(input_fasta, out_dir, logdir) elif gene_predictor == "prodigal": logger.info("Implementing Prodigal using Pyrodigal.") - run_pyrodigal(input_fasta, out_dir, args.meta, args.coding_table) + run_pyrodigal( + input_fasta, out_dir, args.meta, args.coding_table, int(args.threads) + ) elif gene_predictor == "genbank": logger.info("Extracting CDS information from your genbank file.") + elif gene_predictor == "prodigal-gv": + logger.info("Implementing Prodigal-gv using Pyrodigal-gv.") + run_pyrodigal_gv(input_fasta, out_dir, int(args.threads)) # translate fastas (parse genbank) translate_fastas(out_dir, gene_predictor, args.coding_table, args.infile) # run trna-scan meta mode if required - if args.meta == True: - 
logger.info("Starting tRNA-scanSE. Applying meta mode.") - run_trnascan_meta(input_fasta, out_dir, args.threads, num_fastas) - concat_trnascan_meta(out_dir, num_fastas) - else: - logger.info("Starting tRNA-scanSE.") - run_trna_scan(input_fasta, args.threads, out_dir, logdir) - # run minced and aragorn - run_minced(input_fasta, out_dir, prefix, logdir) - run_aragorn(input_fasta, out_dir, prefix, logdir) + if args.skip_extra_annotations is False: + if args.meta == True: + logger.info("Starting tRNA-scanSE. Applying meta mode.") + run_trnascan_meta(input_fasta, out_dir, args.threads, num_fastas) + concat_trnascan_meta(out_dir, num_fastas) + else: + logger.info("Starting tRNA-scanSE.") + run_trna_scan(input_fasta, args.threads, out_dir, logdir) + # run minced and aragorn + run_minced(input_fasta, out_dir, prefix, logdir) + run_aragorn(input_fasta, out_dir, prefix, logdir) # running mmseqs2 on the 3 databases if mmseqs_flag is True: @@ -358,23 +372,36 @@ def main(): pharok.mmseqs_flag = mmseqs_flag pharok.hmm_flag = hmm_flag pharok.custom_hmm_flag = custom_hmm_flag + pharok.phanotate_version = phanotate_version + pharok.pyrodigal_version = pyrodigal_version + pharok.pyrodigal_gv_version = pyrodigal_gv_version + pharok.trna_version = trna_version + pharok.aragorn_version = aragorn_version + pharok.minced_version = minced_version + pharok.skip_extra_annotations = args.skip_extra_annotations + if pharok.hmm_flag is True: pharok.pyhmmer_results_dict = best_results_pyhmmer if pharok.custom_hmm_flag is True: pharok.custom_pyhmmer_results_dict = best_results_custom_pyhmmer + ##################################### + # post processing + ##################################### + + # gets df of length and gc for each contig + pharok.get_contig_name_lengths() + # post process results # includes vfdb and card # adds the merged df, vfdb and card top hits dfs to the class objec # no need to specify params as they are in the class :) pharok.process_results() - # gets df of length and gc 
for each contig - pharok.get_contig_name_lengths() - # parse the aragorn output - # get flag whether there is a tmrna from aragor - pharok.parse_aragorn() + # only if not skipping annots + if args.skip_extra_annotations is False: + pharok.parse_aragorn() # create gff and save locustag to class for table pharok.create_gff() @@ -403,7 +430,7 @@ def main(): # convert to genbank logger.info("Converting gff to genbank.") # not part of the class so from processes.py - convert_gff_to_gbk(input_fasta, out_dir, out_dir, prefix, args.coding_table) + convert_gff_to_gbk(input_fasta, out_dir, out_dir, prefix, pharok.prot_seq_df) # update fasta headers and final output tsv pharok.update_fasta_headers() @@ -413,12 +440,16 @@ def main(): pharok.extract_terl() # run mash - logger.info("Finding the closest match for each contig in INPHARED using mash.") - # in process.py - run_mash_sketch(input_fasta, out_dir, logdir) - run_mash_dist(out_dir, db_dir, logdir) - # part of the class - pharok.inphared_top_hits() + if args.skip_mash is False: # skips mash + logger.info("Finding the closest match for each contig in INPHARED using mash.") + # in process.py + run_mash_sketch(input_fasta, out_dir, logdir) + run_mash_dist(out_dir, db_dir, logdir) + # part of the class + pharok.inphared_top_hits() + else: + logger.info("You have chosen --skip_mash.") + logger.info("Skipping finding the closest match for each contig in INPHARED using mash.") # delete tmp files remove_post_processing_files(out_dir, gene_predictor, args.meta) diff --git a/bin/post_processing.py b/bin/post_processing.py index db5a096..ccdb1b8 100644 --- a/bin/post_processing.py +++ b/bin/post_processing.py @@ -51,6 +51,9 @@ def __init__( ), gff_df: pd.DataFrame() = pd.DataFrame({"col1": [1, 2, 3], "col2": [4, 5, 6]}), locus_df: pd.DataFrame() = pd.DataFrame({"col1": [1, 2, 3], "col2": [4, 5, 6]}), + prot_seq_df: pd.DataFrame() = pd.DataFrame( + {"col1": [1, 2, 3], "col2": [4, 5, 6]} + ), tmrna_flag: bool = False, trna_empty: bool 
= False, crispr_count: int = 0, @@ -58,6 +61,13 @@ def __init__( mmseqs_flag: bool = True, hmm_flag: bool = True, custom_hmm_flag: bool = False, + phanotate_version: str = "1.5.0", + pyrodigal_version: str = "3.0.0", + pyrodigal_gv_version: str = "0.1.0", + trna_version: str = "2.0.12", + aragorn_version: str = "1.2.41", + minced_version: str = "0.4.2", + skip_extra_annotations: bool = False ) -> None: """ Parameters @@ -104,6 +114,22 @@ def __init__( whether HMM was run custom_hmm_flag: bool whether a custom db of HMMs was run + phanotate_version: str + phanotate_version from check_dependencies() + pyrodigal_version: str + pyrodigal_version from check_dependencies() + pyrodigal_gv_version: str + pyrodigal_gv_version from check_dependencies() + trna_version: str + trnascan_version from check_dependencies() + aragorn_version: str + aragorn_version from check_dependencies() + minced_version: str + minced_version from check_dependencies() + prot_seq_df: pd.DataFrame + dataframe with protein sequence information for each gene + skip_extra_annotations: bool + boolean whether extra annotations are skipped """ self.out_dir = out_dir self.db_dir = db_dir @@ -127,6 +153,14 @@ def __init__( self.mmseqs_flag = mmseqs_flag self.hmm_flag = hmm_flag self.custom_hmm_flag = custom_hmm_flag + self.phanotate_version = phanotate_version + self.pyrodigal_version = pyrodigal_version + self.pyrodigal_gv_version = pyrodigal_gv_version + self.trna_version = trna_version + self.aragorn_version = aragorn_version + self.minced_version = minced_version + self.prot_seq_df = prot_seq_df + self.skip_extra_annotations = skip_extra_annotations def process_results(self): """ @@ -143,6 +177,34 @@ def process_results(self): cds_file = os.path.join(self.out_dir, "cleaned_" + self.gene_predictor + ".tsv") cds_df = pd.read_csv(cds_file, sep="\t", index_col=False) + ########################################### + # add the sequence to the df for the genbank conversion later on + fasta_input_aas_tmp =
os.path.join( + self.out_dir, f"{self.gene_predictor}_aas_tmp.fasta" + ) + prot_dict = SeqIO.to_dict(SeqIO.parse(fasta_input_aas_tmp, "fasta")) + + # make a copy of cds_df + self.prot_seq_df = cds_df.copy() + + # to match the output for gff + self.prot_seq_df[["gene", "st"]] = self.prot_seq_df["gene"].str.split( + " ", expand=True + ) + + self.prot_seq_df = self.prot_seq_df.drop(columns=["st"]) + + # get sequences for each gene in df + self.prot_seq_df["sequence"] = "MA" + + for index, row in self.prot_seq_df.iterrows(): + # get the gene id + gene = row["gene"] + if gene in prot_dict.keys(): + # add the AA sequence + self.prot_seq_df.at[index, "sequence"] = prot_dict[gene].seq + + ########################################## # create the tophits_df and write it to file if self.mmseqs_flag is True: tophits_df = create_mmseqs_tophits(self.out_dir) @@ -248,11 +310,13 @@ def process_results(self): # add columns if self.gene_predictor == "phanotate": - merged_df["Method"] = "PHANOTATE" + merged_df["Method"] = f"PHANOTATE_{self.phanotate_version}" elif self.gene_predictor == "prodigal": - merged_df["Method"] = "PRODIGAL" + merged_df["Method"] = f"Pyrodigal_{self.pyrodigal_version}" elif self.gene_predictor == "genbank": merged_df["Method"] = "CUSTOM" + elif self.gene_predictor == "prodigal-gv": + merged_df["Method"] = f"Pyrodigal-gv_{self.pyrodigal_gv_version}" merged_df["Region"] = "CDS" # # replace with No_PHROG if nothing found @@ -301,23 +365,91 @@ def process_results(self): def get_contig_name_lengths(self): """ - Gets contig name and length in the input fasta file and calculates gc + Gets contig name and length in the input fasta file and calculates gc. 
+ Also adds translation table :param fasta_input: input fasta file :return: length_df a pandas dataframe (to class) """ + fasta_sequences = SeqIO.parse(open(self.input_fasta), "fasta") + + if self.gene_predictor == "prodigal-gv": + # define col list + col_list = [ + "contig", + "Method", + "Region", + "start", + "stop", + "score", + "frame", + "phase", + "attributes", + ] + # read gff (no fasta output) + pyrodigal_gv_gff = pd.read_csv( + os.path.join(self.out_dir, "prodigal-gv_out.gff"), + delimiter="\t", + index_col=False, + names=col_list, + ) + + pyrodigal_gv_gff[["attributes", "transl_table"]] = pyrodigal_gv_gff[ + "attributes" + ].str.split("transl_table=", expand=True) + pyrodigal_gv_gff[["transl_table", "rest"]] = pyrodigal_gv_gff[ + "transl_table" + ].str.split(";conf", expand=True) + # drop and then remove duplicates in df + pyrodigal_gv_gff = pyrodigal_gv_gff.drop( + columns=[ + "rest", + "Method", + "Region", + "start", + "stop", + "score", + "frame", + "phase", + "attributes", + ] + ) + # Remove duplicate rows based on all columns + pyrodigal_gv_gff = pyrodigal_gv_gff.drop_duplicates() + # Convert to a dictionary + transl_table_dict = pyrodigal_gv_gff.set_index("contig")[ + "transl_table" + ].to_dict() + contig_names = [] lengths = [] gc = [] - for fasta in fasta_sequences: - contig_names.append(fasta.id) - lengths.append(len(fasta.seq)) - gc.append(round(GC(fasta.seq), 2)) + transl_tables = [] + + transl_table = "11" + if self.gene_predictor == "phanotate": + transl_table = "11" + elif self.gene_predictor == "prodigal": + transl_table = self.coding_table + elif self.gene_predictor == "genbank": + transl_table = "custom_gene_calls_from_genbank" + + for record in fasta_sequences: + contig_names.append(record.id) + lengths.append(len(record.seq)) + gc.append(round(GC(record.seq), 2)) + # pyrodigal-gv lookup from the dict + if self.gene_predictor == "prodigal-gv": + transl_table = transl_table_dict[record.id] + + transl_tables.append(transl_table) +
length_df = pd.DataFrame( { "contig": contig_names, "length": lengths, "gc_perc": gc, + "transl_table": transl_tables, } ) self.length_df = length_df @@ -370,7 +502,7 @@ def parse_aragorn(self): split = line.split() start_stops = split[2].replace("[", "").replace("]", "").split(",") contig = self.length_df["contig"][0] - method = "Aragorn" + method = f"Aragorn_{self.aragorn_version}" region = "tmRNA" start = start_stops[0].replace( "c", "" @@ -430,7 +562,7 @@ def parse_aragorn(self): split[2].replace("[", "").replace("]", "").split(",") ) contig = self.length_df["contig"][j] - method = "Aragorn" + method = f"Aragorn_{self.aragorn_version}" region = "tmRNA" start = start_stops[0].replace( "c", "" @@ -516,6 +648,10 @@ def create_gff(self): # get all contigs contigs = self.length_df["contig"].astype("string") + # add the translation table + transl_table_df = self.length_df.drop(columns=["length", "gc_perc"]) + self.merged_df = self.merged_df.merge(transl_table_df, how="left", on="contig") + ############ locus tag ######### # write df for locus tag parsing # zfill - makes the CDS 4 digits trailing zeroes for vcontact @@ -550,6 +686,7 @@ def create_gff(self): # assign count and locus_tag to merged_df (for meta) self.merged_df["locus_tag"] = locus_df["locus_tag"] self.merged_df["count"] = locus_df["count"] + ################################# ######### @@ -573,6 +710,9 @@ def create_gff(self): "ID=" + locus_df["locus_tag"].astype(str) + ";" + + "transl_table=" + + locus_df["transl_table"].astype(str) + + ";" + "phrog=" + self.merged_df["phrog"].astype(str) + ";" @@ -650,215 +790,231 @@ def create_gff(self): ### trnas # check if no trnas - col_list = [ - "contig", - "Method", - "Region", - "start", - "stop", - "score", - "frame", - "phase", - "attributes", - ] - trna_empty = is_trna_empty(self.out_dir) - if trna_empty == False: - trna_df = pd.read_csv( - os.path.join(self.out_dir, "trnascan_out.gff"), - delimiter="\t", - index_col=False, - names=col_list, - ) - # index 
hack if meta mode - if self.meta_mode == True: - subset_dfs = [] - for contig in contigs: - subset_df = trna_df[trna_df["contig"] == contig].reset_index() - # keep only trnas before indexing - subset_df = subset_df[ - (subset_df["Region"] == "tRNA") - | (subset_df["Region"] == "pseudogene") + + # to make sure you aren't skipping trnas + if self.skip_extra_annotations is False: + col_list = [ + "contig", + "Method", + "Region", + "start", + "stop", + "score", + "frame", + "phase", + "attributes", + ] + trna_empty = is_trna_empty(self.out_dir) + if trna_empty == False: + trna_df = pd.read_csv( + os.path.join(self.out_dir, "trnascan_out.gff"), + delimiter="\t", + index_col=False, + names=col_list, + ) + + # convert the method to update with version + trna_df["Method"] = f"tRNAscan-SE_{self.trna_version}" + + # index hack if meta mode + if self.meta_mode == True: + subset_dfs = [] + for contig in contigs: + subset_df = trna_df[trna_df["contig"] == contig].reset_index() + # keep only trnas before indexing + subset_df = subset_df[ + (subset_df["Region"] == "tRNA") + | (subset_df["Region"] == "pseudogene") + ] + subset_df = subset_df.reset_index(drop=True) + subset_df["count"] = subset_df.index + # so not 0 indexed + subset_df["count"] = subset_df["count"] + 1 + # z fill to make the locus tag 4 + subset_df["locus_tag"] = ( + contig + "_tRNA_" + subset_df["count"].astype(str).str.zfill(4) + ) + subset_df = subset_df.drop(columns=["count"]) + subset_dfs.append(subset_df) + trna_df = pd.concat(subset_dfs, axis=0, ignore_index=True) + trna_df = trna_df.drop(columns=["index"]) + else: + # keep only trnas + trna_df = trna_df[ + (trna_df["Region"] == "tRNA") | (trna_df["Region"] == "pseudogene") ] - subset_df = subset_df.reset_index(drop=True) - subset_df["count"] = subset_df.index - # so not 0 indexed - subset_df["count"] = subset_df["count"] + 1 - # z fill to make the locus tag 4 - subset_df["locus_tag"] = ( - contig + "_tRNA_" + subset_df["count"].astype(str).str.zfill(4) + 
trna_df = trna_df.reset_index(drop=True) + trna_df["count"] = trna_df.index + trna_df["count"] = trna_df["count"] + 1 + trna_df["locus_tag"] = ( + self.locustag + "_tRNA_" + trna_df["count"].astype(str).str.zfill(4) ) - subset_df = subset_df.drop(columns=["count"]) - subset_dfs.append(subset_df) - trna_df = pd.concat(subset_dfs, axis=0, ignore_index=True) - trna_df = trna_df.drop(columns=["index"]) - else: - # keep only trnas - trna_df = trna_df[ - (trna_df["Region"] == "tRNA") | (trna_df["Region"] == "pseudogene") - ] - trna_df = trna_df.reset_index(drop=True) - trna_df["count"] = trna_df.index - trna_df["count"] = trna_df["count"] + 1 - trna_df["locus_tag"] = ( - self.locustag + "_tRNA_" + trna_df["count"].astype(str).str.zfill(4) - ) - trna_df = trna_df.drop(columns=["count"]) + trna_df = trna_df.drop(columns=["count"]) - trna_df.start = trna_df.start.astype(int) - trna_df.stop = trna_df.stop.astype(int) - trna_df[["attributes", "isotypes"]] = trna_df["attributes"].str.split( - ";isotype=", expand=True - ) - trna_df[["isotypes", "anticodon"]] = trna_df["isotypes"].str.split( - ";anticodon=", expand=True - ) - trna_df[["anticodon", "rest"]] = trna_df["anticodon"].str.split( - ";gene_biotype", expand=True - ) - trna_df["trna_product"] = ( - "tRNA-" + trna_df["isotypes"] + "(" + trna_df["anticodon"] + ")" - ) - trna_df = trna_df.drop(columns=["attributes"]) - trna_df["attributes"] = ( - "ID=" - + trna_df["locus_tag"] - + ";" - + "trna=" - + trna_df["trna_product"].astype(str) - + ";" - + "isotype=" - + trna_df["isotypes"].astype(str) - + ";" - + "anticodon=" - + trna_df["anticodon"].astype(str) - + ";" - + "locus_tag=" - + trna_df["locus_tag"] - ) - trna_df = trna_df.drop( - columns=["isotypes", "anticodon", "rest", "trna_product", "locus_tag"] - ) + trna_df.start = trna_df.start.astype(int) + trna_df.stop = trna_df.stop.astype(int) + trna_df[["attributes", "isotypes"]] = trna_df["attributes"].str.split( + ";isotype=", expand=True + ) + trna_df[["isotypes", 
"anticodon"]] = trna_df["isotypes"].str.split( + ";anticodon=", expand=True + ) + trna_df[["anticodon", "rest"]] = trna_df["anticodon"].str.split( + ";gene_biotype", expand=True + ) + trna_df["trna_product"] = ( + "tRNA-" + trna_df["isotypes"] + "(" + trna_df["anticodon"] + ")" + ) + trna_df = trna_df.drop(columns=["attributes"]) + trna_df["attributes"] = ( + "ID=" + + trna_df["locus_tag"] + + ";" + + "transl_table=" + + locus_df["transl_table"].astype(str) + + ";" + + "trna=" + + trna_df["trna_product"].astype(str) + + ";" + + "isotype=" + + trna_df["isotypes"].astype(str) + + ";" + + "anticodon=" + + trna_df["anticodon"].astype(str) + + ";" + + "locus_tag=" + + trna_df["locus_tag"] + ) + trna_df = trna_df.drop( + columns=["isotypes", "anticodon", "rest", "trna_product", "locus_tag"] + ) - ### crisprs - crispr_count = get_crispr_count(self.out_dir, self.prefix) - # add to gff if > 0 - if crispr_count > 0: - minced_df = pd.read_csv( - os.path.join(self.out_dir, self.prefix + "_minced.gff"), - delimiter="\t", - index_col=False, - names=col_list, - comment="#", - ) - minced_df.start = minced_df.start.astype(int) - minced_df.stop = minced_df.stop.astype(int) - minced_df[["attributes", "rpt_unit_seq"]] = minced_df[ - "attributes" - ].str.split(";rpt_unit_seq=", expand=True) - minced_df[["attributes", "rpt_family"]] = minced_df["attributes"].str.split( - ";rpt_family=", expand=True - ) - minced_df[["attributes", "rpt_type"]] = minced_df["attributes"].str.split( - ";rpt_type=", expand=True - ) - minced_df = minced_df.drop(columns=["attributes"]) - # index hack if meta mode - subset_dfs = [] - if self.meta_mode == True: - for contig in contigs: - subset_df = minced_df[minced_df["contig"] == contig].reset_index() - subset_df["count"] = subset_df.index - # so not 0 indexed - subset_df["count"] = subset_df["count"] + 1 - # z fill to make the locus tag 4 - subset_df["count"] = subset_df["count"].astype(str).str.zfill(4) - subset_df["locus_tag"] = ( - contig + ### crisprs + 
crispr_count = get_crispr_count(self.out_dir, self.prefix) + # add to gff if > 0 + if crispr_count > 0: + minced_df = pd.read_csv( + os.path.join(self.out_dir, self.prefix + "_minced.gff"), + delimiter="\t", + index_col=False, + names=col_list, + comment="#", + ) + minced_df.start = minced_df.start.astype(int) + minced_df.stop = minced_df.stop.astype(int) + minced_df[["attributes", "rpt_unit_seq"]] = minced_df[ + "attributes" + ].str.split(";rpt_unit_seq=", expand=True) + minced_df[["attributes", "rpt_family"]] = minced_df["attributes"].str.split( + ";rpt_family=", expand=True + ) + minced_df[["attributes", "rpt_type"]] = minced_df["attributes"].str.split( + ";rpt_type=", expand=True + ) + minced_df = minced_df.drop(columns=["attributes"]) + # index hack if meta mode + subset_dfs = [] + if self.meta_mode == True: + for contig in contigs: + subset_df = minced_df[minced_df["contig"] == contig].reset_index() + subset_df["count"] = subset_df.index + # so not 0 indexed + subset_df["count"] = subset_df["count"] + 1 + # z fill to make the locus tag 4 + subset_df["count"] = subset_df["count"].astype(str).str.zfill(4) + subset_df["locus_tag"] = ( + contig + + "_CRISPR_" + + subset_df["count"].astype(str).str.zfill(4) + ) + subset_df = subset_df.drop(columns=["count"]) + subset_dfs.append(subset_df) + minced_df = pd.concat(subset_dfs, axis=0, ignore_index=True) + minced_df = minced_df.drop(columns=["index"]) + else: + minced_df["count"] = minced_df.index + minced_df["count"] = minced_df["count"] + 1 + minced_df["locus_tag"] = ( + self.locustag + "_CRISPR_" - + subset_df["count"].astype(str).str.zfill(4) + + minced_df["count"].astype(str).str.zfill(4) ) - subset_df = subset_df.drop(columns=["count"]) - subset_dfs.append(subset_df) - minced_df = pd.concat(subset_dfs, axis=0, ignore_index=True) - minced_df = minced_df.drop(columns=["index"]) - else: - minced_df["count"] = minced_df.index - minced_df["count"] = minced_df["count"] + 1 - minced_df["locus_tag"] = ( - self.locustag 
- + "_CRISPR_" - + minced_df["count"].astype(str).str.zfill(4) + minced_df = minced_df.drop(columns=["count"]) + + minced_df["attributes"] = ( + "ID=" + + minced_df["locus_tag"] + + ";" + + "transl_table=" + + locus_df["transl_table"].astype(str) + + ";" + + "rpt_type=" + + minced_df["rpt_type"].astype(str) + + ";" + + "rpt_family=" + + minced_df["rpt_family"].astype(str) + + ";" + + "rpt_unit_seq=" + + minced_df["rpt_unit_seq"].astype(str) + + ";" + + "locus_tag=" + + minced_df["locus_tag"] + ) + minced_df = minced_df.drop( + columns=["rpt_unit_seq", "rpt_family", "rpt_type", "locus_tag"] ) - minced_df = minced_df.drop(columns=["count"]) - - minced_df["attributes"] = ( - "ID=" - + minced_df["locus_tag"] - + ";" - + "rpt_type=" - + minced_df["rpt_type"].astype(str) - + ";" - + "rpt_family=" - + minced_df["rpt_family"].astype(str) - + ";" - + "rpt_unit_seq=" - + minced_df["rpt_unit_seq"].astype(str) - + ";" - + "locus_tag=" - + minced_df["locus_tag"] - ) - minced_df = minced_df.drop( - columns=["rpt_unit_seq", "rpt_family", "rpt_type", "locus_tag"] - ) - ### tmrna - # add to gff there is a tmrna - if self.tmrna_flag == True: - tmrna_df = pd.read_csv( - os.path.join(self.out_dir, self.prefix + "_aragorn.gff"), - delimiter="\t", - index_col=False, - names=col_list, - ) - tmrna_df.start = tmrna_df.start.astype(int) - tmrna_df.stop = tmrna_df.stop.astype(int) + ### tmrna + # add to gff there is a tmrna + if self.tmrna_flag == True: + tmrna_df = pd.read_csv( + os.path.join(self.out_dir, self.prefix + "_aragorn.gff"), + delimiter="\t", + index_col=False, + names=col_list, + ) + tmrna_df.start = tmrna_df.start.astype(int) + tmrna_df.stop = tmrna_df.stop.astype(int) - # index hack if meta mode - subset_dfs = [] - if self.meta_mode == True: - for contig in contigs: - subset_df = tmrna_df[tmrna_df["contig"] == contig].reset_index() - subset_df["count"] = subset_df.index - # so not 0 indexed - subset_df["count"] = subset_df["count"] + 1 - # z fill to make the locus tag 4 - 
subset_df["count"] = subset_df["count"].astype(str).str.zfill(4) - subset_df["locus_tag"] = ( - contig + "_tmRNA_" + subset_df["count"].astype(str).str.zfill(4) + # index hack if meta mode + subset_dfs = [] + if self.meta_mode == True: + for contig in contigs: + subset_df = tmrna_df[tmrna_df["contig"] == contig].reset_index() + subset_df["count"] = subset_df.index + # so not 0 indexed + subset_df["count"] = subset_df["count"] + 1 + # z fill to make the locus tag 4 + subset_df["count"] = subset_df["count"].astype(str).str.zfill(4) + subset_df["locus_tag"] = ( + contig + "_tmRNA_" + subset_df["count"].astype(str).str.zfill(4) + ) + subset_df = subset_df.drop(columns=["count"]) + subset_dfs.append(subset_df) + tmrna_df = pd.concat(subset_dfs, axis=0, ignore_index=True) + tmrna_df = tmrna_df.drop(columns=["index"]) + else: + tmrna_df["count"] = tmrna_df.index + tmrna_df["count"] = tmrna_df["count"] + 1 + tmrna_df["locus_tag"] = ( + self.locustag + + "_tmRNA_" + + tmrna_df["count"].astype(str).str.zfill(4) ) - subset_df = subset_df.drop(columns=["count"]) - subset_dfs.append(subset_df) - tmrna_df = pd.concat(subset_dfs, axis=0, ignore_index=True) - tmrna_df = tmrna_df.drop(columns=["index"]) - else: - tmrna_df["count"] = tmrna_df.index - tmrna_df["count"] = tmrna_df["count"] + 1 - tmrna_df["locus_tag"] = ( - self.locustag - + "_tmRNA_" - + tmrna_df["count"].astype(str).str.zfill(4) + tmrna_df = tmrna_df.drop(columns=["count"]) + + tmrna_df["attributes"] = ( + "ID=" + + tmrna_df["locus_tag"] + + ";" + + "transl_table=" + + locus_df["transl_table"].astype(str) + + ";" + + tmrna_df["attributes"].astype(str) + + ";locus_tag=" + + tmrna_df["locus_tag"] ) - tmrna_df = tmrna_df.drop(columns=["count"]) - - tmrna_df["attributes"] = ( - "ID=" - + tmrna_df["locus_tag"] - + ";" - + tmrna_df["attributes"].astype(str) - + ";locus_tag=" - + tmrna_df["locus_tag"] - ) - tmrna_df = tmrna_df.drop(columns=["locus_tag"]) + tmrna_df = tmrna_df.drop(columns=["locus_tag"]) # write header of 
final gff files with open(os.path.join(self.out_dir, self.prefix + ".gff"), "w") as f: @@ -874,23 +1030,27 @@ def create_gff(self): # combine dfs depending on whether the elements were detected - if trna_empty is True and self.tmrna_flag is False and crispr_count == 0: # all + # skip the extra annotations (tRNAs etc.) + if self.skip_extra_annotations is True: df_list = [gff_df] - elif trna_empty is False and self.tmrna_flag is False and crispr_count == 0: - df_list = [gff_df, trna_df] - elif trna_empty is True and self.tmrna_flag is True and crispr_count == 0: - df_list = [gff_df, tmrna_df] - elif trna_empty is True and self.tmrna_flag is False and crispr_count > 0: - df_list = [gff_df, minced_df] - elif trna_empty is False and self.tmrna_flag is True and crispr_count == 0: - df_list = [gff_df, trna_df, tmrna_df] - elif trna_empty is False and self.tmrna_flag is False and crispr_count > 0: - df_list = [gff_df, trna_df, minced_df] - elif trna_empty is True and self.tmrna_flag is True and crispr_count > 0: - df_list = [gff_df, tmrna_df, minced_df] - # if trna_empty is False and self.tmrna_flag is True and crispr_count > 0: # all detected - else: # all detected - df_list = [gff_df, trna_df, tmrna_df, minced_df] + else: + if trna_empty is True and self.tmrna_flag is False and crispr_count == 0: # none detected + df_list = [gff_df] + elif trna_empty is False and self.tmrna_flag is False and crispr_count == 0: + df_list = [gff_df, trna_df] + elif trna_empty is True and self.tmrna_flag is True and crispr_count == 0: + df_list = [gff_df, tmrna_df] + elif trna_empty is True and self.tmrna_flag is False and crispr_count > 0: + df_list = [gff_df, minced_df] + elif trna_empty is False and self.tmrna_flag is True and crispr_count == 0: + df_list = [gff_df, trna_df, tmrna_df] + elif trna_empty is False and self.tmrna_flag is False and crispr_count > 0: + df_list = [gff_df, trna_df, minced_df] + elif trna_empty is True and self.tmrna_flag is True and crispr_count > 0: + df_list = [gff_df, tmrna_df,
minced_df] + # if trna_empty is False and self.tmrna_flag is True and crispr_count > 0: # all detected + else: # all detected + df_list = [gff_df, trna_df, tmrna_df, minced_df] total_gff = pd.concat(df_list, ignore_index=True) @@ -899,7 +1059,7 @@ def create_gff(self): total_gff.stop = total_gff.stop.astype(int) # sorts all features by start - total_gff = total_gff.groupby(["contig"], sort=False, as_index=False).apply( + total_gff = total_gff.groupby(["contig"], sort=False, as_index=False, group_keys=True).apply( pd.DataFrame.sort_values, "start", ascending=True ) @@ -923,8 +1083,13 @@ def create_gff(self): self.locus_df = locus_df self.gff_df = gff_df self.total_gff = total_gff - self.trna_empty = trna_empty - self.crispr_count = crispr_count + if self.skip_extra_annotations is False: + self.trna_empty = trna_empty + self.crispr_count = crispr_count + else: # skip annotations + self.trna_empty = True + self.crispr_count = 0 + self.tmrna_flag = False def create_tbl( self, @@ -944,10 +1109,20 @@ def create_tbl( # get the cds + self.total_gff = self.total_gff.reset_index(drop=True) + if self.gene_predictor == "phanotate": - cds_df = self.total_gff[self.total_gff["Method"] == "PHANOTATE"] + cds_df = self.total_gff[ + self.total_gff["Method"] == f"PHANOTATE_{self.phanotate_version}" + ] elif self.gene_predictor == "prodigal": - cds_df = self.total_gff[self.total_gff["Method"] == "PRODIGAL"] + cds_df = self.total_gff[ + self.total_gff["Method"] == f"Pyrodigal_{self.pyrodigal_version}" + ] + elif self.gene_predictor == "prodigal-gv": + cds_df = self.total_gff[ + self.total_gff["Method"] == f"Pyrodigal-gv_{self.pyrodigal_gv_version}" + ] elif self.gene_predictor == "genbank": cds_df = self.total_gff[self.total_gff["Method"] == "CUSTOM"] @@ -958,10 +1133,13 @@ def create_tbl( ";function=", expand=True ) + ### trnas # check if no trnas - if self.trna_empty == False: - trna_df = self.total_gff[self.total_gff["Method"] == "tRNAscan-SE"] + if self.trna_empty is False: + 
trna_df = self.total_gff[ + self.total_gff["Method"] == f"tRNAscan-SE_{self.trna_version}" + ] # keep only trnas and pseudogenes trna_df.start = trna_df.start.astype(int) trna_df.stop = trna_df.stop.astype(int) @@ -991,7 +1169,7 @@ def create_tbl( ].str.split(";rpt_unit_seq=", expand=True) ### TMRNAs - if self.tmrna_flag == True: + if self.tmrna_flag is True: tmrna_df = self.total_gff[self.total_gff["Region"] == "tmRNA"] tmrna_df.start = tmrna_df.start.astype(int) tmrna_df.stop = tmrna_df.stop.astype(int) @@ -1045,7 +1223,7 @@ def create_tbl( + "\t" + "transl_table" + "\t" - + str(self.coding_table) + + str(subset_df["transl_table"]) + "\n" ) if self.trna_empty == False: @@ -1090,7 +1268,7 @@ def create_tbl( + "\t" + "transl_table" + "\t" - + str(self.coding_table) + + str(subset_df["transl_table"]) + "\n" ) if self.crispr_count > 0: @@ -1135,7 +1313,7 @@ def create_tbl( + "\t" + "transl_table" + "\t" - + str(self.coding_table) + + str(subset_df["transl_table"]) + "\n" ) if self.tmrna_flag == True: @@ -1180,7 +1358,7 @@ def create_tbl( + "\t" + "transl_table" + "\t" - + str(self.coding_table) + + str(subset_df["transl_table"]) + "\n" ) @@ -1245,7 +1423,7 @@ def convert_singles_gff_to_gbk(self): ) contig = row["contig"] convert_gff_to_gbk( - fasta_file, single_gff_dir, single_gbk_dir, contig, self.coding_table + fasta_file, single_gff_dir, single_gbk_dir, contig, self.prot_seq_df ) def split_fasta_singles(self): @@ -1385,44 +1563,45 @@ def create_txt(self): #### trna scan # read in trnascan - col_list = [ - "contig", - "Method", - "Region", - "start", - "stop", - "score", - "frame", - "phase", - "attributes", - ] - trna_df = pd.read_csv( - os.path.join(self.out_dir, "trnascan_out.gff"), - delimiter="\t", - index_col=False, - names=col_list, - ) - # keep only trnas and pseudogenes - trna_df = trna_df[ - (trna_df["Region"] == "tRNA") | (trna_df["Region"] == "pseudogene") - ] + if self.skip_extra_annotations is False: + col_list = [ + "contig", + "Method", + "Region", 
+ "start", + "stop", + "score", + "frame", + "phase", + "attributes", + ] + trna_df = pd.read_csv( + os.path.join(self.out_dir, "trnascan_out.gff"), + delimiter="\t", + index_col=False, + names=col_list, + ) + # keep only trnas and pseudogenes + trna_df = trna_df[ + (trna_df["Region"] == "tRNA") | (trna_df["Region"] == "pseudogene") + ] - #### crispr - crispr_df = pd.read_csv( - os.path.join(self.out_dir, self.prefix + "_minced.gff"), - delimiter="\t", - index_col=False, - names=col_list, - comment="#", - ) + #### crispr + crispr_df = pd.read_csv( + os.path.join(self.out_dir, self.prefix + "_minced.gff"), + delimiter="\t", + index_col=False, + names=col_list, + comment="#", + ) - #### tmrna - tmrna_df = pd.read_csv( - os.path.join(self.out_dir, self.prefix + "_aragorn.gff"), - delimiter="\t", - index_col=False, - names=col_list, - ) + #### tmrna + tmrna_df = pd.read_csv( + os.path.join(self.out_dir, self.prefix + "_aragorn.gff"), + delimiter="\t", + index_col=False, + names=col_list, + ) # write descriptions for each contig for contig in contigs: @@ -1434,9 +1613,10 @@ def create_txt(self): cds_count = len( cds_mmseqs_merge_cont_df[cds_mmseqs_merge_cont_df["Region"] == "CDS"] ) - trna_count = len(trna_df[trna_df["contig"] == contig]) - tmrna_count = len(tmrna_df[tmrna_df["contig"] == contig]) - crispr_count = len(crispr_df[crispr_df["contig"] == contig]) + if self.skip_extra_annotations is False: + trna_count = len(trna_df[trna_df["contig"] == contig]) + tmrna_count = len(tmrna_df[tmrna_df["contig"] == contig]) + crispr_count = len(crispr_df[crispr_df["contig"] == contig]) if len(self.vfdb_results["contig"]) != 0: vfdb_count = len( self.vfdb_results[self.vfdb_results["contig"] == contig] @@ -1590,20 +1770,23 @@ def create_txt(self): } ) - # add other features - trna_row = pd.DataFrame( - {"Description": ["tRNAs"], "Count": [trna_count], "contig": [contig]} - ) - crispr_row = pd.DataFrame( - { - "Description": ["CRISPRs"], - "Count": [crispr_count], - "contig": 
[contig], - } - ) - tmrna_row = pd.DataFrame( - {"Description": ["tmRNAs"], "Count": [tmrna_count], "contig": [contig]} - ) + # only if not skipped + if self.skip_extra_annotations is False: + # add other features + trna_row = pd.DataFrame( + {"Description": ["tRNAs"], "Count": [trna_count], "contig": [contig]} + ) + crispr_row = pd.DataFrame( + { + "Description": ["CRISPRs"], + "Count": [crispr_count], + "contig": [contig], + } + ) + tmrna_row = pd.DataFrame( + {"Description": ["tmRNAs"], "Count": [tmrna_count], "contig": [contig]} + ) + vfdb_row = pd.DataFrame( { "Description": ["VFDB_Virulence_Factors"], @@ -1626,9 +1809,10 @@ def create_txt(self): ] = cds_coding_density # eappend it all to combo_list combo_list.append(cds_df) - combo_list.append(trna_row) - combo_list.append(crispr_row) - combo_list.append(tmrna_row) + if self.skip_extra_annotations is False: + combo_list.append(trna_row) + combo_list.append(crispr_row) + combo_list.append(tmrna_row) combo_list.append(vfdb_row) combo_list.append(CARD_row) @@ -1917,7 +2101,7 @@ def create_mmseqs_tophits(out_dir): ##mmseqs mmseqs_file = os.path.join(out_dir, "mmseqs_results.tsv") - logger.info("Processing mmseqs2 outputs.") + logger.info("Processing MMseqs2 outputs.") logger.info("Processing PHROGs output.") col_list = [ "mmseqs_phrog", @@ -1935,40 +2119,23 @@ def create_mmseqs_tophits(out_dir): mmseqs_df = pd.read_csv( mmseqs_file, delimiter="\t", index_col=False, names=col_list ) - # get list of genes - genes = mmseqs_df.gene.unique() - - # instantiate tophits list - tophits = [] - - for gene in genes: - tmp_df = ( - mmseqs_df.loc[mmseqs_df["gene"] == gene] - .sort_values("mmseqs_eVal") - .reset_index(drop=True) - .loc[0] - ) - tophits.append( - [ - tmp_df.mmseqs_phrog, - tmp_df.gene, - tmp_df.mmseqs_alnScore, - tmp_df.mmseqs_seqIdentity, - tmp_df.mmseqs_eVal, - ] - ) + # optimise the tophits generation + # Group by 'gene' and find the top hit for each group + tophits_df = mmseqs_df.groupby('gene', 
group_keys=True).apply(lambda group: group.nsmallest(1, 'mmseqs_eVal')).reset_index(drop=True) + + # create tophits df - tophits_df = pd.DataFrame( - tophits, - columns=[ - "mmseqs_phrog", - "gene", - "mmseqs_alnScore", - "mmseqs_seqIdentity", - "mmseqs_eVal", - ], - ) + tophits_df = tophits_df[[ + "mmseqs_phrog", + "gene", + "mmseqs_alnScore", + "mmseqs_seqIdentity", + "mmseqs_eVal", + ]] + + + tophits_df.to_csv( os.path.join(out_dir, "top_hits_mmseqs.tsv"), sep="\t", index=False ) @@ -2010,6 +2177,10 @@ def remove_post_processing_files(out_dir, gene_predictor, meta): remove_file(os.path.join(out_dir, "phanotate_out.txt")) if gene_predictor == "prodigal": remove_file(os.path.join(out_dir, "prodigal_out.gff")) + remove_file(os.path.join(out_dir, "prodigal_out_aas_tmp.fasta")) + elif gene_predictor == "prodigal-gv": + remove_file(os.path.join(out_dir, "prodigal-gv_out.gff")) + remove_file(os.path.join(out_dir, "prodigal-gv_out_aas_tmp.fasta")) # delete the tmp meta files if meta == True: remove_directory(os.path.join(out_dir, "input_split_tmp/")) @@ -2150,32 +2321,23 @@ def process_vfdb_results(out_dir, merged_df): touch_file(vfdb_file) vfdb_df = pd.read_csv(vfdb_file, delimiter="\t", index_col=False, names=col_list) - genes = vfdb_df.gene.unique() - # get top hit - tophits = [] - for gene in genes: - tmp_df = ( - vfdb_df.loc[vfdb_df["gene"] == gene] - .sort_values("vfdb_eVal") - .reset_index(drop=True) - .loc[0] - ) - tophits.append( - [ - tmp_df.vfdb_hit, - tmp_df.gene, - tmp_df.vfdb_alnScore, - tmp_df.vfdb_seqIdentity, - tmp_df.vfdb_eVal, - ] - ) + # optimise the tophits generation + # Group by 'gene' and find the top hit for each group + tophits_df = vfdb_df.groupby('gene', group_keys=True).apply(lambda group: group.nsmallest(1, 'vfdb_eVal')).reset_index(drop=True) + + + + # create tophits df + tophits_df = tophits_df[[ + "vfdb_hit", + "gene", + "vfdb_alnScore", + "vfdb_seqIdentity", + "vfdb_eVal", + ]] - tophits_df = pd.DataFrame( - tophits, - 
columns=["vfdb_hit", "gene", "vfdb_alnScore", "vfdb_seqIdentity", "vfdb_eVal"], - ) # left join vfdb to merged_df tophits_df["gene"] = tophits_df["gene"].astype(str) @@ -2254,32 +2416,21 @@ def process_card_results(out_dir, merged_df, db_dir): ] touch_file(card_file) card_df = pd.read_csv(card_file, delimiter="\t", index_col=False, names=col_list) - genes = card_df.gene.unique() - # get top hit - tophits = [] + # + tophits_df = card_df.groupby('gene', group_keys=True).apply(lambda group: group.nsmallest(1, 'CARD_eVal')).reset_index(drop=True) - for gene in genes: - tmp_df = ( - card_df.loc[card_df["gene"] == gene] - .sort_values("CARD_eVal") - .reset_index(drop=True) - .loc[0] - ) - tophits.append( - [ - tmp_df.CARD_hit, - tmp_df.gene, - tmp_df.CARD_alnScore, - tmp_df.CARD_seqIdentity, - tmp_df.CARD_eVal, - ] - ) + - tophits_df = pd.DataFrame( - tophits, - columns=["CARD_hit", "gene", "CARD_alnScore", "CARD_seqIdentity", "CARD_eVal"], - ) + # create tophits df + tophits_df = tophits_df[[ + "CARD_hit", + "gene", + "CARD_alnScore", + "CARD_seqIdentity", + "CARD_eVal", + ]] + # left join tophits_df to merged_df tophits_df["gene"] = tophits_df["gene"].astype(str) diff --git a/bin/processes.py b/bin/processes.py index 96bf2ef..8aa1f11 100644 --- a/bin/processes.py +++ b/bin/processes.py @@ -1,9 +1,11 @@ +import multiprocessing.pool import os import subprocess as sp from datetime import datetime import pandas as pd import pyrodigal +import pyrodigal_gv from BCBio import GFF from Bio import SeqIO from Bio.Seq import Seq @@ -12,6 +14,41 @@ from loguru import logger from util import remove_directory + +def run_pyrodigal_gv(filepath_in, out_dir, threads): + """ + Gets CDS using pyrodigal_gv + :param filepath_in: input filepath + :param out_dir: output directory + :param logger logger + :param meta Boolean - metagenomic mode flag + :param coding_table coding table for prodigal (default 11) + :return: + """ + + # true + orf_finder = pyrodigal_gv.ViralGeneFinder(meta=True) + + 
def _find_genes(record): + genes = orf_finder.find_genes(str(record.seq)) + return (record.id, genes) + + with multiprocessing.pool.ThreadPool(threads) as pool: + with open(os.path.join(out_dir, "prodigal-gv_out.gff"), "w") as gff: + with open(os.path.join(out_dir, "prodigal-gv_out_tmp.fasta"), "w") as dst: + with open( + os.path.join(out_dir, "prodigal-gv_out_aas_tmp.fasta"), "w" + ) as aa_fasta: + records = SeqIO.parse(filepath_in, "fasta") + for record_id, genes in pool.imap(_find_genes, records): + genes.write_gff( + gff, sequence_id=record_id, include_translation_table=True + ) + genes.write_genes(dst, sequence_id=record_id) + # need to write the translation + genes.write_translations(aa_fasta, sequence_id=record_id) + + ##### phanotate meta mode ######## @@ -254,7 +291,7 @@ def run_phanotate(filepath_in, out_dir, logdir): logger.error("Error with Phanotate\n") -def run_pyrodigal(filepath_in, out_dir, meta, coding_table): +def run_pyrodigal(filepath_in, out_dir, meta, coding_table, threads): """ Gets CDS using pyrodigal :param filepath_in: input filepath @@ -262,6 +299,7 @@ def run_pyrodigal(filepath_in, out_dir, meta, coding_table): :param logger logger :param meta Boolean - metagenomic mode flag :param coding_table coding table for prodigal (default 11) + :param threads: threads :return: """ @@ -270,25 +308,52 @@ def run_pyrodigal(filepath_in, out_dir, meta, coding_table): prodigal_metamode = True logger.info("Prodigal Meta Mode Enabled") - # for training if you want different coding table - seqs = [bytes(record.seq) for record in SeqIO.parse(filepath_in, "fasta")] - record = SeqIO.parse(filepath_in, "fasta") - orf_finder = pyrodigal.OrfFinder(meta=prodigal_metamode) - - # coding table possible if false - if prodigal_metamode == False: - trainings_info = orf_finder.train(*seqs, translation_table=int(coding_table)) - orf_finder = pyrodigal.OrfFinder(trainings_info, meta=prodigal_metamode) - - with open(os.path.join(out_dir, "prodigal_out.gff"), "w") as dst: - 
for i, record in enumerate(SeqIO.parse(filepath_in, "fasta")): - genes = orf_finder.find_genes(str(record.seq)) - genes.write_gff(dst, sequence_id=record.id) - - with open(os.path.join(out_dir, "prodigal_out_tmp.fasta"), "w") as dst: - for i, record in enumerate(SeqIO.parse(filepath_in, "fasta")): - genes = orf_finder.find_genes(str(record.seq)) - genes.write_genes(dst, sequence_id=record.id) + ####################################################### + # if under 20000, pyrodigal will only work in meta mode + # https://github.com/hyattpd/prodigal/wiki/Advice-by-Input-Type#plasmids-phages-viruses-and-other-short-sequences + # https://github.com/hyattpd/Prodigal/issues/51 + # so make sure of this + ####################################################### + + # get total length of input + total_length = 0 + + with open(filepath_in, "r") as handle: + for record in SeqIO.parse(handle, "fasta"): + total_length += len(record.seq) + + # if the length is 100000 or under, use meta mode by default + if total_length < 100001: + orf_finder = pyrodigal.GeneFinder(meta=True) + # otherwise train it + # recommend pyrodigal-gv anyway + else: + # for training if you want different coding table + seqs = [bytes(record.seq) for record in SeqIO.parse(filepath_in, "fasta")] + record = SeqIO.parse(filepath_in, "fasta") + orf_finder = pyrodigal.GeneFinder(meta=prodigal_metamode) + + # make coding table possible if false + if prodigal_metamode == False: + orf_finder.train(*seqs, translation_table=int(coding_table)) + + # define for the multithreadpool + def _find_genes(record): + genes = orf_finder.find_genes(str(record.seq)) + return (record.id, genes) + + with multiprocessing.pool.ThreadPool(threads) as pool: + with open(os.path.join(out_dir, "prodigal_out.gff"), "w") as gff: + with open(os.path.join(out_dir, "prodigal_out_tmp.fasta"), "w") as dst: + with open( + os.path.join(out_dir, "prodigal_out_aas_tmp.fasta"), "w" + ) as aa_fasta: + records = SeqIO.parse(filepath_in, "fasta") + for 
record_id, genes in pool.imap(_find_genes, records): + genes.write_gff(gff, sequence_id=record_id) + genes.write_genes(dst, sequence_id=record_id) + # need to write the translation + genes.write_translations(aa_fasta, sequence_id=record_id) def tidy_phanotate_output(out_dir): @@ -319,13 +384,19 @@ def tidy_phanotate_output(out_dir): return phan_df -def tidy_prodigal_output(out_dir): +def tidy_prodigal_output(out_dir, gv_flag): """ Tidies prodigal output :param out_dir: output directory + :param gv_flag: if prodigal-gv, then True :return: prod_filt_df pandas dataframe """ - prod_file = os.path.join(out_dir, "prodigal_out.gff") + if gv_flag is True: + prefix = "prodigal-gv" + else: + prefix = "prodigal" + + prod_file = os.path.join(out_dir, f"{prefix}_out.gff") col_list = [ "contig", "prod", @@ -367,7 +438,7 @@ def tidy_prodigal_output(out_dir): + prod_filt_df["stop"].astype(str) ) prod_filt_df.to_csv( - os.path.join(out_dir, "cleaned_prodigal.tsv"), sep="\t", index=False + os.path.join(out_dir, f"cleaned_{prefix}.tsv"), sep="\t", index=False ) return prod_filt_df @@ -479,14 +550,17 @@ def translate_fastas(out_dir, gene_predictor, coding_table, genbank_file): if gene_predictor == "phanotate": clean_df = tidy_phanotate_output(out_dir) elif gene_predictor == "prodigal": - clean_df = tidy_prodigal_output(out_dir) + clean_df = tidy_prodigal_output(out_dir, False) # gv_flag is false + elif gene_predictor == "prodigal-gv": + clean_df = tidy_prodigal_output(out_dir, True) # gv_flag is true elif gene_predictor == "genbank": clean_df = tidy_genbank_output(out_dir, genbank_file, coding_table) - fasta_input_tmp = gene_predictor + "_out_tmp.fasta" fasta_output_aas_tmp = gene_predictor + "_aas_tmp.fasta" - if gene_predictor != "genbank": + if gene_predictor == "phanotate": + # read the nucl fasta + fasta_input_tmp = gene_predictor + "_out_tmp.fasta" # translate for temporary AA output with open(os.path.join(out_dir, fasta_output_aas_tmp), "w") as aa_fa: i = 0 @@ -504,6 +578,26 @@ 
def translate_fastas(out_dir, gene_predictor, coding_table, genbank_file): ) SeqIO.write(aa_record, aa_fa, "fasta") i += 1 + elif gene_predictor == "prodigal-gv" or gene_predictor == "prodigal": + # read in the AA file instead and parse that to clean the header + fasta_input_tmp = gene_predictor + "_out_aas_tmp.fasta" + with open(os.path.join(out_dir, fasta_output_aas_tmp), "w") as aa_fa: + i = 0 + for dna_record in SeqIO.parse( + os.path.join(out_dir, fasta_input_tmp), "fasta" + ): + dna_header = str(clean_df["contig"].iloc[i]) + str(i) + dna_description = ( + str(clean_df["start"].iloc[i]) + "_" + str(clean_df["stop"].iloc[i]) + ) + aa_record = SeqRecord( + dna_record.seq, + id=dna_header, + description=dna_description, + ) + SeqIO.write(aa_record, aa_fa, "fasta") + i += 1 + # for genbank do nothing def run_trna_scan(filepath_in, threads, out_dir, logdir): @@ -529,7 +623,7 @@ def run_trna_scan(filepath_in, threads, out_dir, logdir): try: ExternalTool.run_tool(trna) except: - logger.error("Error: tRNAscan-SE not found\n") + logger.error("Error with tRNAscan-SE") return 0 @@ -631,53 +725,51 @@ def run_mmseqs(db_dir, out_dir, threads, logdir, gene_predictor, evalue, db_name remove_directory(target_db_dir) -def convert_gff_to_gbk(filepath_in, input_dir, out_dir, prefix, coding_table): +def convert_gff_to_gbk(filepath_in, input_dir, out_dir, prefix, prot_seq_df): """ Converts the gff to genbank :param filepath_in: input fasta file - :param input_dir: input directory of the gff. same as output_dir for the overall gff, diff for meta mode + :param input_dir: input directory of the gff. same as output_dir for the overall gff in normal mode, different for meta mode :param out_dir: output directory of the gbk :param prefix: prefix + :param prot_seq_df: dataframe from the pharok object with gene name + protein sequence for all genes (from create_gff()). 
:return: """ - gff_file = os.path.join(input_dir, prefix + ".gff") - gbk_file = os.path.join(out_dir, prefix + ".gbk") + + gff_file = os.path.join(input_dir, f"{prefix}.gff") + gbk_file = os.path.join(out_dir, f"{prefix}.gbk") + with open(gbk_file, "wt") as gbk_handler: fasta_handler = SeqIO.to_dict(SeqIO.parse(filepath_in, "fasta")) for record in GFF.parse(gff_file, fasta_handler): + # sequence in each contig (record) + subset_seqs_df = prot_seq_df.loc[prot_seq_df["contig"] == record.id] + # get all the seqs in the contigs - and drop the index to reset for 0 indexed loop + subset_seqs = subset_seqs_df["sequence"].reset_index(drop=True) + # start the loop + i = 0 + # instantiate record record.annotations["molecule_type"] = "DNA" record.annotations["date"] = datetime.today() record.annotations["topology"] = "linear" - record.annotations["data_file_division"] = "VRL" + record.annotations[ + "data_file_division" + ] = "PHG" # https://github.com/RyanCook94/inphared/issues/22 # add features to the record for feature in record.features: # add translation only if CDS if feature.type == "CDS": + # aa = prot_records[i].seq if feature.strand == 1: feature.qualifiers.update( - { - "translation": Seq.translate( - record.seq[ - feature.location.start.position : feature.location.end.position - ], - to_stop=True, - table=coding_table, - ) - } + {"translation": subset_seqs[i]} # from the aa seq ) else: # reverse strand -1 needs reverse compliment feature.qualifiers.update( - { - "translation": Seq.translate( - record.seq[ - feature.location.start.position : feature.location.end.position - ].reverse_complement(), - to_stop=True, - table=coding_table, - ) - } + {"translation": subset_seqs[i]} # from the aa seq ) + i += 1 SeqIO.write(record, gbk_handler, "genbank") diff --git a/bin/run_pyrodigal_gv.py b/bin/run_pyrodigal_gv.py new file mode 100644 index 0000000..96b2795 --- /dev/null +++ b/bin/run_pyrodigal_gv.py @@ -0,0 +1,35 @@ +import os + +import pyrodigal_gv +from Bio import 
SeqIO + +# from Bio.Seq import Seq +# from Bio.SeqRecord import SeqRecord +# from external_tools import ExternalTool +# from loguru import logger +# from util import remove_directory + + +def run_pyrodiga_gv(filepath_in, out_dir, coding_table): + """ + Gets CDS using pyrodigal_gv + :param filepath_in: input filepath + :param out_dir: output directory + :param logger logger + :param meta Boolean - metagenomic mode flag + :param coding_table coding table for prodigal (default 11) + :return: + """ + + # true + orf_finder = pyrodigal_gv.ViralGeneFinder(meta=True) + + with open(os.path.join(out_dir, "prodigal_out.gff"), "w") as dst: + for i, record in enumerate(SeqIO.parse(filepath_in, "fasta")): + genes = orf_finder.find_genes(str(record.seq)) + genes.write_gff(dst, sequence_id=record.id) + + with open(os.path.join(out_dir, "prodigal_out_tmp.fasta"), "w") as dst: + for i, record in enumerate(SeqIO.parse(filepath_in, "fasta")): + genes = orf_finder.find_genes(str(record.seq)) + genes.write_genes(dst, sequence_id=record.id) diff --git a/bin/version.py b/bin/version.py index bf25615..5b60188 100644 --- a/bin/version.py +++ b/bin/version.py @@ -1 +1 @@ -__version__ = "1.4.1" +__version__ = "1.5.0" diff --git a/build/build.sh b/build/build.sh deleted file mode 100644 index 36241a3..0000000 --- a/build/build.sh +++ /dev/null @@ -1,8 +0,0 @@ -#!/bin/sh -set -e - -mkdir -p "${PREFIX}/bin" -mkdir -p "${PREFIX}/bin/modules" - -cp -r bin/* "${PREFIX}/bin/" -cp -r bin/modules/* "${PREFIX}/bin/modules" diff --git a/build/meta.yaml b/build/meta.yaml deleted file mode 100644 index 983ef57..0000000 --- a/build/meta.yaml +++ /dev/null @@ -1,49 +0,0 @@ -{% set version = "1.3.0" %} -{% set name = "pharokka" %} -{% set sha256 = "17055ecc532dbc18908ba2d6da5675ce7c08b3ab2767d4def7de9935281bc4df" %} -{% set user = "gbouras13" %} - -package: - name: {{ name }} - version: {{ version }} - -build: - number: 0 - noarch: python - -source: - url: https://github.com/{{ user }}/{{ name 
}}/archive/refs/tags/v{{ version }}.tar.gz - sha256: {{ sha256 }} - -requirements: - run: - - bcbio-gff - - biopython >=1.78,<1.81 - - phanotate >=1.5.0 - - mmseqs2 ==13.45111 - - trnascan-se >=2.0.9 - - minced >=0.4.2 - - aragorn >=1.2.41 - - mash >=2.2 - - pyrodigal >=2.0.1 - - pycirclize >=0.3.1 - - dnaapler >= 0.3.0 - -test: - commands: - - install_databases.py -h - - pharokka.py -h - - pharokka_plotter.py -h - - pharokka_proteins.py -h - -about: - home: https://github.com/gbouras13/pharokka - license: MIT - license_file: LICENSE - summary: "Fast Phage Annotation Program" - dev_url: https://github.com/gbouras13/pharokka - doc_url: https://pharokka.readthedocs.io - -extra: - recipe-maintainers: - - gbouras13 diff --git a/docs/benchmarking.md b/docs/benchmarking.md index 4d04e21..55ca0e4 100644 --- a/docs/benchmarking.md +++ b/docs/benchmarking.md @@ -67,4 +67,20 @@ The 673 crAss-like genomes were run with `-m` (defaults to `--mmseqs2_only` in v | Annotated Function CDS | **16713** | 9150 | 9150 | | Unknown Function CDS | 75286 | 82849 | 82849 | - \ No newline at end of file + +# Benchmarking v1.5.0 + +`pharokka v1.5.0` was run on the 673 crAss phage dataset to showcase the improved CDS prediction of `-g prodigal-gv` for metagenomic datasets where some phages likely have alternative genetic codes. + +All benchmarking was conducted on an Intel® Core™ i7-10700K CPU @ 3.80GHz on a machine running Ubuntu 20.04.6 LTS with 8 threads (`-t 8`). `pyrodigal-gv v0.1.0` and `pyrodigal v3.0.0` were used respectively with `--fast`. 
+ +| 673 crAss-like genomes | `pharokka` v1.5.0 `-g prodigal-gv` | `pharokka` v1.5.0 `-g prodigal` | +|------------------------|------------------------------------|----------------------------------| +| Total CDS | 81730 | 91999 | +| Annotated Function CDS | **20344** | 17458 | +| Unknown Function CDS | 61386 | 74541 | +| Contigs with genetic code 15 | 229 | NA | +| Contigs with genetic code 4 | 38 | NA | +| Contigs with genetic code 11 | 406 | 673 | + +Fewer, larger CDS were predicted more accurately, leading to an increase in the number of coding sequences with annotated functions. Approximately 40% of contigs in this dataset were predicted to use non-standard genetic codes according to `pyrodigal-gv`. \ No newline at end of file diff --git a/docs/citation.md b/docs/citation.md index df6d077..0ce2144 100644 --- a/docs/citation.md +++ b/docs/citation.md @@ -7,7 +7,7 @@ If you use pharokka, I would recommend a citation in your manuscript along the l * All phages were annotated with Pharokka v ___ (Bouras, et al. 2023). Specifically, coding sequences (CDS) were predicted with PHANOTATE (McNair, et al. 2019), tRNAs were predicted with tRNAscan-SE 2.0 (Chan, et al. 2021), tmRNAs were predicted with Aragorn (Laslett, et al. 2004) and CRISPRs were predicted with CRT (Bland, et al. 2007). Functional annotation was generated by matching each CDS to the PHROGs (Terzian, et al. 2021), VFDB (Chen, et al. 2005) and CARD (Alcock, et al. 2020) databases using MMseqs2 (Steinegger, et al. 2017) and PyHMMER (Larralde and Zeller 2023). Contigs were matched to their closest hit in the INPHARED database (Cook, et al. 2021) using mash (Ondov, et al. 2016). Plots were created with pyCirclize (Shimoyama 2022). -With the following full citations for the constituent tools below: +With the following full citations for the constituent tools below where relevant: * Cook R, Brown N, Redgwell T, Rihtman B, Barnes M, Clokie M, Stekel DJ, Hobman JL, Jones MA, Millard A. 
INfrastructure for a PHAge REference Database: Identification of Large-Scale Biases in the Current Collection of Cultured Phage Genomes. PHAGE. 2021. Available from: http://doi.org/10.1089/phage.2021.0007. * McNair K., Zhou C., Dinsdale E.A., Souza B., Edwards R.A. (2019) "PHANOTATE: a novel approach to gene identification in phage genomes", Bioinformatics, https://doi.org/10.1093/bioinformatics/btz26. @@ -21,4 +21,5 @@ With the following full citations for the constituent tools below: * Alcock et al, "CARD 2020: antibiotic resistome surveillance with the comprehensive antibiotic resistance database." Nucleic Acids Research (2020) https://doi.org/10.1093/nar/gkz935. * Larralde, M., (2022). Pyrodigal: Python bindings and interface to Prodigal, an efficient method for gene prediction in prokaryotes. Journal of Open Source Software, 7(72), 4296. doi:10.21105/joss.04296. * Larralde M., Zeller G., (2023). PyHMMER: a Python library binding to HMMER for efficient sequence analysis, Bioinformatics, Volume 39, Issue 5, May 2023, btad214, https://doi.org/10.1093/bioinformatics/btad214. +* Larralde M. and Camargo A., (2023) Pyrodigal-gv: A Pyrodigal extension to predict genes in giant viruses and viruses with alternative genetic code. https://github.com/althonos/pyrodigal-gv. * Shimoyama, Y. (2022). pyCirclize: Circular visualization in Python [Computer software]. https://github.com/moshi4/pyCirclize \ No newline at end of file diff --git a/docs/index.md b/docs/index.md index 46ee94b..f1624f3 100644 --- a/docs/index.md +++ b/docs/index.md @@ -1,17 +1,17 @@ +# `pharokka` + `pharokka` is a fast phage annotation pipeline. ![Image](pharokka_logo.png) -`pharokka` uses PHANOTATE (McNair et al 2019) to conduct gene prediction, tRNAscan-SE 2 (Chan et al 2021) to call tRNAs, MinCED (Bland et al 2007) to detect CRISPRs and Aragorn (Laslett et al 2004) to detect tmRNAs. 
There is also the option to specify Prodigal (Hyatt et al 2010) implemented with Pyrodigal (Larralde, 2022) instead of PHANOTATE. - -`pharokka` then uses the lightweight PHROGS database (Terzian et al 2021) for functional annotation of all predicted CDSs using MMseqs2 (Steinegger et al 2017), and as of v1.4.0, PyHMMER (Larralde and Zeller 2023) for more sensitive annotations. `pharokka` also matches each predicted CDS against the VFDB (Chen et al 2005) and CARD (Alcock et al 2020) databases to predict virulence factors and antimicrobial resistance, respectively. - -For more information, please read the `pharokka` manuscript: +## Overview -George Bouras, Roshan Nepal, Ghais Houtak, Alkis James Psaltis, Peter-John Wormald, Sarah Vreugde, Pharokka: a fast scalable bacteriophage annotation tool, Bioinformatics, Volume 39, Issue 1, January 2023, btac776, [https://doi.org/10.1093/bioinformatics/btac776](https://doi.org/10.1093/bioinformatics/btac776) +`pharokka` uses [PHANOTATE](https://github.com/deprekate/PHANOTATE), the only gene prediction program tailored to bacteriophages, as the default program for gene prediction. [Prodigal](https://github.com/hyattpd/Prodigal) implemented with [pyrodigal](https://github.com/althonos/pyrodigal) and [Prodigal-gv](https://github.com/apcamargo/prodigal-gv) implemented with [pyrodigal-gv](https://github.com/althonos/pyrodigal-gv) are also available as alternatives. Following this, functional annotations are assigned by matching each predicted coding sequence (CDS) to the [PHROGs](https://phrogs.lmge.uca.fr), [CARD](https://card.mcmaster.ca) and [VFDB](http://www.mgc.ac.cn/VFs/main.htm) databases using [MMseqs2](https://github.com/soedinglab/MMseqs2). As of v1.4.0, `pharokka` will also match each CDS to the PHROGs database using more sensitive Hidden Markov Models using [PyHMMER](https://github.com/althonos/pyhmmer). 
Pharokka's main output is a GFF file suitable for using in downstream pangenomic pipelines like [Roary](https://sanger-pathogens.github.io/Roary/). `pharokka` also generates a `cds_functions.tsv` file, which includes counts of CDSs, tRNAs, tmRNAs, CRISPRs and functions assigned to CDSs according to the PHROGs database. See the full [usage](#usage) and check out the full [documentation](https://pharokka.readthedocs.io) for more details. ![Image](pharokka_workflow.png) +## Manuscript +For more information, please read the `pharokka` manuscript: - +George Bouras, Roshan Nepal, Ghais Houtak, Alkis James Psaltis, Peter-John Wormald, Sarah Vreugde, Pharokka: a fast scalable bacteriophage annotation tool, Bioinformatics, Volume 39, Issue 1, January 2023, btac776, [https://doi.org/10.1093/bioinformatics/btac776](https://doi.org/10.1093/bioinformatics/btac776) diff --git a/docs/install.md b/docs/install.md index 0857060..112fbb3 100644 --- a/docs/install.md +++ b/docs/install.md @@ -53,7 +53,7 @@ pharokka.py --help # Database Installation -* **Note v 1.4.0 implements a new database with PHROGs HMM profiles. You will need to update the Pharokka database to use v1.4.0** +* **Note v 1.4.0 implements a new database with PHROGs HMM profiles. You will need to update the Pharokka database to use v1.4.0 and higher** To install the pharokka database to the default directory: diff --git a/docs/output.md b/docs/output.md index dee51e4..a3d6bc7 100644 --- a/docs/output.md +++ b/docs/output.md @@ -18,7 +18,7 @@ Other Files * A `_cds_functions.tsv` file, which includes counts of CDSs, tRNAs, CRISPRs and tmRNAs and functions assigned to CDSs according to the PHROGs database. -* A `_length_gc_cds_density.tsv` file, which outputs the phage's length, GC percentage and CDS coding density. +* A `_length_gc_cds_density.tsv` file, which outputs the phage's length, GC percentage, translation table and CDS coding density. 
* `phanotate.ffn` or `prodigal.ffn` which will hold all nucleotide sequences of predicted CDSs. diff --git a/docs/pharokka_workflow.png b/docs/pharokka_workflow.png index 3caf568..6b86751 100644 Binary files a/docs/pharokka_workflow.png and b/docs/pharokka_workflow.png differ diff --git a/docs/run.md b/docs/run.md index 86bb718..868a8b5 100644 --- a/docs/run.md +++ b/docs/run.md @@ -50,12 +50,32 @@ To use Prodigal (pyrodigal) gene predictions instead of PHANOTATE use `-g prodig pharokka.py -i -o -d -t -g prodigal ``` -If you are annotating more than 1 contig, it is recommended that you run pharokka in meta mode using the `-m` flag, which will enable pharokka to finish faster by making full use of all available threads when running PHANOTATE and tRNAscan-SE 2. +To use Prodigal-gv (pyrodigal-gv) gene predictions instead of PHANOTATE use `-g prodigal-gv`. This is recommended for metagenomic datasets where some phages likely have [alternate genetic codes](https://github.com/apcamargo/prodigal-gv). ``` -pharokka.py -i -o -d -t -m +pharokka.py -i -o -d -t -g prodigal-gv ``` +If you are annotating more than 1 contig, it is recommended that you run pharokka in meta mode using the `-m` flag, which will enable pharokka to finish faster by making full use of all available threads when running tRNAscan-SE 2. + +``` +pharokka.py -i -o -d -t -m -g prodigal-gv +``` + +As of v1.5.0, you can skip running mash to find the closest match for each contig in INPHARED using `--skip_mash`. + +``` +pharokka.py -i -o -d -t --skip_mash +``` + +As of v1.5.0, you can skip running tRNAscan-SE 2, MinCED and Aragorn using `--skip_extra_annotations`. + +``` +pharokka.py -i -o -d -t --skip_extra_annotations +``` + + + ## Advanced Parameters As of v1.4.0, `pharokka` will automatically run MMseqs2 (PHROGs, CARD, VFDB) and PyHMMER (PHROGs). To turn off PyHMMER, use `--mmseqs2_only`. 
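For anyone scripting these runs, the flags documented in this hunk can be assembled programmatically. The following is only a minimal sketch: the helper name `build_pharokka_command` and the file and directory names are hypothetical and not part of `pharokka`; only the flags themselves (`-i`, `-o`, `-d`, `-t`, `-g`, `--skip_mash`, `--skip_extra_annotations`) come from the documentation in this diff.

```python
import shlex


def build_pharokka_command(infile, outdir, database, threads=1,
                           gene_predictor="prodigal-gv",
                           skip_mash=False, skip_extra_annotations=False):
    """Assemble a pharokka.py argument list (hypothetical helper, not part of pharokka)."""
    cmd = ["pharokka.py", "-i", infile, "-o", outdir, "-d", database,
           "-t", str(threads), "-g", gene_predictor]
    # Optional v1.5.0 flags, appended only when requested
    if skip_mash:
        cmd.append("--skip_mash")
    if skip_extra_annotations:
        cmd.append("--skip_extra_annotations")
    return cmd


# Placeholder paths, for illustration only
cmd = build_pharokka_command("phages.fasta", "pharokka_out", "pharokka_db",
                             threads=8, skip_mash=True)
print(shlex.join(cmd))
# To actually run it (requires pharokka on PATH):
# import subprocess; subprocess.run(cmd, check=True)
```

Building the command as a list rather than a single string avoids shell quoting problems when paths contain spaces.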
@@ -128,9 +148,9 @@ Of course, you can also use this functionality to reorient your phage however yo ``` -usage: pharokka.py [-h] [-i INFILE] [-o OUTDIR] [-d DATABASE] [-t THREADS] [-f] [-p PREFIX] [-l LOCUSTAG] [-g GENE_PREDICTOR] [-m] [-s] [-c CODING_TABLE] - [-e EVALUE] [--fast] [--mmseqs2_only] [--meta_hmm] [--dnaapler] [--custom_hmm CUSTOM_HMM] [--genbank] [--terminase] - [--terminase_strand TERMINASE_STRAND] [--terminase_start TERMINASE_START] [-V] [--citation] +usage: pharokka.py [-h] [-i INFILE] [-o OUTDIR] [-d DATABASE] [-t THREADS] [-f] [-p PREFIX] [-l LOCUSTAG] [-g GENE_PREDICTOR] [-m] [-s] [-c CODING_TABLE] [-e EVALUE] [--fast] [--mmseqs2_only] + [--meta_hmm] [--dnaapler] [--custom_hmm CUSTOM_HMM] [--genbank] [--terminase] [--terminase_strand TERMINASE_STRAND] [--terminase_start TERMINASE_START] + [--skip_extra_annotations] [--skip_mash] [-V] [--citation] pharokka: fast phage annotation program @@ -150,13 +170,13 @@ options: -l LOCUSTAG, --locustag LOCUSTAG User specified locus tag for the gff/gbk files. This is not required. A random locus tag will be generated instead. -g GENE_PREDICTOR, --gene_predictor GENE_PREDICTOR - User specified gene predictor. Use "-g phanotate" or "-g prodigal". + User specified gene predictor. Use "-g phanotate" or "-g prodigal" or "-g prodigal-gv" or "-g genbank". Defaults to phanotate (not required unless prodigal is desired). -m, --meta meta mode for metavirome input samples -s, --split split mode for metavirome samples. -m must also be specified. Will output separate split FASTA, gff and genbank files for each input contig. -c CODING_TABLE, --coding_table CODING_TABLE - translation table for prodigal. Defaults to 11. Experimental only. + translation table for prodigal. Defaults to 11. -e EVALUE, --evalue EVALUE E-value threshold for MMseqs2 database PHROGs, VFDB and CARD and PyHMMER PHROGs database search. Defaults to 1E-05. --fast, --hmm_only Runs PyHMMER (HMMs) with PHROGs only, not MMseqs2 with PHROGs, CARD or VFDB. 
@@ -168,13 +188,17 @@ options: --custom_hmm CUSTOM_HMM Run pharokka with a custom HMM profile database suffixed .h3m. Please use create this with the create_custom_hmm.py script. - --genbank Flag denoting that -i/--input is a genbank file instead of the usual FASTA file + --genbank Flag denoting that -i/--input is a genbank file instead of the usual FASTA file. + The CDS calls in this file will be preserved and re-annotated. --terminase Runs terminase large subunit re-orientation mode. Single genome input only and requires --terminase_strand and --terminase_start to be specified. --terminase_strand TERMINASE_STRAND Strand of terminase large subunit. Must be "pos" or "neg". --terminase_start TERMINASE_START Start coordinate of the terminase large subunit. + --skip_extra_annotations + Skips tRNAscan-se, MINced and Aragorn. + --skip_mash Skips running mash to find the closest match for each contig in INPHARED. -V, --version Print pharokka Version --citation Print pharokka Citation ``` \ No newline at end of file diff --git a/environment.yml b/environment.yml index 11af98d..69fcb9f 100644 --- a/environment.yml +++ b/environment.yml @@ -12,8 +12,8 @@ dependencies: - minced >=0.4.2 - aragorn >=1.2.41 - mash >=2.2 - - dnaapler >=0.3.0 - - pyrodigal >=2.0.1 + - dnaapler >=0.3.2 + - pyrodigal >=3.0.0 - pycirclize >=0.3.1 - alive-progress >=3.0.1 - requests >=2.25.1 diff --git a/img/pharokka_workflow.png b/img/pharokka_workflow.png index 3caf568..6b86751 100644 Binary files a/img/pharokka_workflow.png and b/img/pharokka_workflow.png differ diff --git a/setup.py b/setup.py index cd7fb08..1150cac 100644 --- a/setup.py +++ b/setup.py @@ -30,7 +30,7 @@ def package_files(directory): setup( name="Pharokka", - version="1.4.1", + version="1.5.0", author="George Bouras", author_email="george.bouras@adelaide.edu.au", description="Fast phage annotation tool", @@ -84,7 +84,6 @@ def package_files(directory): "pyyaml>=6.0", "pandas>=1.4.2", "biopython>=1.76", - "pyrodigal>=2.0.0", 
"pyhmmer>=0.10.0", "black>=22.3.0", "isort>=5.10.1", @@ -92,6 +91,8 @@ def package_files(directory): "pytest-cov>=3.0.0", "alive-progress>=3.0.1", "requests>=2.25.1", - "bcbio-gff >=0.7.0", + "bcbio-gff>=0.7.0", + "pyrodigal>=3.0.0", + "pyrodigal_gv>=0.1.0" ], ) diff --git a/tests/test_data/overall/Standard_examples/SAOMS1_Output/prodigal-gv_out.gff b/tests/test_data/overall/Standard_examples/SAOMS1_Output/prodigal-gv_out.gff new file mode 100644 index 0000000..9857c34 --- /dev/null +++ b/tests/test_data/overall/Standard_examples/SAOMS1_Output/prodigal-gv_out.gff @@ -0,0 +1,215 @@ +##gff-version 3 +# Sequence Data: seqnum=1;seqlen=140135;seqhdr="MW460250_1" +# Model Data: version=pyrodigal.v3.0.0;run_type=Metagenomic;model="59|Gut_phage_code_11c|V|29.9|11|1";gc_cont=29.95;transl_table=11;uses_sd=1 +MW460250_1 pyrodigal_v3.0.0 CDS 183 392 31.0 - 0 ID=MW460250_1_1;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.267;transl_table=11;conf=99.92;score=30.98;cscore=23.88;sscore=7.10;rscore=9.44;uscore=-4.25;tscore=1.91; +MW460250_1 pyrodigal_v3.0.0 CDS 405 737 34.8 - 0 ID=MW460250_1_2;partial=00;start_type=TTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.306;transl_table=11;conf=99.97;score=34.78;cscore=45.24;sscore=-10.46;rscore=8.03;uscore=-7.85;tscore=-10.64; +MW460250_1 pyrodigal_v3.0.0 CDS 750 1076 25.4 - 0 ID=MW460250_1_3;partial=00;start_type=TTG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.306;transl_table=11;conf=99.71;score=25.42;cscore=17.96;sscore=7.46;rscore=12.88;uscore=5.21;tscore=-10.64; +MW460250_1 pyrodigal_v3.0.0 CDS 1636 1902 37.4 + 0 ID=MW460250_1_4;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.292;transl_table=11;conf=99.98;score=37.43;cscore=36.35;sscore=1.08;rscore=0.03;uscore=-1.25;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 1880 2158 31.4 + 0 
ID=MW460250_1_5;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.294;transl_table=11;conf=99.93;score=31.41;cscore=22.58;sscore=8.83;rscore=6.74;uscore=-0.21;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 2155 2565 74.3 + 0 ID=MW460250_1_6;partial=00;start_type=TTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.299;transl_table=11;conf=100.00;score=74.32;cscore=72.41;sscore=1.91;rscore=8.03;uscore=4.52;tscore=-10.64; +MW460250_1 pyrodigal_v3.0.0 CDS 2580 2777 25.2 + 0 ID=MW460250_1_7;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.323;transl_table=11;conf=99.70;score=25.20;cscore=13.67;sscore=11.53;rscore=8.90;uscore=0.84;tscore=1.80; +MW460250_1 pyrodigal_v3.0.0 CDS 3071 4042 157.9 + 0 ID=MW460250_1_8;partial=00;start_type=TTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.256;transl_table=11;conf=100.00;score=157.89;cscore=155.63;sscore=2.26;rscore=8.03;uscore=4.87;tscore=-10.64; +MW460250_1 pyrodigal_v3.0.0 CDS 4183 5730 255.7 + 0 ID=MW460250_1_9;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.326;transl_table=11;conf=99.99;score=255.65;cscore=254.49;sscore=1.16;rscore=0.03;uscore=-1.18;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 5723 6544 156.9 + 0 ID=MW460250_1_10;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.279;transl_table=11;conf=100.00;score=156.88;cscore=167.78;sscore=-10.90;rscore=-11.85;uscore=-1.36;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 6531 6704 0.7 + 0 ID=MW460250_1_11;partial=00;start_type=GTG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.213;transl_table=11;conf=53.83;score=0.67;cscore=7.40;sscore=-6.73;rscore=0.02;uscore=1.74;tscore=-8.50; +MW460250_1 pyrodigal_v3.0.0 CDS 6701 7180 97.5 + 0 ID=MW460250_1_12;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.292;transl_table=11;conf=100.00;score=97.51;cscore=83.23;sscore=14.29;rscore=6.74;uscore=5.24;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 7222 8415 269.3 + 0 
ID=MW460250_1_13;partial=00;start_type=TTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.353;transl_table=11;conf=99.99;score=269.30;cscore=268.34;sscore=0.96;rscore=8.03;uscore=3.57;tscore=-10.64; +MW460250_1 pyrodigal_v3.0.0 CDS 8492 8842 9.3 + 0 ID=MW460250_1_14;partial=00;start_type=TTG;rbs_motif=GGxGG;rbs_spacer=5-10bp;gc_cont=0.234;transl_table=11;conf=89.56;score=9.35;cscore=11.23;sscore=-1.88;rscore=5.72;uscore=3.04;tscore=-10.64; +MW460250_1 pyrodigal_v3.0.0 CDS 8851 9231 40.8 + 0 ID=MW460250_1_15;partial=00;start_type=GTG;rbs_motif=None;rbs_spacer=None;gc_cont=0.281;transl_table=11;conf=99.99;score=40.76;cscore=55.18;sscore=-14.41;rscore=-11.85;uscore=3.25;tscore=-5.81; +MW460250_1 pyrodigal_v3.0.0 CDS 9235 10926 346.6 + 0 ID=MW460250_1_16;partial=00;start_type=TTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.315;transl_table=11;conf=99.99;score=346.56;cscore=351.06;sscore=-4.50;rscore=8.03;uscore=-1.89;tscore=-10.64; +MW460250_1 pyrodigal_v3.0.0 CDS 11120 11893 147.8 + 0 ID=MW460250_1_17;partial=00;start_type=TTG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.323;transl_table=11;conf=100.00;score=147.80;cscore=143.69;sscore=4.11;rscore=10.47;uscore=4.28;tscore=-10.64; +MW460250_1 pyrodigal_v3.0.0 CDS 11912 12868 279.2 + 0 ID=MW460250_1_18;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.322;transl_table=11;conf=99.99;score=279.18;cscore=262.35;sscore=16.83;rscore=11.41;uscore=3.12;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 12984 14375 288.7 + 0 ID=MW460250_1_19;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=11-12bp;gc_cont=0.372;transl_table=11;conf=99.99;score=288.72;cscore=282.32;sscore=6.40;rscore=-2.28;uscore=6.37;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 14467 14763 67.1 + 0 ID=MW460250_1_20;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.310;transl_table=11;conf=100.00;score=67.14;cscore=61.03;sscore=6.12;rscore=0.03;uscore=3.78;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 
CDS 14776 15684 168.2 + 0 ID=MW460250_1_21;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.355;transl_table=11;conf=100.00;score=168.17;cscore=161.31;sscore=6.86;rscore=0.03;uscore=4.52;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 15698 16576 158.5 + 0 ID=MW460250_1_22;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.319;transl_table=11;conf=100.00;score=158.46;cscore=142.42;sscore=16.05;rscore=12.88;uscore=0.86;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 16576 17196 99.7 + 0 ID=MW460250_1_23;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.320;transl_table=11;conf=100.00;score=99.68;cscore=90.78;sscore=8.90;rscore=10.47;uscore=-3.88;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 17215 18051 144.0 + 0 ID=MW460250_1_24;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=11-12bp;gc_cont=0.302;transl_table=11;conf=100.00;score=143.95;cscore=123.19;sscore=20.77;rscore=13.08;uscore=5.38;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 18053 18268 25.2 + 0 ID=MW460250_1_25;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.306;transl_table=11;conf=99.70;score=25.23;cscore=21.83;sscore=3.40;rscore=0.03;uscore=1.41;tscore=1.96; +MW460250_1 pyrodigal_v3.0.0 CDS 18295 20058 374.9 + 0 ID=MW460250_1_26;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.362;transl_table=11;conf=99.99;score=374.85;cscore=357.72;sscore=17.13;rscore=11.41;uscore=3.42;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 20131 20559 75.8 + 0 ID=MW460250_1_27;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.354;transl_table=11;conf=100.00;score=75.83;cscore=62.86;sscore=12.98;rscore=10.47;uscore=0.20;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 20656 20796 25.8 + 0 ID=MW460250_1_28;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.241;transl_table=11;conf=99.74;score=25.85;cscore=17.42;sscore=8.43;rscore=6.30;uscore=0.86;tscore=1.27; 
+MW460250_1 pyrodigal_v3.0.0 CDS 20839 21297 72.0 + 0 ID=MW460250_1_29;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.272;transl_table=11;conf=100.00;score=72.02;cscore=53.81;sscore=18.21;rscore=11.41;uscore=4.50;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 21310 21504 10.3 + 0 ID=MW460250_1_30;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.246;transl_table=11;conf=91.39;score=10.28;cscore=-1.00;sscore=11.28;rscore=9.90;uscore=0.11;tscore=1.77; +MW460250_1 pyrodigal_v3.0.0 CDS 21586 21897 73.0 + 0 ID=MW460250_1_31;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.260;transl_table=11;conf=100.00;score=72.98;cscore=52.87;sscore=20.11;rscore=11.41;uscore=6.40;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 22029 22487 94.9 + 0 ID=MW460250_1_32;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=11-12bp;gc_cont=0.305;transl_table=11;conf=100.00;score=94.88;cscore=89.98;sscore=4.89;rscore=-2.28;uscore=4.86;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 22531 23067 104.8 + 0 ID=MW460250_1_33;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.337;transl_table=11;conf=100.00;score=104.82;cscore=111.53;sscore=-6.71;rscore=-11.85;uscore=2.83;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 23120 27178 949.4 + 0 ID=MW460250_1_34;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.372;transl_table=11;conf=99.99;score=949.39;cscore=946.51;sscore=2.88;rscore=0.03;uscore=0.54;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 27257 29683 531.0 + 0 ID=MW460250_1_35;partial=00;start_type=ATG;rbs_motif=AGGA/GGAG/GAGG;rbs_spacer=11-12bp;gc_cont=0.331;transl_table=11;conf=99.99;score=530.97;cscore=518.30;sscore=12.66;rscore=3.91;uscore=6.44;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 29697 30584 177.5 + 0 
ID=MW460250_1_36;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.282;transl_table=11;conf=100.00;score=177.52;cscore=161.63;sscore=15.89;rscore=12.88;uscore=0.70;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 30584 33130 487.9 + 0 ID=MW460250_1_37;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.311;transl_table=11;conf=99.99;score=487.89;cscore=477.64;sscore=10.25;rscore=8.03;uscore=-0.08;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 33237 34028 170.9 + 0 ID=MW460250_1_38;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.332;transl_table=11;conf=100.00;score=170.92;cscore=154.49;sscore=16.43;rscore=11.41;uscore=2.72;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 34028 34552 76.0 + 0 ID=MW460250_1_39;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.290;transl_table=11;conf=100.00;score=75.98;cscore=66.08;sscore=9.90;rscore=10.47;uscore=-2.88;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 34552 35256 136.7 + 0 ID=MW460250_1_40;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.308;transl_table=11;conf=100.00;score=136.73;cscore=120.02;sscore=16.71;rscore=12.88;uscore=1.52;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 35271 36317 204.5 + 0 ID=MW460250_1_41;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.312;transl_table=11;conf=99.99;score=204.51;cscore=193.73;sscore=10.79;rscore=6.74;uscore=1.74;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 36338 39397 531.5 + 0 ID=MW460250_1_42;partial=00;start_type=GTG;rbs_motif=4Base/6BMM;rbs_spacer=13-15bp;gc_cont=0.288;transl_table=11;conf=99.99;score=531.47;cscore=538.20;sscore=-6.74;rscore=-3.38;uscore=2.46;tscore=-5.81; +MW460250_1 pyrodigal_v3.0.0 CDS 39508 40029 111.4 + 0 ID=MW460250_1_43;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.324;transl_table=11;conf=100.00;score=111.36;cscore=96.81;sscore=14.55;rscore=6.74;uscore=5.51;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 
CDS 40050 43508 729.3 + 0 ID=MW460250_1_44;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.342;transl_table=11;conf=99.99;score=729.26;cscore=714.76;sscore=14.50;rscore=8.03;uscore=4.16;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 43557 43715 35.4 + 0 ID=MW460250_1_45;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.283;transl_table=11;conf=99.97;score=35.43;cscore=29.13;sscore=6.30;rscore=7.12;uscore=-2.26;tscore=1.44; +MW460250_1 pyrodigal_v3.0.0 CDS 43716 45638 342.9 + 0 ID=MW460250_1_46;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.316;transl_table=11;conf=99.99;score=342.94;cscore=326.15;sscore=16.79;rscore=11.41;uscore=3.08;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 45661 46035 67.3 + 0 ID=MW460250_1_47;partial=00;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=11-12bp;gc_cont=0.307;transl_table=11;conf=100.00;score=67.26;cscore=58.80;sscore=8.46;rscore=4.67;uscore=1.49;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 46042 47418 218.9 + 0 ID=MW460250_1_48;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=11-12bp;gc_cont=0.328;transl_table=11;conf=99.99;score=218.90;cscore=220.16;sscore=-1.26;rscore=-2.28;uscore=-1.29;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 47510 49258 313.2 + 0 ID=MW460250_1_49;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.310;transl_table=11;conf=99.99;score=313.15;cscore=300.49;sscore=12.66;rscore=11.41;uscore=-1.05;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 49270 50883 229.1 + 0 ID=MW460250_1_50;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.284;transl_table=11;conf=99.99;score=229.15;cscore=213.01;sscore=16.13;rscore=12.88;uscore=0.94;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 50876 52318 266.6 + 0 ID=MW460250_1_51;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.312;transl_table=11;conf=99.99;score=266.61;cscore=254.72;sscore=11.89;rscore=8.03;uscore=1.56;tscore=2.31; 
+MW460250_1 pyrodigal_v3.0.0 CDS 52397 53434 190.0 + 0 ID=MW460250_1_52;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.318;transl_table=11;conf=99.99;score=189.96;cscore=172.06;sscore=17.90;rscore=11.41;uscore=4.19;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 53434 53811 65.4 + 0 ID=MW460250_1_53;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.280;transl_table=11;conf=100.00;score=65.39;cscore=51.20;sscore=14.19;rscore=12.88;uscore=-1.00;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 53811 55730 324.3 + 0 ID=MW460250_1_54;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.297;transl_table=11;conf=99.99;score=324.33;cscore=325.94;sscore=-1.61;rscore=0.03;uscore=-3.94;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 55730 56326 102.6 + 0 ID=MW460250_1_55;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.281;transl_table=11;conf=100.00;score=102.63;cscore=89.05;sscore=13.58;rscore=8.03;uscore=3.24;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 56341 57408 156.9 + 0 ID=MW460250_1_56;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.290;transl_table=11;conf=100.00;score=156.93;cscore=142.57;sscore=14.36;rscore=6.74;uscore=5.31;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 57475 57813 94.9 + 0 ID=MW460250_1_57;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.274;transl_table=11;conf=100.00;score=94.89;cscore=76.41;sscore=18.48;rscore=11.41;uscore=4.76;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 57813 58265 116.7 + 0 ID=MW460250_1_58;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.331;transl_table=11;conf=100.00;score=116.75;cscore=97.41;sscore=19.34;rscore=11.41;uscore=5.63;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 58252 58860 66.7 + 0 
ID=MW460250_1_59;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.328;transl_table=11;conf=100.00;score=66.71;cscore=74.91;sscore=-8.20;rscore=-11.85;uscore=1.35;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 58877 59269 51.7 + 0 ID=MW460250_1_60;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.293;transl_table=11;conf=100.00;score=51.70;cscore=48.32;sscore=3.38;rscore=0.03;uscore=1.05;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 59284 61398 393.9 + 0 ID=MW460250_1_61;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.312;transl_table=11;conf=99.99;score=393.94;cscore=381.72;sscore=12.23;rscore=6.74;uscore=3.18;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 61412 62461 183.0 + 0 ID=MW460250_1_62;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.320;transl_table=11;conf=99.99;score=183.01;cscore=175.73;sscore=7.27;rscore=6.74;uscore=-1.77;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 62479 62808 65.5 + 0 ID=MW460250_1_63;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.306;transl_table=11;conf=100.00;score=65.46;cscore=49.90;sscore=15.56;rscore=11.41;uscore=1.84;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 62792 63112 64.7 + 0 ID=MW460250_1_64;partial=00;start_type=ATG;rbs_motif=AGGA/GGAG/GAGG;rbs_spacer=11-12bp;gc_cont=0.293;transl_table=11;conf=100.00;score=64.67;cscore=56.68;sscore=7.99;rscore=3.91;uscore=1.77;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 63319 63915 78.7 + 0 ID=MW460250_1_65;partial=00;start_type=ATG;rbs_motif=3Base/5BMM;rbs_spacer=13-15bp;gc_cont=0.283;transl_table=11;conf=100.00;score=78.75;cscore=83.38;sscore=-4.63;rscore=-8.40;uscore=1.46;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 63925 64230 50.6 + 0 ID=MW460250_1_66;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.307;transl_table=11;conf=100.00;score=50.59;cscore=44.08;sscore=6.51;rscore=0.03;uscore=4.18;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 64306 
65178 170.8 + 0 ID=MW460250_1_67;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.310;transl_table=11;conf=100.00;score=170.79;cscore=153.35;sscore=17.44;rscore=11.41;uscore=3.73;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 65344 65856 38.4 + 0 ID=MW460250_1_68;partial=00;start_type=GTG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.285;transl_table=11;conf=99.99;score=38.42;cscore=49.64;sscore=-11.22;rscore=0.03;uscore=-5.44;tscore=-5.81; +MW460250_1 pyrodigal_v3.0.0 CDS 65992 67335 240.5 + 0 ID=MW460250_1_69;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.303;transl_table=11;conf=99.99;score=240.51;cscore=248.39;sscore=-7.89;rscore=-11.85;uscore=1.66;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 67603 68310 104.1 + 0 ID=MW460250_1_70;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.275;transl_table=11;conf=100.00;score=104.06;cscore=93.53;sscore=10.53;rscore=11.41;uscore=-3.18;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 68544 69404 129.0 + 0 ID=MW460250_1_71;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.304;transl_table=11;conf=100.00;score=128.97;cscore=147.61;sscore=-18.64;rscore=-11.85;uscore=-9.10;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 69473 69715 31.7 + 0 ID=MW460250_1_72;partial=00;start_type=GTG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.337;transl_table=11;conf=99.93;score=31.66;cscore=23.17;sscore=8.48;rscore=12.37;uscore=2.17;tscore=-6.05; +MW460250_1 pyrodigal_v3.0.0 CDS 69732 70214 91.9 + 0 ID=MW460250_1_73;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.300;transl_table=11;conf=100.00;score=91.95;cscore=73.37;sscore=18.58;rscore=12.88;uscore=3.39;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 70301 71572 260.5 + 0 ID=MW460250_1_74;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.355;transl_table=11;conf=99.99;score=260.48;cscore=240.20;sscore=20.28;rscore=10.47;uscore=7.50;tscore=2.31; +MW460250_1 
pyrodigal_v3.0.0 CDS 71632 71856 51.3 + 0 ID=MW460250_1_75;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.333;transl_table=11;conf=100.00;score=51.30;cscore=39.12;sscore=12.18;rscore=11.44;uscore=-1.31;tscore=2.05; +MW460250_1 pyrodigal_v3.0.0 CDS 72201 73169 167.7 + 0 ID=MW460250_1_76;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.268;transl_table=11;conf=100.00;score=167.68;cscore=162.50;sscore=5.17;rscore=0.03;uscore=2.84;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 73317 74264 178.7 + 0 ID=MW460250_1_77;partial=00;start_type=ATG;rbs_motif=AGxAG;rbs_spacer=11-12bp;gc_cont=0.344;transl_table=11;conf=99.99;score=178.65;cscore=185.47;sscore=-6.82;rscore=-7.65;uscore=-1.48;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 74268 74621 54.8 + 0 ID=MW460250_1_78;partial=00;start_type=ATG;rbs_motif=4Base/6BMM;rbs_spacer=13-15bp;gc_cont=0.314;transl_table=11;conf=100.00;score=54.79;cscore=54.34;sscore=0.46;rscore=-3.38;uscore=1.54;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 74608 75270 126.3 + 0 ID=MW460250_1_79;partial=00;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=5-10bp;gc_cont=0.275;transl_table=11;conf=100.00;score=126.33;cscore=113.17;sscore=13.16;rscore=9.27;uscore=1.59;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 75398 76021 152.8 + 0 ID=MW460250_1_80;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.361;transl_table=11;conf=100.00;score=152.83;cscore=137.83;sscore=15.01;rscore=8.03;uscore=4.67;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 76044 76556 99.6 + 0 ID=MW460250_1_81;partial=00;start_type=TTG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.365;transl_table=11;conf=100.00;score=99.63;cscore=99.16;sscore=0.47;rscore=6.74;uscore=4.37;tscore=-10.64; +MW460250_1 pyrodigal_v3.0.0 CDS 76571 76798 69.3 + 0 
ID=MW460250_1_82;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.355;transl_table=11;conf=100.00;score=69.31;cscore=55.98;sscore=13.33;rscore=11.60;uscore=-0.34;tscore=2.07; +MW460250_1 pyrodigal_v3.0.0 CDS 76894 77154 51.2 + 0 ID=MW460250_1_83;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.295;transl_table=11;conf=100.00;score=51.23;cscore=31.66;sscore=19.58;rscore=12.88;uscore=4.39;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 77158 77913 121.2 + 0 ID=MW460250_1_84;partial=00;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=5-10bp;gc_cont=0.279;transl_table=11;conf=100.00;score=121.25;cscore=108.03;sscore=13.21;rscore=9.27;uscore=1.64;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 77906 79156 227.5 + 0 ID=MW460250_1_85;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.289;transl_table=11;conf=99.99;score=227.54;cscore=213.69;sscore=13.86;rscore=6.74;uscore=4.81;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 79170 79538 22.3 + 0 ID=MW460250_1_86;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.312;transl_table=11;conf=99.41;score=22.33;cscore=11.50;sscore=10.83;rscore=6.74;uscore=1.79;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 79525 79836 60.6 + 0 ID=MW460250_1_87;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.272;transl_table=11;conf=100.00;score=60.62;cscore=47.45;sscore=13.17;rscore=12.88;uscore=-2.02;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 79900 80436 90.2 + 0 ID=MW460250_1_88;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.333;transl_table=11;conf=100.00;score=90.15;cscore=77.90;sscore=12.25;rscore=8.03;uscore=1.92;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 80429 81196 101.3 + 0 ID=MW460250_1_89;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.303;transl_table=11;conf=100.00;score=101.32;cscore=88.53;sscore=12.79;rscore=11.41;uscore=-0.92;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 81174 
81620 52.1 + 0 ID=MW460250_1_90;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.295;transl_table=11;conf=100.00;score=52.10;cscore=47.41;sscore=4.69;rscore=0.03;uscore=2.35;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 81620 82483 150.3 + 0 ID=MW460250_1_91;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.302;transl_table=11;conf=100.00;score=150.35;cscore=139.40;sscore=10.95;rscore=6.74;uscore=1.90;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 82855 83586 166.5 + 0 ID=MW460250_1_92;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.276;transl_table=11;conf=100.00;score=166.55;cscore=152.13;sscore=14.42;rscore=11.41;uscore=0.71;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 83604 84062 92.8 + 0 ID=MW460250_1_93;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.296;transl_table=11;conf=100.00;score=92.84;cscore=81.21;sscore=11.63;rscore=8.03;uscore=1.29;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 84127 84570 72.6 + 0 ID=MW460250_1_94;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.277;transl_table=11;conf=100.00;score=72.61;cscore=55.10;sscore=17.50;rscore=11.41;uscore=3.79;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 84587 85291 122.5 + 0 ID=MW460250_1_95;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.306;transl_table=11;conf=100.00;score=122.47;cscore=112.88;sscore=9.59;rscore=8.03;uscore=-0.75;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 85353 85751 38.2 + 0 ID=MW460250_1_96;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.321;transl_table=11;conf=99.98;score=38.18;cscore=29.38;sscore=8.81;rscore=8.03;uscore=-1.53;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 85898 86140 34.2 + 0 ID=MW460250_1_97;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.333;transl_table=11;conf=99.96;score=34.16;cscore=17.58;sscore=16.58;rscore=10.95;uscore=3.42;tscore=2.21; +MW460250_1 pyrodigal_v3.0.0 
CDS 86145 86309 15.9 + 0 ID=MW460250_1_98;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.279;transl_table=11;conf=97.51;score=15.95;cscore=12.27;sscore=3.68;rscore=7.39;uscore=-5.20;tscore=1.49; +MW460250_1 pyrodigal_v3.0.0 CDS 86511 86687 30.6 + 0 ID=MW460250_1_99;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.294;transl_table=11;conf=99.91;score=30.63;cscore=18.11;sscore=12.52;rscore=8.97;uscore=1.95;tscore=1.60; +MW460250_1 pyrodigal_v3.0.0 CDS 86677 87210 51.3 + 0 ID=MW460250_1_100;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.287;transl_table=11;conf=100.00;score=51.34;cscore=43.11;sscore=8.22;rscore=8.03;uscore=-2.11;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 87225 87473 21.7 + 0 ID=MW460250_1_101;partial=00;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=5-10bp;gc_cont=0.273;transl_table=11;conf=99.32;score=21.69;cscore=7.38;sscore=14.32;rscore=9.12;uscore=2.93;tscore=2.27; +MW460250_1 pyrodigal_v3.0.0 CDS 87485 87661 24.0 + 0 ID=MW460250_1_102;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.288;transl_table=11;conf=99.60;score=24.02;cscore=14.04;sscore=9.98;rscore=7.94;uscore=0.43;tscore=1.60; +MW460250_1 pyrodigal_v3.0.0 CDS 87654 87950 65.6 + 0 ID=MW460250_1_103;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.303;transl_table=11;conf=100.00;score=65.58;cscore=50.07;sscore=15.51;rscore=11.41;uscore=1.80;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 87998 88180 9.4 + 0 ID=MW460250_1_104;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.317;transl_table=11;conf=89.68;score=9.41;cscore=-2.98;sscore=12.39;rscore=5.78;uscore=5.45;tscore=1.66; +MW460250_1 pyrodigal_v3.0.0 CDS 88193 88561 67.0 + 0 ID=MW460250_1_105;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.279;transl_table=11;conf=100.00;score=67.03;cscore=55.02;sscore=12.02;rscore=12.88;uscore=-3.17;tscore=2.31; +MW460250_1 
pyrodigal_v3.0.0 CDS 88574 88921 66.5 + 0 ID=MW460250_1_106;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.313;transl_table=11;conf=100.00;score=66.46;cscore=50.37;sscore=16.09;rscore=12.88;uscore=0.90;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 88921 89199 9.4 + 0 ID=MW460250_1_107;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.308;transl_table=11;conf=89.71;score=9.42;cscore=10.75;sscore=-1.33;rscore=0.03;uscore=-3.67;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 89269 89574 51.7 + 0 ID=MW460250_1_108;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.252;transl_table=11;conf=100.00;score=51.66;cscore=38.88;sscore=12.78;rscore=11.41;uscore=-0.93;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 89589 89939 65.3 + 0 ID=MW460250_1_109;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.313;transl_table=11;conf=100.00;score=65.27;cscore=47.52;sscore=17.75;rscore=12.88;uscore=2.56;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 89939 90541 85.8 + 0 ID=MW460250_1_110;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.300;transl_table=11;conf=100.00;score=85.80;cscore=69.82;sscore=15.97;rscore=12.88;uscore=0.78;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 90555 90734 34.5 + 0 ID=MW460250_1_111;partial=00;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=5-10bp;gc_cont=0.261;transl_table=11;conf=99.96;score=34.47;cscore=24.26;sscore=10.21;rscore=6.56;uscore=2.02;tscore=1.63; +MW460250_1 pyrodigal_v3.0.0 CDS 90961 91362 60.5 + 0 ID=MW460250_1_112;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.286;transl_table=11;conf=100.00;score=60.51;cscore=48.84;sscore=11.67;rscore=11.41;uscore=-2.04;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 91364 91624 28.9 + 0 
ID=MW460250_1_113;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.341;transl_table=11;conf=99.87;score=28.86;cscore=11.79;sscore=17.07;rscore=12.88;uscore=1.88;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 91676 91963 41.4 + 0 ID=MW460250_1_114;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.323;transl_table=11;conf=99.99;score=41.35;cscore=22.51;sscore=18.84;rscore=12.88;uscore=3.65;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 91974 92090 29.9 + 0 ID=MW460250_1_115;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.316;transl_table=11;conf=99.90;score=29.85;cscore=22.32;sscore=7.53;rscore=5.88;uscore=0.61;tscore=1.05; +MW460250_1 pyrodigal_v3.0.0 CDS 92080 92343 43.4 + 0 ID=MW460250_1_116;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=11-12bp;gc_cont=0.280;transl_table=11;conf=100.00;score=43.45;cscore=29.73;sscore=13.71;rscore=13.08;uscore=-1.67;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 92420 92599 11.1 + 0 ID=MW460250_1_117;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.283;transl_table=11;conf=92.83;score=11.14;cscore=-0.69;sscore=11.83;rscore=9.12;uscore=1.58;tscore=1.63; +MW460250_1 pyrodigal_v3.0.0 CDS 92614 92877 47.4 + 0 ID=MW460250_1_118;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.330;transl_table=11;conf=100.00;score=47.38;cscore=30.88;sscore=16.50;rscore=12.88;uscore=1.31;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 92880 93197 63.0 + 0 ID=MW460250_1_119;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.283;transl_table=11;conf=100.00;score=63.04;cscore=45.69;sscore=17.35;rscore=12.88;uscore=2.16;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 93198 93878 123.3 + 0 ID=MW460250_1_120;partial=00;start_type=GTG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.267;transl_table=11;conf=100.00;score=123.33;cscore=114.19;sscore=9.15;rscore=12.88;uscore=2.07;tscore=-5.81; +MW460250_1 pyrodigal_v3.0.0 CDS 93967 94125 
9.0 + 0 ID=MW460250_1_121;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.277;transl_table=11;conf=88.81;score=9.01;cscore=-2.14;sscore=11.15;rscore=8.04;uscore=2.17;tscore=1.44; +MW460250_1 pyrodigal_v3.0.0 CDS 94160 94360 20.3 + 0 ID=MW460250_1_122;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.318;transl_table=11;conf=99.06;score=20.26;cscore=9.58;sscore=10.68;rscore=9.03;uscore=-0.18;tscore=1.83; +MW460250_1 pyrodigal_v3.0.0 CDS 94361 94651 39.7 + 0 ID=MW460250_1_123;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.278;transl_table=11;conf=99.99;score=39.74;cscore=24.11;sscore=15.63;rscore=12.20;uscore=1.13;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 94743 95051 50.2 + 0 ID=MW460250_1_124;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.272;transl_table=11;conf=100.00;score=50.22;cscore=33.99;sscore=16.23;rscore=11.41;uscore=2.52;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 95048 95956 171.9 + 0 ID=MW460250_1_125;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.285;transl_table=11;conf=100.00;score=171.88;cscore=153.21;sscore=18.67;rscore=10.47;uscore=5.90;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 95974 97443 317.5 + 0 ID=MW460250_1_126;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.323;transl_table=11;conf=99.99;score=317.51;cscore=298.85;sscore=18.66;rscore=10.47;uscore=5.89;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 97522 97767 21.7 + 0 ID=MW460250_1_127;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.305;transl_table=11;conf=99.32;score=21.68;cscore=8.27;sscore=13.41;rscore=11.09;uscore=0.08;tscore=2.24; +MW460250_1 pyrodigal_v3.0.0 CDS 97787 98179 84.7 + 0 ID=MW460250_1_128;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.272;transl_table=11;conf=100.00;score=84.72;cscore=68.68;sscore=16.04;rscore=11.41;uscore=2.33;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 
98181 98402 47.2 + 0 ID=MW460250_1_129;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.342;transl_table=11;conf=100.00;score=47.15;cscore=35.50;sscore=11.66;rscore=9.99;uscore=-0.35;tscore=2.02; +MW460250_1 pyrodigal_v3.0.0 CDS 98468 98779 65.8 + 0 ID=MW460250_1_130;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.317;transl_table=11;conf=100.00;score=65.84;cscore=50.43;sscore=15.41;rscore=11.41;uscore=1.70;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 98782 99291 88.4 + 0 ID=MW460250_1_131;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.269;transl_table=11;conf=100.00;score=88.43;cscore=72.03;sscore=16.40;rscore=11.41;uscore=2.69;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 99293 99622 49.7 + 0 ID=MW460250_1_132;partial=00;start_type=ATG;rbs_motif=4Base/6BMM;rbs_spacer=13-15bp;gc_cont=0.294;transl_table=11;conf=100.00;score=49.74;cscore=45.85;sscore=3.88;rscore=-3.38;uscore=4.96;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 99628 99822 24.0 + 0 ID=MW460250_1_133;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.226;transl_table=11;conf=99.60;score=23.96;cscore=20.33;sscore=3.63;rscore=6.17;uscore=-4.31;tscore=1.77; +MW460250_1 pyrodigal_v3.0.0 CDS 99846 100160 71.2 + 0 ID=MW460250_1_134;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=11-12bp;gc_cont=0.273;transl_table=11;conf=100.00;score=71.20;cscore=53.20;sscore=18.00;rscore=13.08;uscore=2.61;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 100175 100342 34.6 + 0 ID=MW460250_1_135;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.238;transl_table=11;conf=99.96;score=34.57;cscore=22.18;sscore=12.40;rscore=8.05;uscore=2.82;tscore=1.52; +MW460250_1 pyrodigal_v3.0.0 CDS 100379 100480 0.1 + 0 ID=MW460250_1_136;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.284;transl_table=11;conf=50.36;score=0.06;cscore=-6.94;sscore=7.01;rscore=5.10;uscore=1.49;tscore=0.91; +MW460250_1 
pyrodigal_v3.0.0 CDS 101353 101652 38.3 + 0 ID=MW460250_1_137;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.290;transl_table=11;conf=99.98;score=38.26;cscore=26.66;sscore=11.60;rscore=11.41;uscore=-2.11;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 101668 101853 12.6 + 0 ID=MW460250_1_138;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.247;transl_table=11;conf=94.75;score=12.58;cscore=0.54;sscore=12.04;rscore=9.43;uscore=0.92;tscore=1.69; +MW460250_1 pyrodigal_v3.0.0 CDS 101960 102250 51.4 + 0 ID=MW460250_1_139;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.309;transl_table=11;conf=100.00;score=51.36;cscore=39.98;sscore=11.38;rscore=11.41;uscore=-2.33;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 102250 102537 32.8 + 0 ID=MW460250_1_140;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.281;transl_table=11;conf=99.95;score=32.77;cscore=19.87;sscore=12.91;rscore=11.41;uscore=-0.81;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 102537 102830 40.7 + 0 ID=MW460250_1_141;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.327;transl_table=11;conf=99.99;score=40.74;cscore=31.43;sscore=9.31;rscore=11.41;uscore=-4.40;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 102834 103091 39.9 + 0 ID=MW460250_1_142;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.283;transl_table=11;conf=99.99;score=39.87;cscore=26.47;sscore=13.40;rscore=11.41;uscore=-0.31;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 103169 103408 32.0 + 0 ID=MW460250_1_143;partial=00;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=5-10bp;gc_cont=0.312;transl_table=11;conf=99.94;score=31.97;cscore=23.35;sscore=8.62;rscore=8.78;uscore=-2.35;tscore=2.19; +MW460250_1 pyrodigal_v3.0.0 CDS 103419 103766 62.8 + 0 
ID=MW460250_1_144;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.305;transl_table=11;conf=100.00;score=62.77;cscore=53.09;sscore=9.68;rscore=12.88;uscore=-5.51;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 103975 104313 81.2 - 0 ID=MW460250_1_145;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.198;transl_table=11;conf=100.00;score=81.24;cscore=64.20;sscore=17.04;rscore=6.74;uscore=8.00;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 104624 104932 37.7 + 0 ID=MW460250_1_146;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.366;transl_table=11;conf=99.98;score=37.70;cscore=21.60;sscore=16.11;rscore=8.03;uscore=5.77;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 105138 105422 49.4 + 0 ID=MW460250_1_147;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.337;transl_table=11;conf=100.00;score=49.41;cscore=34.35;sscore=15.06;rscore=6.74;uscore=6.02;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 105497 105688 33.7 + 0 ID=MW460250_1_148;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.177;transl_table=11;conf=99.96;score=33.73;cscore=21.06;sscore=12.67;rscore=5.09;uscore=5.83;tscore=1.74; +MW460250_1 pyrodigal_v3.0.0 CDS 106005 106493 58.5 - 0 ID=MW460250_1_149;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.276;transl_table=11;conf=100.00;score=58.51;cscore=40.67;sscore=17.84;rscore=12.88;uscore=2.65;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 106661 106819 1.2 + 0 ID=MW460250_1_150;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.245;transl_table=11;conf=57.09;score=1.24;cscore=-1.90;sscore=3.14;rscore=0.02;uscore=2.18;tscore=1.44; +MW460250_1 pyrodigal_v3.0.0 CDS 106889 107020 12.5 + 0 ID=MW460250_1_151;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.242;transl_table=11;conf=94.61;score=12.47;cscore=5.17;sscore=7.30;rscore=3.48;uscore=2.63;tscore=1.19; +MW460250_1 pyrodigal_v3.0.0 CDS 107188 107424 
28.3 + 0 ID=MW460250_1_152;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.350;transl_table=11;conf=99.85;score=28.33;cscore=14.97;sscore=13.37;rscore=6.31;uscore=4.90;tscore=2.16; +MW460250_1 pyrodigal_v3.0.0 CDS 107504 107974 104.5 + 0 ID=MW460250_1_153;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.380;transl_table=11;conf=100.00;score=104.52;cscore=86.99;sscore=17.53;rscore=6.74;uscore=8.49;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 108004 108129 7.6 + 0 ID=MW460250_1_154;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.310;transl_table=11;conf=85.29;score=7.65;cscore=7.15;sscore=0.50;rscore=0.01;uscore=-0.65;tscore=1.13; +MW460250_1 pyrodigal_v3.0.0 CDS 108214 108393 35.9 + 0 ID=MW460250_1_155;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.256;transl_table=11;conf=99.97;score=35.92;cscore=26.08;sscore=9.84;rscore=4.77;uscore=3.44;tscore=1.63; +MW460250_1 pyrodigal_v3.0.0 CDS 108727 108963 53.3 - 0 ID=MW460250_1_156;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.287;transl_table=11;conf=100.00;score=53.35;cscore=35.83;sscore=17.52;rscore=10.68;uscore=4.68;tscore=2.16; +MW460250_1 pyrodigal_v3.0.0 CDS 108965 109450 98.0 - 0 ID=MW460250_1_157;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.300;transl_table=11;conf=100.00;score=98.01;cscore=83.86;sscore=14.15;rscore=12.88;uscore=-1.04;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 109463 109870 66.7 - 0 ID=MW460250_1_158;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.282;transl_table=11;conf=100.00;score=66.69;cscore=53.00;sscore=13.69;rscore=12.88;uscore=-1.51;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 109870 110301 83.6 - 0 ID=MW460250_1_159;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=11-12bp;gc_cont=0.315;transl_table=11;conf=100.00;score=83.57;cscore=64.73;sscore=18.85;rscore=13.08;uscore=3.46;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 
CDS 110304 110495 -0.1 - 0 ID=MW460250_1_160;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.328;transl_table=11;conf=50.00;score=-0.10;cscore=-12.27;sscore=12.17;rscore=9.74;uscore=1.19;tscore=1.74; +MW460250_1 pyrodigal_v3.0.0 CDS 110492 110977 41.9 - 0 ID=MW460250_1_161;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.288;transl_table=11;conf=99.99;score=41.90;cscore=29.36;sscore=12.53;rscore=12.88;uscore=-2.66;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 110970 111401 84.3 - 0 ID=MW460250_1_162;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.282;transl_table=11;conf=100.00;score=84.29;cscore=65.63;sscore=18.66;rscore=11.41;uscore=4.95;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 111415 111957 87.0 - 0 ID=MW460250_1_163;partial=00;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=5-10bp;gc_cont=0.243;transl_table=11;conf=100.00;score=86.99;cscore=73.92;sscore=13.07;rscore=9.27;uscore=1.50;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 111969 112457 75.7 - 0 ID=MW460250_1_164;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.315;transl_table=11;conf=100.00;score=75.68;cscore=64.76;sscore=10.91;rscore=11.41;uscore=-2.80;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 112470 112868 61.2 - 0 ID=MW460250_1_165;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.251;transl_table=11;conf=100.00;score=61.17;cscore=44.13;sscore=17.03;rscore=12.20;uscore=2.53;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 112865 113572 119.3 - 0 ID=MW460250_1_166;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.297;transl_table=11;conf=100.00;score=119.30;cscore=104.73;sscore=14.57;rscore=12.88;uscore=-0.62;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 113672 114226 119.9 - 0 
ID=MW460250_1_167;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.283;transl_table=11;conf=100.00;score=119.93;cscore=102.07;sscore=17.85;rscore=11.41;uscore=4.14;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 114242 114559 42.7 - 0 ID=MW460250_1_168;partial=00;start_type=GTG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.283;transl_table=11;conf=99.99;score=42.70;cscore=33.15;sscore=9.54;rscore=11.41;uscore=3.95;tscore=-5.81; +MW460250_1 pyrodigal_v3.0.0 CDS 115545 116093 94.2 - 0 ID=MW460250_1_169;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.237;transl_table=11;conf=100.00;score=94.24;cscore=82.84;sscore=11.40;rscore=8.03;uscore=1.06;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 116097 116315 36.2 - 0 ID=MW460250_1_170;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.260;transl_table=11;conf=99.98;score=36.20;cscore=25.55;sscore=10.65;rscore=9.05;uscore=-0.39;tscore=1.99; +MW460250_1 pyrodigal_v3.0.0 CDS 116316 116510 27.6 - 0 ID=MW460250_1_171;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.272;transl_table=11;conf=99.83;score=27.64;cscore=21.27;sscore=6.38;rscore=9.37;uscore=-4.76;tscore=1.77; +MW460250_1 pyrodigal_v3.0.0 CDS 116500 117237 143.0 - 0 ID=MW460250_1_172;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.255;transl_table=11;conf=100.00;score=143.01;cscore=122.38;sscore=20.62;rscore=12.88;uscore=5.43;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 117300 117404 5.6 - 0 ID=MW460250_1_173;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.238;transl_table=11;conf=78.51;score=5.64;cscore=5.15;sscore=0.48;rscore=5.26;uscore=-5.71;tscore=0.94; +MW460250_1 pyrodigal_v3.0.0 CDS 117416 117655 41.7 - 0 ID=MW460250_1_174;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.279;transl_table=11;conf=99.99;score=41.69;cscore=32.18;sscore=9.51;rscore=6.39;uscore=0.94;tscore=2.19; +MW460250_1 pyrodigal_v3.0.0 CDS 
117657 118046 84.1 - 0 ID=MW460250_1_175;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.318;transl_table=11;conf=100.00;score=84.08;cscore=63.54;sscore=20.54;rscore=11.41;uscore=6.83;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 118145 118318 25.9 - 0 ID=MW460250_1_176;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.328;transl_table=11;conf=99.74;score=25.92;cscore=13.90;sscore=12.02;rscore=8.81;uscore=1.63;tscore=1.58; +MW460250_1 pyrodigal_v3.0.0 CDS 118359 118841 94.3 - 0 ID=MW460250_1_177;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.290;transl_table=11;conf=100.00;score=94.31;cscore=77.87;sscore=16.44;rscore=12.88;uscore=1.25;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 118891 119433 140.8 - 0 ID=MW460250_1_178;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.276;transl_table=11;conf=100.00;score=140.82;cscore=125.31;sscore=15.51;rscore=11.41;uscore=1.80;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 119433 119966 107.5 - 0 ID=MW460250_1_179;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.270;transl_table=11;conf=100.00;score=107.51;cscore=93.18;sscore=14.33;rscore=10.47;uscore=1.56;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 119969 120133 12.6 - 0 ID=MW460250_1_180;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.267;transl_table=11;conf=94.82;score=12.64;cscore=0.55;sscore=12.09;rscore=6.78;uscore=3.81;tscore=1.49; +MW460250_1 pyrodigal_v3.0.0 CDS 120136 120411 51.2 - 0 ID=MW460250_1_181;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.264;transl_table=11;conf=100.00;score=51.22;cscore=37.42;sscore=13.80;rscore=12.20;uscore=-0.71;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 120411 121256 150.7 - 0 
ID=MW460250_1_182;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.307;transl_table=11;conf=100.00;score=150.72;cscore=132.94;sscore=17.78;rscore=12.88;uscore=2.59;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 121268 122386 235.3 - 0 ID=MW460250_1_183;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.323;transl_table=11;conf=99.99;score=235.28;cscore=216.70;sscore=18.58;rscore=11.41;uscore=4.87;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 122540 122866 43.3 - 0 ID=MW460250_1_184;partial=00;start_type=GTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.254;transl_table=11;conf=100.00;score=43.26;cscore=39.10;sscore=4.16;rscore=8.03;uscore=1.94;tscore=-5.81; +MW460250_1 pyrodigal_v3.0.0 CDS 122859 123275 88.7 - 0 ID=MW460250_1_185;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.273;transl_table=11;conf=100.00;score=88.66;cscore=74.94;sscore=13.72;rscore=12.88;uscore=-1.47;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 123409 123711 74.6 - 0 ID=MW460250_1_186;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.300;transl_table=11;conf=100.00;score=74.60;cscore=56.08;sscore=18.52;rscore=11.41;uscore=4.81;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 123711 123899 40.2 - 0 ID=MW460250_1_187;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.254;transl_table=11;conf=99.99;score=40.23;cscore=31.56;sscore=8.67;rscore=5.01;uscore=1.94;tscore=1.72; +MW460250_1 pyrodigal_v3.0.0 CDS 123943 124104 32.3 - 0 ID=MW460250_1_188;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.290;transl_table=11;conf=99.94;score=32.26;cscore=20.19;sscore=12.08;rscore=8.19;uscore=2.41;tscore=1.47; +MW460250_1 pyrodigal_v3.0.0 CDS 124104 126152 393.2 - 0 ID=MW460250_1_189;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.257;transl_table=11;conf=99.99;score=393.20;cscore=377.10;sscore=16.10;rscore=11.41;uscore=2.39;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 
126230 126493 52.5 - 0 ID=MW460250_1_190;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.258;transl_table=11;conf=100.00;score=52.51;cscore=40.59;sscore=11.93;rscore=12.88;uscore=-3.26;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 126510 126683 4.4 - 0 ID=MW460250_1_191;partial=00;start_type=TTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.270;transl_table=11;conf=73.48;score=4.43;cscore=15.00;sscore=-10.57;rscore=5.49;uscore=-0.51;tscore=-15.55; +MW460250_1 pyrodigal_v3.0.0 CDS 126690 127268 46.5 - 0 ID=MW460250_1_192;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.273;transl_table=11;conf=100.00;score=46.46;cscore=42.10;sscore=4.36;rscore=0.03;uscore=2.03;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 127261 127887 134.2 - 0 ID=MW460250_1_193;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.308;transl_table=11;conf=100.00;score=134.25;cscore=118.13;sscore=16.12;rscore=11.41;uscore=2.41;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 127880 128776 185.5 - 0 ID=MW460250_1_194;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.265;transl_table=11;conf=99.99;score=185.52;cscore=171.25;sscore=14.26;rscore=12.88;uscore=-0.93;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 128776 129000 19.4 - 0 ID=MW460250_1_195;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.244;transl_table=11;conf=98.85;score=19.36;cscore=4.28;sscore=15.08;rscore=11.44;uscore=1.59;tscore=2.05; +MW460250_1 pyrodigal_v3.0.0 CDS 129069 129809 118.7 - 0 ID=MW460250_1_196;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.306;transl_table=11;conf=100.00;score=118.68;cscore=111.56;sscore=7.12;rscore=0.03;uscore=4.79;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 129861 130475 142.3 - 0 
ID=MW460250_1_197;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.289;transl_table=11;conf=100.00;score=142.31;cscore=124.09;sscore=18.21;rscore=11.41;uscore=4.50;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 130491 130916 57.8 - 0 ID=MW460250_1_198;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.298;transl_table=11;conf=100.00;score=57.75;cscore=46.46;sscore=11.30;rscore=12.20;uscore=-3.21;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 130906 131097 28.8 - 0 ID=MW460250_1_199;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.281;transl_table=11;conf=99.87;score=28.77;cscore=14.16;sscore=14.61;rscore=9.22;uscore=3.64;tscore=1.74; +MW460250_1 pyrodigal_v3.0.0 CDS 131120 131761 128.8 - 0 ID=MW460250_1_200;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.255;transl_table=11;conf=100.00;score=128.83;cscore=114.30;sscore=14.53;rscore=12.20;uscore=0.03;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 131751 131981 40.7 - 0 ID=MW460250_1_201;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.329;transl_table=11;conf=99.99;score=40.69;cscore=25.07;sscore=15.63;rscore=10.40;uscore=3.12;tscore=2.10; +MW460250_1 pyrodigal_v3.0.0 CDS 131984 132211 25.1 - 0 ID=MW460250_1_202;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.263;transl_table=11;conf=99.69;score=25.12;cscore=9.76;sscore=15.36;rscore=11.60;uscore=1.69;tscore=2.07; +MW460250_1 pyrodigal_v3.0.0 CDS 132321 133013 179.1 - 0 ID=MW460250_1_203;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.348;transl_table=11;conf=99.99;score=179.15;cscore=160.07;sscore=19.08;rscore=11.41;uscore=5.37;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 133200 133835 97.8 - 0 ID=MW460250_1_204;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.280;transl_table=11;conf=100.00;score=97.78;cscore=82.49;sscore=15.29;rscore=12.88;uscore=0.10;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 
133902 134693 180.8 - 0 ID=MW460250_1_205;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.298;transl_table=11;conf=99.99;score=180.83;cscore=161.76;sscore=19.07;rscore=11.41;uscore=5.36;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 134693 135001 48.1 - 0 ID=MW460250_1_206;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.275;transl_table=11;conf=100.00;score=48.11;cscore=32.41;sscore=15.70;rscore=12.88;uscore=0.51;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 135114 135743 88.1 - 0 ID=MW460250_1_207;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.379;transl_table=11;conf=100.00;score=88.12;cscore=100.56;sscore=-12.44;rscore=-11.85;uscore=-2.90;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 136014 136514 84.2 - 0 ID=MW460250_1_208;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.283;transl_table=11;conf=100.00;score=84.25;cscore=69.67;sscore=14.58;rscore=11.41;uscore=0.87;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 136674 137477 181.8 - 0 ID=MW460250_1_209;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.353;transl_table=11;conf=99.99;score=181.83;cscore=166.50;sscore=15.33;rscore=12.88;uscore=0.14;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 137477 137980 121.0 - 0 ID=MW460250_1_210;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=11-12bp;gc_cont=0.349;transl_table=11;conf=100.00;score=121.00;cscore=115.24;sscore=5.76;rscore=-2.28;uscore=5.73;tscore=2.31; +MW460250_1 pyrodigal_v3.0.0 CDS 138065 138250 39.3 - 0 ID=MW460250_1_211;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.258;transl_table=11;conf=99.99;score=39.30;cscore=27.36;sscore=11.94;rscore=8.35;uscore=1.90;tscore=1.69; +MW460250_1 pyrodigal_v3.0.0 CDS 139797 140015 29.1 - 0 
ID=MW460250_1_212;partial=00;start_type=ATG;rbs_motif=AGGA/GGAG/GAGG;rbs_spacer=11-12bp;gc_cont=0.274;transl_table=11;conf=99.88;score=29.10;cscore=19.12;sscore=9.98;rscore=3.38;uscore=4.61;tscore=1.99; diff --git a/tests/test_data/overall/Standard_examples/SAOMS1_Output/prodigal-gv_out_aas_tmp.fasta b/tests/test_data/overall/Standard_examples/SAOMS1_Output/prodigal-gv_out_aas_tmp.fasta new file mode 100644 index 0000000..6cae925 --- /dev/null +++ b/tests/test_data/overall/Standard_examples/SAOMS1_Output/prodigal-gv_out_aas_tmp.fasta @@ -0,0 +1,1006 @@ +>MW460250_1_1 # 183 # 392 # -1 # ID=1_1;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.267 +MSKHIEITMSSGAKYFLVSTDEKSYNRQDIDYMLRGMDETSIKVYTESAITSPQVYINPN +RIESFKIVF* +>MW460250_1_2 # 405 # 737 # -1 # ID=1_2;partial=00;start_type=TTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.306 +MDKEINNLVSQVETIKSKIQEGNYIDRGTFKDLEVEVAELRKMIVSIDKDVAVNSEKQSA +IYVQLERLDEKISELAESTKTKDTEKKDTTEKVLLLVLGAILSFVFNKFA* +>MW460250_1_3 # 750 # 1076 # -1 # ID=1_3;partial=00;start_type=TTG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.306 +MIKYKDILKLEFKDALAHFKRDRRYFHVYRIDRVLINGSIIYFDYYYLPSDDPNIVIKEL +DLQSFGKLRFEIDTKTSYGKVVTDNYMEIINDFLENYDIHSESETVRP* +>MW460250_1_4 # 1636 # 1902 # 1 # ID=1_4;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.292 +MTYFIFGGIVSNVALTVTDKFLLKKEDPLPEYVLKKVEINDKEIRIIKKIIESNYGITAE +EIKVRAKAQRRVEEDSKKEDYNENKERN* +>MW460250_1_5 # 1880 # 2158 # 1 # ID=1_5;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.294 +MKTKKEIKEQRKELKDGATSVSLVKKGDKRIASPSRICSLCGQQLSGMNYTKGKALSKVN +HFHLQYSKYIYFDICADINNCYKNLRKRGEMD* +>MW460250_1_6 # 2155 # 2565 # 1 # ID=1_6;partial=00;start_type=TTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.299 +MSAENIRDIINKKKLEEEDTRKYIADGFMNGIGKLMYEFNKKVDNKEIEVKDPNDLYKLF +VIFSQMQNMVNETSEGGAIPQLSRPQQELFDEITTEDSNGESTVDLQKISEMSAEDITAM +ISEKEKVMNEENSETF* +>MW460250_1_7 # 2580 # 2777 # 1 # 
ID=1_7;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.323 +MDGKELIKIAQETFQTEKITREQIDHIINMLNPSTYMLKYHTLRGHPITFSIPNRDRSKA +QAHRP* +>MW460250_1_8 # 3071 # 4042 # 1 # ID=1_8;partial=00;start_type=TTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.256 +MTLEKRKQEYLKKLKQIKNDEFELLGGFTKTREKALFKHKVCGYEWYTTPYNLLKSKGTG +CPKCQYRDKSYTTDEFKKKLKDKFGYEYELIEGQEYKNSREKLLFIHNKCGTEFKITSDS +LFRSKVPCHKCSKENRKTKKKTTEQFKNELYNKHKDEYILVEGSEYKTALEKVRIIHTKC +GYTWDVRASHILHTSKCPNCNESKGESLIKDILEDNNFSYIREYTFEDLKNVKKLPFDFA +LFIDNELVGLIEYDGSQHFIPFEHFGGKEKLRKTQYNDRKKNEYCDKNRIPLKRIKYDLD +EKEVIREIEMFLNSIVKSKAESY* +>MW460250_1_9 # 4183 # 5730 # 1 # ID=1_9;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.326 +MGVMEMVHFADMHSYANAKCLYTFPTNEQMKKFVQSRLNPVLEKEYFRDIVDWDKDSLGF +KKIRNSSLFFRTSSKASTVEGVDIDYLSLDEYDRVNLLAESSALESMSSSPFKIVRRWST +PSVPGMGIHKLYQQSDQWYYGHRCQHCDYLNEMSYNDYNPDNLEESGNMLCVNPEGVDEQ +AKTVQNGSYQFVCQKCGKPLDRWYNGEWHCKYPERTKGNKGVRGYLITQMNAVWISADEL +KEKEMNTESKQAFYNYILGYPFEDVKLRVNEEDVYGNKSPIAETQLMKRDRYSHIAIGID +WGNTHWITVHGMLPNGKVDLIRLFSVKKMTRPDLVEADLEKIIWEISKYDPDIIIADNGD +SGNNVLKLINHFGKDKVFGCTYKSSPKSTGQLRPEFNENNNRVTVDKLMQNKRYVQALKT +KDISVYSTVDDDLKTFLKHWQNVVIMDEEDEKTGEMYQVIKRKGDDHYAQASVYAYIGLT +RIKELLKEGNGTSFGSTFVSTDYNQEGNKQFYFDE* +>MW460250_1_10 # 5723 # 6544 # 1 # ID=1_10;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.279 +MNRGEIDLTDKLFYGTISNEEINKSVLNLLLGEELSLDYVSKNSDILDVKYEHVYKSLGF +DNFFDCFLYANREPEIVHKGGDKNLGGLNKVKRTVIRNGKEMEMTVYEDGNKENDSKEKQ +EGKEEVSRSAVGARAISNGEEGKVNPKKVANSLSNLSKKGVDVSHINTNSSLYKEFVDDN +GDTIGITSFKRTENDIILESYASSPDSDGVGARAIMELLRLSIKENKNAVVYDIELPEAI +EYLKTLGFKPNKDGYILRKKDVKQFLGDYSDFI* +>MW460250_1_11 # 6531 # 6704 # 1 # ID=1_11;partial=00;start_type=GTG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.213 +MILFSTIVIYSIVFILYIVLKTIYIKSNMSRIDNTTELLKILQEDIEGKIKKEGRNK* +>MW460250_1_12 # 6701 # 7180 # 1 # ID=1_12;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.292 
+MTLEENKLTLEESITPLSKEEKEDSIKEFSSLLCEMVNRLYKSYNVFRQDPMDETQRLDG +SLMVFQSRLNDPLTGDLHDKMYKLAFSKRIDIFEANKQFRKDVEAGKAIELGDVAIIDTA +LSNILSGNEFQGSISFMLRKDFEEKERIRKEEEEKLNNL* +>MW460250_1_13 # 7222 # 8415 # 1 # ID=1_13;partial=00;start_type=TTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.353 +MKKKPQGNEVIITIITVMIAVFVVIMTIFFNKYQDAKEDKDRYQRLVEIYKKADDNDGET +KKKYVKRLNKAEEELKKVKKETNYKDYNKKSSKERQKEDKETREKIYDVTGDDDLILVKN +NIEFSDKVDKPEILISEDGIGTITVPVDSGYEKQTVGSIITSVLGSPFLSPGSNSIDGLS +VINDNVYPNTVDSIVEDTKPSINLPTDNPIITNPVEPTIPSDIIPPIDNPSVPISPENPG +DNNQGNTDNPNPPPPGYTDEDGGRGSGGGGNSEPPSTEEPSDNGNTGGGDWEEKPDPGEE +PSDNGNTGGNGGEVTPEPEPEPEPEPEPEPEPEPSEPSDNPDENGGWETEPTEPESPSEP +DDKVDEEDKNEDTTDDKQSTEQPDDNNIDNEDKTEEE* +>MW460250_1_14 # 8492 # 8842 # 1 # ID=1_14;partial=00;start_type=TTG;rbs_motif=GGxGG;rbs_spacer=5-10bp;gc_cont=0.234 +MLGMNIITSLSVVFTCLSLLTLMIFVHSKFSSKNVFVLYVIYAIIGIGTYIVLTMFQTTS +VLIKNDVIDSIENTEHYIVFNDPIIIFIISFIGAILGGIWYKMMKIIKKSNFKDKK* +>MW460250_1_15 # 8851 # 9231 # 1 # ID=1_15;partial=00;start_type=GTG;rbs_motif=None;rbs_spacer=None;gc_cont=0.281 +MNRLIFSKDKKWDEAKDFIKGQGMQDNWIEIVDYYRQIGGKHVAVFIALNKVKYMILEAT +KDNKVILVDKDNNILLEDYDIVMESKKMFYYIEEPFEVKINIPQHIRDVTYNNTVVLTTV +RGSRGD* +>MW460250_1_16 # 9235 # 10926 # 1 # ID=1_16;partial=00;start_type=TTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.315 +MADLFKQFRLGKDYGNNSTIAQVPIDEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEP +FIEMMDTNPEFRDKRSYMKNEHNLHDILKKFGNNPILNAIILTRSNQVAMYCQPARYSEK +GLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQ +VNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKIIKGGKRFVQVVDKRVVASFTSREL +AMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQQQ +SQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTANDMQFEKWLNYLINIISA +LYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNRHIIS +EYGDKYTFQFVGGDTKSATDKLNILKLETQIFKTVNEAREEQGKKPIEGGDIILDASFLQ +GTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSSNDDKEIGTDAQIKGDDN +VYRTQTSNKGQGRKGEKSSDFKH* +>MW460250_1_17 # 11120 # 11893 # 1 # 
ID=1_17;partial=00;start_type=TTG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.323 +MEEIKFNAFVPMDLKKSVSTASDTNEYSIVSGWASTPSMDLQNDIVNPKGIDIEYFKSQG +YINYEHQSDKVVGIPTENCYVDIEKGLFIEAKLWKNDENVVKMLDLAEKLEKSGSGRRLG +FSIEGAVKKRNINDNRVIDEVMITGVALVKNPANPEATWESFMKSFLTGHGTSPDTQVDA +GALRKEEIASSITNLAYVTKIKDLKEFNDVWNGVVEDLSKSNSMGYEESVLTLQLAKGLS +RKDAELAVMDINKQKLE* +>MW460250_1_18 # 11912 # 12868 # 1 # ID=1_18;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.322 +MSKEMQNILEEYDKLNAQEAVSKSVEDDEKNTVESTEEQVAETTEEPAKEPEKVSEEDAK +EAQEQGEKVESEEVAEGNEDEEVEKSAKESKDPVDQKDTKTENKDNEKRKNKKDKKEDSD +SDDEDKDTDDDKDKKEDKKEKTSKSISDEDITTVFKSILTSFENLNKEKENFATKEDLSE +VSKSINELSAKISEIQAEDVSKSVDTDEEAVEKSVTSTNGEQEKVEGYVSKSVDTEEQAE +TGEAKSEEAEEVQEDNTFKGLSQEERTKFMDSYKAQAKDPRASKHDLQSAYQSYLNINTD +PTNASEKDIKTVKDFAQI* +>MW460250_1_19 # 12984 # 14375 # 1 # ID=1_19;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=11-12bp;gc_cont=0.372 +MTIEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNE +DLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSD +TKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAK +LIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQ +DNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQ +KGAFENEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSINVNAMYQQQPQFV +SIYRQGKETGMYFLIKRVPVKDAQEDGTIVFVDKNETLPETADVFVGEMSPQVVHLFELL +PMMKLPLAQINASITFAVLWYGALALRAPKKWARIKNVRYIAV* +>MW460250_1_20 # 14467 # 14763 # 1 # ID=1_20;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.310 +MLYYKKLLDKKMATVYGTVEIDKDGVVKGLTKEQEKEFANVPGFEFEEEKKTTRKQSAST +SKEEEPKEEEKKASTRKTTNTTRKSTARKTTAKKDENK* +>MW460250_1_21 # 14776 # 15684 # 1 # ID=1_21;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.355 +MVNSMFGGDLDPYEKSLNYEYPYHPSGNPKHIDVSEIDNLTLADYGWSPDAVKAYMFGIV +VQNPDTGQPMGDEFYNHILERAVGKAERALDISILPDTQHEMRDYHETEFNSYMFVHAYR +KPILQVENLQLQFNGRPIYKYPANWWKVEHLAGHVQLFPTALMQTGQSMSYDAVFNGYPQ 
+LAGVYPPSGATFAPQMIRLEYVSGMLPRKKAGRNKPWEMPPELEQLVIKYALKEIYQVWG +NLIIGAGIANKTLEVDGITETIGTTQSAMYGGASAQILQINEDIKELLDGLRAYFGYNMI +GL* +>MW460250_1_22 # 15698 # 16576 # 1 # ID=1_22;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.319 +MEKPYMIGANSNPNVINKSTTYTTTTQADEQDKPKYTTRLEFDTIDMIRFINDRGIKVLW +EEAYFCPCLNPDTGHPRVDCPRCHGKGIAYLPPKETIMAIQSQEKGTNQLDIGILDTGTA +IGTTQLEKRISYRDRFTVPEVLMPQQMIYFVNKDRIKKGIPLYYDVKEITYIATQDGTVY +EEDYEIKNNRLYLNEKYENHTVTLKILMTLRYVVSDILKESRYQYTKFNQPKSKFENLPQ +KLLLKREDVIVLQDPYKVNDGIEEDLEIQVDDPKASASNPSNLGGFFGGAFK* +>MW460250_1_23 # 16576 # 17196 # 1 # ID=1_23;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.320 +MPVHGKRPNLFKNKNYKQVGKRTIDGMRSEVLDKLQATAQQVENTSIKRMPTYLQITEKK +LEKEGVVDLKKAFAHSSKKKTSKDGGWYLTVPIRIKTSRMNNSTYQDMRTLKVDKGTGSV +SKITDYLEGRRKNVSHPSMKPEPMTHNMTKVKRGKQSSYFIFRTVSSKSPASSWILNRDK +VNEDNFSKTTLKTVKQLMNWKMKNLN* +>MW460250_1_24 # 17215 # 18051 # 1 # ID=1_24;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=11-12bp;gc_cont=0.302 +MAITSVDSYLLSEIKPRLNTVLENCYIIDEVLKDFDYQTRESFKEAFCGKNAQHEVTVGF +NFPKFKNNYEAHYLIQLGQGQETKNSLGSIQSSYFEATGDTLVESSTAIREDDKLVFTVS +KPIGELIKVEDIEFAKYDNLQVEGNKVSFKYQTNEDYENYNANIIFTEKKNDSKGLVKGF +TVEEQVTVVGLSFNVDVARCLDAVLKMILISMRDSIEEQQTFQLQNLSFGDIAPIIEDGD +SMIFGRPTIIKYTSSLDLDYTITQDINKLTFKERKDWK* +>MW460250_1_25 # 18053 # 18268 # 1 # ID=1_25;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.306 +MARKKTPENNTPKFNGYVHIDTFLDTAKTLFNMRDSQVAGFKAYMEGSHYLFSEQEFLPS +LEKYLGRKLDI* +>MW460250_1_26 # 18295 # 20058 # 1 # ID=1_26;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.362 +MAVEPFPRRPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGGEPNTVYELRNYSQAK +RLFRSGELLDAIELAWGSNPNYTAGRILAMRIEDAKPASAEIGGLKITSKIYGNVANNIQ +VGLEKNTLSDSLRLRVIFQDDRFNEVYDNIGNIFTIKYKGEEANATFSVEHDEETQKASR +LVLKVGDQEVKSYDLTGGAYDYTNAIITDINQLPDFEAKLSPFGDKNLESSKLDKIENAN +IKDKAVYVKAVFGDLEKQTAYNGIVSFEQLNAEGEVPSNVEVEAGEESATVTATSPIKTI +EPFELTKLKGGTNGEPPATWADKLDKFAHEGGYYIVPLSSKQSVHAEVASFVKERSDAGE 
+PMRAIVGGGFNESKEQLFGRQASLSNPRVSLVANSGTFVMDDGRKNHVPAYMVAVALGGL +ASGLEIGESITFKPLRVSSLDQIYESIDLDELNENGIISIEFVRNRTNTFFRIVDDVTTF +NDKSDPVKAEMAVGEANDFLVSELKVQLEEQFIGTRTINTSASIIKDFIQSYLGRKKRDN +EIQDFPAEDVQVIVEGNEARISMTVYPIRSFKKISVSLVYKQQTLQA* +>MW460250_1_27 # 20131 # 20559 # 1 # ID=1_27;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.354 +MASEAKQTVHTGNTVLLMIKGKPVGRAQSASGQREYGTTGVYEIGSIMPQEHVYLRYEGT +ITVERLRMKKENFADLGYASLGEEILKKDIIDILVVDNLTKQVIISYHGCSANNYNETWQ +TNEIVTEEIEFSYLTASDKART* +>MW460250_1_28 # 20656 # 20796 # 1 # ID=1_28;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.241 +MANKRKTIGKMSNTRATWNINPVTKVKKDKTKYSRKNKHKGLDNYN* +>MW460250_1_29 # 20839 # 21297 # 1 # ID=1_29;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.272 +MSTFWSERRTTNKDRQVKKHYTQMSMYERKKCVELLQETITENRIINFTRHSAKKVKGKP +TTNIPKLIGFIFKNKFAYENIIEYNNTDYNGNIERRIVVKHPKVITVEGKPSYQFLTISL +EDARVITVWYNSVDDTHRTLDLNYYSKDLTIQ* +>MW460250_1_30 # 21310 # 21504 # 1 # ID=1_30;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.246 +MGITIVNSYFILSSIFLIILTILNGKGTVTRESLTMSKILVVITSIQFLACLIINGIYWS +LKFM* +>MW460250_1_31 # 21586 # 21897 # 1 # ID=1_31;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.260 +MSQDKLRAIYTEMKVELHKFPKEVDITSKSTAIAINQILDKFKTLTEQAGKITRKYLEGQ +EILTIDYEYYDSLQEYYIYLLRNSEKIEQSLQEITKRTGEYVK* +>MW460250_1_32 # 22029 # 22487 # 1 # ID=1_32;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=11-12bp;gc_cont=0.305 +MAEEIKKEQDVQETTKEEKKDVSKMTPEEIDKLKYQDKQEKEQVINKVIKGVNDTWEKEY +NFEELDLRFKVKIKLPNAREQGNIFALRSAYLGGMDMYQTDQVIRAYQMLATLQEVGIEV +PKEFQDPDDIYNLYPLTVMYEDWLGFLNSFRY* +>MW460250_1_33 # 22531 # 23067 # 1 # ID=1_33;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.337 +MESIVKQPLSRNLWAIMKEFNVLPTEQRFKDLDDYQIEFIIGNMNRDVYEHNKQLKQAQK +GGKFDSQFEDDDSSWWNESHEDFDPVPDFLDADDLAQQMEAKLSDRDKEERAKRNDAELN +DETEGLTTQHLAMMEYIRQKQQELDDEVGNGKTSEDDATISQDSVNKALEDLDDDWYM* +>MW460250_1_34 # 23120 # 27178 # 
1 # ID=1_34;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.372 +MMAMNDDYRLVLSGDSSDLENSLKAIELYMDSLESKNIDAPLDNFLKKLKVIAKEVKNVQ +NAMDKQDGKSVISSKDMDESIKSTQSATKNINELKKALDDLQKENISKGIAPDPEVEKAY +AKMGKVVDETQEKLEKMSSQKIGSDASIQNRIKEMKTLNQVTEEYNKISKDSSATKDYTK +RLRANRNMTRGYMERSEGTGRLTYDQGARVRSELGKISSYESQRKQNQRNLGQAREQYSN +YRNQQQDLTKRRASGQINKEQYEQELASIKQEMKAREELISNYEKLGAELDKTVQYYKGS +VQKDFQSRDVDQQRGTFGRMVQERLPSIGSHAMMGTTAMATGLYMKGASLSETNRPMVTS +LGQNSDNMDIDSVRNAYGDLSIDNKLGYNSTDMLKMATSYEASVGHKSDEDTMAGTKQLA +IGGRSLGIKDQEAYQESMGQIMHTGGVNSDNMKEMQDAFLGGIKQSGMVGRQDEQLKALG +SIAEQSGEGRTLTKDQMSNLTAMQSTFAESGSKGLQGEQGANAINSIDQGLKNGMNSSYA +RIAMGWGTQYQGLEGGYDLQKRMDEGISNPENLTDMADIATQMGGSEKEQKYLFNRSMKE +IGANLTMEQSDEIFKDAQSGKLSKEELAKKAKKMEKEGKKEGEDNATDYKESKSGKNDQN +KSKTDDKAEDTYDMAQPLRDAHSALAGLPAPIYLAIGAIGAFTASLIASASQFGAGHLIG +KGAKGLRNKFGRNKGGSSGGNPMAGGMPSGGGSPKGGGSPKGGGTRSTGGKILDSAKGLG +GFLVGGAGWKGMFGGESKGKGFKQTSKEAWSGTRKVFNRDNGRKAMDKSKDIAKGTGSGL +KDIYNDSIFGKERRQNLGEKAKGFGGKAKGLYGKFADKFGDGGKNGILSQSPKAGGSGIG +KLGKLAGGLGKGAGVLGVATSALSLIPALASGDSKAIGGGIGSMGGGMAGASAGASIGAL +FGGVGAIPGALIGGAIGSFGGGAVGEKVGDMAKKANTKEGWNLGWTNGDKDGKNKFQDSL +LGKPISKAWSGITGLFDNDAEASEEDSKDKKKGVKGVKGDTKKKEKMTAEQLREKNNQSE +TKNLKIYSDLLDRAQKIIESAKGINIDGGTSDSGSDSGGSASDVGGEGAEKMYKFLKGKG +LSDNQVGAVMGNLQQESNLDPNAKNASSGAFGIAQWLGARKTGLENFAKSKGKKSSDMDV +QLDYLWKEMQSDYESNNLKNAGWSKGGSLEQNTKAFATGFERMGANEAMMGTRVNNAKEF +KKKYGGSGGGGGGGALSSTYQEAMSNPVLTTGSNYRGSNDASNASTTNRITVNVNVQGGN +NPEETGDIIGGRIREVLDSNMDIFANEHKRSY* +>MW460250_1_35 # 27257 # 29683 # 1 # ID=1_35;partial=00;start_type=ATG;rbs_motif=AGGA/GGAG/GAGG;rbs_spacer=11-12bp;gc_cont=0.331 +MRRIRRPKVRIEIVTDDNTFTLRFEDTRDYNGDEFGAKLLGFQTKNSMEDDSSVFQINMA +GDTYWDKLVMANDIIRIFITPNDDPNDKEGKQERLIQVGMVSQVSKVGSYGNDQTQFRIT +GQSFVKPFMKFGLGVIQEVQAVLPEVGWLIDGDGDNEVKFTGSSAHEVMTGIIRRFIPYM +KYNYTEKTYNTIDNYLDYDDLSSWDEFEKLTEVSAFTNFDGSLKQLMDMVTARPFNELFF +KNSEKTPGKAQLVLRKTPFNPTEWRALDMIKVPTEDFIEEDVGKSDVETYSIFTATPAGM +LKELNGDVFSKPQFHPELTDRYGYTKFEVENIYLSTKSGSATEDSDSSGDDNGTERGTYS 
+KIMKDLSNYGRDNISKGIDKYTSKLSSKYKNLKKAQAKKIIEKFVKEGKVTEKEYEKITG +NKVDDELTSDNRPKLTKDKLKSILKEKFKTQDDFNNSKKKKKAKTDALKELTTKYRFGNK +THATTLLDEYIKYKGEPPNDEAFDKYLKAIEGVSNVATDTGSDASDSPLVMFSRMLFNWY +HGNPNFYAGDIIVLGDPKYDLGKRLFIEDKQRGDTWEFYIESVEHKFDYKQGYYTTVGVT +RGLKDAILEDGKGSPHRFAGLWNQSSDFMGGLMGEDTSKELKEKGVAEKQSSGDKDGGSD +SGGAQDGGSLDSLKKYNGKLPKHDPSFVQPGNRHYKYQCTWYAYNRRGQLGIPVPLWGDA +ADWIGGAKGAGYGVGRTPKQGACVIWQRGVQGGSPQYGHVAFVEKVLDGGKKIFISEHNY +ATPNGYGTRTIDMSSAIGKNAQFIYDKK* +>MW460250_1_36 # 29697 # 30584 # 1 # ID=1_36;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.282 +MATDKEAKDVIDKFIDNVFNFDVLTKERIKEKDEEIKKITTDDMYEKVVYIRPYVGVIQS +LNPQHVQYESFSNNGYDIEAELSFRKVSYLVDKGSIPTDSLSTLTVHLVERNQELLIDYF +DEIQDVLYGEYMEEEYVFDEDVPLSTILALDLNDNLKSLSNIKYMFKGAPKENPFGTDKD +VYIDTYNLLYWLYLGEDEELAYPMNINYFFTEGRFFTIFGKGHKYKVDVSKFIVGDILFF +GRSDTNIGIYVGDGEFISMMGKFPKDETPIGKYKLDDYWNEFNGRVMRFDEEVYI* +>MW460250_1_37 # 30584 # 33130 # 1 # ID=1_37;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.311 +MVVRFQSSMGRSLKRVDSDDLNVKGLVLATVSKINYKYQSVEVKVNNLTLGSRIGDDGSL +AVPYPKSFIGRTPEGSVFGTKPLITEGSVVLIGFLNDDINSPIILSVYGDNEQNKMINTN +PLDGGKFDTESVYKYSSSLYEILPSLNYKYDDGEGTSIRTYNGKSFFSMTSGEEEKPQAT +DFYTGTEYQDLFTSYYGNKTLIEPRIQKAPNMLFKHQGVFYDDGTPDNHITTLFISERGD +IRASVLNTETQKRTTQEMSSDGSYRVIKQDDDLMLDEAQVWIEYGISEDNKFYIKNDKHK +FEFTDEGIYIDDKPMLENLDESIAEAMKNLNEIQKELDDINYLLEGVGKDNLEELIESTK +ESIEASKKATSDVNRLTTQIAEVSGRTEGIITQFQKFRDETFKDFYEDASTVINEVNQNF +PTMKTDVKTLKTKVDNLEKTEIPNIKTRLTELENNNNNADKIISDRGEHIGAMIQLEENV +TVPMRKYMPIPWSKVTYNNAEFWDSNNPTRLVVPKGITKVRVAGNVLWDSNATGQRMLRI +LKNGTYSIGLPYTRDVAISTAPQNGTSGVIPVKEGDYFEFEAFQDSEGDRQFRADPYTWF +SIEAIELETETMEKDFMLIGHRGATGYTDEHTIKGYQMALDKGADYIELDLQLTKDNKLL +CMHDSTIDRTTTGTGKVGDMTLSYIQTNFTSLNGEPIPSLDDVLNHFGTKVKYYIETKRP +FDANMDRELLTQLKAKGLIGIGSERFQVIIQSFARESLINIHNQFSNIPLAYLTSTFSES +EMDDCLSYGFYAIAPKYTTITKELVDLAHSKGLKVHAWTVNTKEEMQSLIQMGVDGFFTN +YLDEYKKI* +>MW460250_1_38 # 33237 # 34028 # 1 # 
ID=1_38;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.332 +MPQSDGISNLHRIALRFPKEGGGYDMYRFKVNPENYTIDSPQRTTAIKTKSDIVIEDYGK +DIEVINFTGTTGFRPVREADGLKTGKQKMEELQSRVSEYAMQGGSGNVSGSYLQFFNFTD +DSYYKVHLAPQGLKITRSKDEPLLFRYEITLVVIGSLTEADRSAVTTEEFGNVKPNASQR +VDEGIKELDKNARKTRDRNNQEISRRENTIPKSTGDNTNEGNRLKQSFPSSSIYNPRQST +NGLKGNIDNMALIIGYGDGGVSS* +>MW460250_1_39 # 34028 # 34552 # 1 # ID=1_39;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.290 +MNNFIPQPQGLLRFLNALDTDLTSSHMNLLDEEVSFVSKFYTPQLQLSELAKKVLTNIKT +DDIPVLEREFNDNTIIHKANDTLLKVQAPRMYMILQSIVLEAYAIVNCFVENPSSLKYLT +EEDVSITRENLNYVADYLGNYDDYNSVVLDLRDLDLCFSAIELQLPLIKKEANV* +>MW460250_1_40 # 34552 # 35256 # 1 # ID=1_40;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.308 +MRFKKHVVQHEETMQAIAQRYYGDVSYWIDLVEHNNLKYPYLVETDEEKMKDPERLASTG +DTLIIPIESDLTDVSAKEINSRDKDVLVELALGRDLNITADEKYFNEHGTSDNILAFSTN +GNGDLDTVKGIDNMKQQLQARLLTPRGSLMLHPNYGSDLHNLFGLNIPEQATLIEMEVLR +TLTSDNRVKSANLIDWKIQGNVYSGQFSVEIKSVEESINFVLGQDEEGIFALFE* +>MW460250_1_41 # 35271 # 36317 # 1 # ID=1_41;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.312 +MKTRKLTNILSKLIDKTMAGTSKITDFTPGSASRSLLEAVSLEIEQFYILTKENIDWGIQ +EGIIEAFDFQKRQSKRAYGDVTIQFYQPLDMRMYIPAGTTFTSTRQEYPQQFETLVDYYA +EPDSTEIVVEVYCKETGVAGNVPEGTINTIASGSSLIRSVNNEYSFNTGTKEESQEDFKR +RFHSFVESRGRATNKSVRYGALQIPDVEGVYVYEETGHITVFAHDRNGNLSDTLKEDIID +ALQDYRPSGIMLDVTGVEKEEVNVSATVTISNKSRIGDTLQKHIESVIRSYLNNLKTSDD +LIITDLIQAIMNIDDVLIYDVSFDNLDENIIVPPQGIIRAGEIKVELK* +>MW460250_1_42 # 36338 # 39397 # 1 # ID=1_42;partial=00;start_type=GTG;rbs_motif=4Base/6BMM;rbs_spacer=13-15bp;gc_cont=0.288 +MANFLKNLHPLLRRDRNKKDNQDPNFALIDALNEEMNQVEKDAIESKLQSSLKTSTSEYL +DKFGDWFGVYRKTDEKDDVYRARIIKYLLLKRGTNNAIIDAIKDYLGRDDIDVSVYEPFT +NIFYTNKSHLNGEDHLMGYYYRFAVINVSIGDYFPVEIIDVINEFKPAGVTLYVTYDGAS +TIRGGAIIKWLDGLPKIETYQEFDRFTGYDDTFYGHINMNQSKDTDNSSSDIFKTNHSLI +NSLDVLTGSSSVGRQYINYGYVTSYVYNPGMTSSVNQISASTKGRGQEVPTDYYMYTSTK 
+NNNTVELSMQTTSGVSYLYNNFNFRDYMSKYRPQVDLQSDEARRIVSDYIKELSIDYYLS +AVIPPDESIEIKLQVYDFSINRWLTVSINNLSFYEKNIGSNIGYIKDYLNSELNMFTRLE +INAGKRDSVDIKVNYLDLMFYYYERGIYTIKPYKALIENYLDISRETYVEAFKIASLSNG +DIITKTGFQPIGYLKLVGNYENTIPSTINIVAKDTDNNPIESNELDVYNTVENRNLLQSY +KGVNTIAREITSTKEFTVSGWAKEIYSTNYLSKVLKPGKVYTLSFDMEITGNDPTLKSYS +DSHGIYLYSNTKGIVVSGVKSMERTIGNKVSVTQTFTAPTITDHRLLIYTGRYTSDGKAS +TPPVFFNTVKITELKLTEGSSKLEYSPAPEDKPNVIEKGIKFNNILTNIQTLSINSDTIL +KNVTLYYSYYGDSWVELKTLGNISTGETTETNNLIDLYGLQTVDYSNINPMSKVSLRSIW +NVKLGELNNQEGSLYNMPNDYFNAVWQDIDKLSDIELGSMRMVKDTEGGVFDGATGEIIK +ATLFNVGAYTDLDMLAYTLTNYTEPLTLGSSRLIIELKEELLTSESFNVDNRIKVIDSIY +EELPNTSIIKNGFVEREVTGSKYLDYGLYEPIEDGTRYKLIVEGEFKDNIEFISLYNSNP +NFNETFIYPSEIINGVAEKEFIAKPSTEDKPRLNTDVRIYIRPYDSTISKVRRVELRKV* +>MW460250_1_43 # 39508 # 40029 # 1 # ID=1_43;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.324 +MAIATYNSHVELAKYLVSKADSVYLTIGKSTPWSNETNPPQPDENATVLQEVIGYKKATK +VTLVRPSKSPEDDNKNLISYGNKSWVEVTPENAKAEGAKWVYLESSIVGDELPLGTYRQV +GFVMDLVAKSGISKFNLVPSEVESTGTLLFFDNKQFQNRSEQTTAKERFIVEV* +>MW460250_1_44 # 40050 # 43508 # 1 # ID=1_44;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.342 +MAINFKGSPYLDRFDPSKDRTKVLFNPDRPLQQAELNEMQSIDQYYLKNLGDAIFKDGDK +QSGLGFTLSEDNVLTVNPGYVYINGKIRYYDNDDSVKITGVGKETIGIKLTERIVTPDED +ASLLDQTSGVPSYFSKGADRLEEKMSLTVNDPTSATIYTFMDGDLYIQSTNAEMDKINKV +LAERTYDESGSYKVNGFELFSEGNAEDDDHVSVVVDAGKAYVKGFKVDKPVSTRISVPKS +YDLGTAENESTIFNKSNNSISLANSPVKEIRRVTGQVLIEKERVTRGAQGDGQDFLSNNT +AFEIVKVWTETSPGVTTKEYKQGEDFRLTDGQTIDWSPQGQEPSGGTSYYVSYKYNKRME +AGKDYEVTTQGEGLSKKWYINFTPSNGAKPIDQTVVLVDYTYYLARKDSVFINKYGDIAI +LPGEPNIMRLVTPPLNTDPENLQLGTVTVLPDSDEAVCISFAITRLSMEDLQKVKTRVDN +LEYNQAVNALDDGAMEGQNPLTLRSVFSEGFISLDKADITHPDFGIVFSFEDAEATLAYT +EAVNQPKIIPGDTTAHIWGRLISAPFTEERTIYQGQASETLNVNPYNIPNKQGVLKLTPS +EDNWIDTENVTITEQKTKKVTMKRFWRHNESYYGETEHYLYSNLQLDAGQKWKGETYAYD +REHGRTGTLLESGGQRTLEEMIEFIRIRDVSFEVKGLNPNDNNLYLLFDGVRCAITPATG +YRKGSEDGTIMTDAKGTAKGKFTIPAGIRCGNREVTLKNANSTSATTYTAQGRKKTAQDI 
+IIRTRVTVNLVDPLAQSFQYDENRTISSLGLYFASKGDKQSNVVIQIRGMGDQGYPNKTI +YAETVMNADDIKVSNNASAETRVYFDDPMMAEGGKEYAIVIITENSDYTMWVGTRTKPKI +DKPNEVISGNPYLQGVLFSSSNASTWTPHQNSDLKFGIYTSKFNETATIEFEPIKDVSAD +RIVLMSTYLTPERTGCTWEMKLILDDMASSTTFDQLKWEPIGNYQDLDVLGLARQVKLRA +TFESNRYISPLMSSSDLTFTTFLTELTGSYVGRAIDMTEAPYNTVRFSYEAFLPKGTKVV +PKYSADDGKTWKTFTKSPTTTRANNEFTRYVIDEKVKSSGTNTKLQVRLDLSTENSFLRP +RVRRLMVTTRDE* +>MW460250_1_45 # 43557 # 43715 # 1 # ID=1_45;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.283 +MPREVRDPYSQAKLFIPTVEEKSIKELEKTYKEKIDEATKLINELKKERGEK* +>MW460250_1_46 # 43716 # 45638 # 1 # ID=1_46;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.316 +MAFNYTPLTETQKLKDMYPKVNDIGNFLKTEVNLSDVKQISQPDFNNILASIPDSGNYYV +TNSKGAPSGEATAGFVRLDKRNVNYYKIYYSPYSSNKMYIKTYANGTVYDWISFKLDEGS +LYNEGNTLNVKELTESTTQYATLVNPPKENLNTGWVNYKESKNGVSSLVEFNPVNSTSTF +KMIRKLPVQEQKPNLLKDSLFVYPETSYSNIKTDNWDTPPFWGYSSNSGRSGVRFRGENT +VQIDDGSDTYPSVVSNRFKMGKELSVGDTVTVSVYAKINDPALLKDNLVYFELAGYDTVD +DTSKNPYTGGRREITASEITTEWKKYSFTFTIPENTIGASGVKVNYVSLLLRMNCSSSKG +NGAVVYYALPKLEKSSKVTPFITHENDVRKYDEIWSNWQEFISKDELKGHSPVDIEYNDY +FKYQWWKSEVNEKSLKDLAMTVPQGYHTFYCQGSIAGTPKGRSIRGTIQVDYDKGDPYRA +NKFVKLLFTDTEGIPYTLYYGGYNQGWKPLKQSETSTLLWKGTLDFGSTEAVNLNDSLDN +YDLIEVTYWTRSAGHFSTKRLDIKNTSNLLYIRDFNISNDSKGSSVDFFEGYCTFPTRTS +VQPGMVKSITLDGSTNTTKVASWNEKERIQVYNIMGINRG* +>MW460250_1_47 # 45661 # 46035 # 1 # ID=1_47;partial=00;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=11-12bp;gc_cont=0.307 +MAVKYDIGNNEIVLHLREGKYITGFTTVGGYDKELGQVKVNREILPAYFFDNFAYERYLY +YSKPEEVIENKNYVPPQINDDDEESQQITVPKEQYDSLKEELELMRKQQEAMMEMLQKLL +GQKG* +>MW460250_1_48 # 46042 # 47418 # 1 # ID=1_48;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=11-12bp;gc_cont=0.328 +MALNFTTITENNVIRDLTTQVNNIGEELTKERNIFDITDDLVYNFNKSQKIKLTDDKGLT +KSYGNITALRDIKEPGYYYIGARTLATLLDRPDMESLDVVLHVVPLDTSSKVVQHLYTLS +TNNNQIKMLYRFVSGNSSSEWQFIQGLPSNKNAVISGTNILDIASPGVYFVMGMTGGMPS +GVSSGFLDLSVDANDNRLARLTDAETGKEYTSIKKPTGTYTAWKKEFELKDMEKYLLSSI 
+IDDGSASFPLLVYTSDSKTFQQAIIDHIDRTGQTTFTFYVQGGVSGSPMSNSCRGLFMSD +TPNTSSLHGVYNAIGTDGRNVTGSVVGSNWTSPKTSPSHKELWTGAQSFLSTGTTKNLSD +DISNYSYVEVYTTHKTTEKTKGNDNTGTICHKFYLDGSGTYVCSGTFVSGDRTDTKPPIT +EFYRVGVSFKGSTWTLVDSAVQNSKTQYVTRIIGINMP* +>MW460250_1_49 # 47510 # 49258 # 1 # ID=1_49;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.310 +MRLRIKNLYTYVEFEEDDKYLKDIFLKRVHTTIGARQEGFQYSPAYKRGSWDGYVDFYVY +EEDKFPTGLLFKIELLLGELQSRYNFQFETIDERDESFLSEEDIDDEITLLDNNVGQITL +RDYQYEAVYNSLTFYNGIAHLATNGGKTEVASGIIDQLLPQLEKGERVAFFTGSTEIFHQ +SADRLQERLNIPIGKVGAGKFDVKQVTVVMIPTLNANLKDPTQGVKVTPKQNISKKIAQE +ILPKFEGGTNQKKLLKVLLDNTTPKTKVEQNVLSALEIIYQNSKTDAEVLLNLRNHNAHF +QKIVREKNEKKYDKYQDMRDFLDSVTVMIVDEAHHSKSDSWYNNLMTCEKALYRIALTGS +IDKKDELLWMRLQALFGNVIARTTNKFLIDEGHSARPTINIIPVANPNDIDRIDDYREAY +DKGITNNDFRNKLIAKLTEKWYNQDKGTLIIVNFIEHGDTISEMLNDLDVEHYFLHGEID +SETRREKLNDMRSGKLKVMIATSLIDEGVDISGINALILGAGGKSLRQTLQRIGRALRKK +KDDNTTQIFDFNDMTNRFLYTHANERRKIYEEEDFEIKDLGK* +>MW460250_1_50 # 49270 # 50883 # 1 # ID=1_50;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.284 +MATKTQRKLYQYLEENATENKFHISTKKELADSLGVSISALSNNLKKLEEENKVVTVSKR +GKNGGVIITLVREYDTEELKEFNNSTDNIITSDLQYAKALREKHFPSYRYERKEQRRRTK +IEMAQYNAIKDEKRRIIADMNFYSEGLPYPSKDIFNMSYDPEGFYKAYILCKLYDQYAIS +HMDAKHTSHLKAMSKATTKDEYDYHQHMSEYYRNKMIQNLPRNSVSDNFFGSKMFNTFYN +FYLKIKDKNINVFKYMQNVFKNVTFYYENGMQPNPIPSPNFFSSDKYFKNYNNYIKGIKK +GVNSTNRHLGDTDSIINSSDYVKNPAVLHLHQLYTTGLNSTLHDIDTMFEQALDLENASY +GLFGDMKHIILLQYNSMIEEEIKNLPREEKDIINKYVKQCIINDYSPTSISPSARLSMFT +MQKEHIVYNKQLNKGIKREDLLPLSLGGIVNKDLLSGMDIQNLEQNGNEYLYMRQHTSTY +YILRMFGDYLGYEVNLREVKYIVEKYNLIDKIPLTKEGMLDYNKLIHLVEEEVNNYE* +>MW460250_1_51 # 50876 # 52318 # 1 # ID=1_51;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.312 +MSKKIKELILHKSMKDIHFAREVLDNLPKNLFSAESEDMGYLFTAIKRTAHISDKMSNEA +LAIKVEQLMGNNKEDEEKVTKTLTYLEDLYKVDVNEKDESVNYEIEKYIKTEMSKEVLVK +FIAENKQEDSDNLHELVDKLKQIEVSDISGGNGEFIDFFEDTEKKQELLSNLATNKFSTG +FTSIDNHIEGGIARGEVGLIIAPTGRGKSLMASNLAKNYVKSGLSVLYIALEEKMDRMVL 
+RAEQQMAGAEKSQIVNQDMSLNNKVYDAIQNHYQKNRKLLGDFYISKHMPGEVTPNQLEQ +IIVNTTIKKDKNIDVVIIDYPHLMRNPYAKYHSESDAGGKLFEDIRRLSQQYGFVCWTLA +QTNRGAYGSDVITSEHVEGSRKIVNAVEVSLAVNQKDEEFKSGFLRLYLDKIRNSSNTGE +RFVNLKVEPTKMIVRDETPEEKQEHIQLLSDNGKEDTSKFQNKDNKIEAINNTFGGLPGV +* +>MW460250_1_52 # 52397 # 53434 # 1 # ID=1_52;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.318 +MKFVFFTDSHFHLFTNYAKPDEQYVNDRFREQIQALQKMFDIAREEDATVIFGGDLFHKR +NAVDTRVYNKVFETFQLNRDIEVLMLRGNHDSVTNSLYTDSSIEPFGYLPNVEVCKNLDT +LGFLGEEQDINIVMAPYGDETEEIKEFIKNKYVEDRVNILVGHLGVEGSLTGKGSHRLEG +AFGYQDLLPDKYDFILLGHYHRRQYFQNPNHFYGGSLMQQSFSDEQEANGVHLIDTEKMT +TEFIPIHTRRFITIQGEDIPENFEQLIEEDNFIRVIGTANHAKVLEMDDSMKDKNVEVQI +KKEYTVEKRIDSDVSDDPLTIASTYAKQYSPESEQEILECLKEVL* +>MW460250_1_53 # 53434 # 53811 # 1 # ID=1_53;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.280 +MKKYREYLNKTDAENLAEDWEKVTEDLWKVFKDMKPKINTLDISNVVSKDLDKSKPILQF +QDSDGVIENICNVEGLEDGLSKMKKIFDDSNFEKHYYNRVVDHDEYYWIDYGSHHCFFRV +TKGDK* +>MW460250_1_54 # 53811 # 55730 # 1 # ID=1_54;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.297 +MVVFKQVEVNNFLAIKEATLELDNRGLILIEGENKSNESFHSNGSGKSTLISAITYALYG +KTEKGLKADDVVNNIEKKNTSVKLKFDIGEDSYLIERYRKDKENKNKVKLFVNEKEITGS +TNDVTDKQIQDLFGIEFNTYVNAIMYGQGDIPMFSQATDKGKKEILESITKTDVYKQAQD +VAKEKVKEVEEQQNNIRQEIYKLGYQLSTKDEYFQREIEQYNQYKEQLVQIENSNKEKDR +LREQEEKQIEAQIEQLASQIPTIPEDEFKHSEEYNKASQSLDLLSNKLTELNQVYSEYNT +KEQVLKSEIATLSNSLNQLDTNDHCPVCGSPIDNSHKLKEQENINNQIENKKQEITSVLE +MKDTYKEAIDKVKDKSQEIKDKMSQEDQQEREHNNKINSIIQEASRIKSDISSLENNKTY +LKVKYQHQSVQGLEREEPSKEKHEEDKKELQESIDKHEENIVQLETKKGKYQQAVDAFSN +KGIRSVVLDFITPFLNEKANEYLQTLSGSDIEIEFQTQVKNAKGELKDKFDVIVKNSKGG +GSYKSNSAGEQKRIDLAISFAIQDLIMSKDEISTNIALYDECFDGLDTIGCENVIKLLKD +RLNTVGTIFVITHNTELKPLFEQTIKIVKENGVSKLEQK* +>MW460250_1_55 # 55730 # 56326 # 1 # ID=1_55;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.281 +MKLKILDKDNATLNVFHRNKEHKTIDNVPTANLVDWYPLSNAYEYKLSRNGEYLELKRLR 
+STLPSSYGLDDNNQDIIRDNNHRCKIGYWYNPAVRKDNLKIIEKAKQYGLPIITEEYDAN +TVEQGFRDIGVIFQSLKTIVVTRYLEGKTEEELRIFNMKSEESQLNEALKESDFSVDLTY +SDLGQIYNMLLLMKKISK* +>MW460250_1_56 # 56341 # 57408 # 1 # ID=1_56;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.290 +MRFEDFLTQELGEPKENTIGELRYCCPFCGEKSYKFYVKQALDSSNGQYHCKKCDESGNP +ITFMKTYYNITGKQAFDLLESKNIDIERAPLLTTNNKDLTESEKLILMLRGVHQDKGNTS +IKPPRLPEGYKLLKDNLNNKEIIPFLKYLKGRGITLEQIINNNIGYVINGSFYKVDGESK +VSLRNSIIFFTYDNDGNYQYWNTRSIEKNPYIKSINAPAKQDEVGRKDVIFNLNIARKKK +FLVITEGVFDALTFHEYGVATLGKQVTENQIKKIIDYVSIDTSIYIMLDTDALDNNIDLA +YKLKTHFNKVYFVPHGDEDANDMGTRKAFELLKQNRVLVTPESIQSYKIQQKLKL* +>MW460250_1_57 # 57475 # 57813 # 1 # ID=1_57;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.274 +MSNNKKDILEFVDEYITALRVGNEQRQHQLEEMGKEETATLTDVAKAITNLMLGVNEQMT +DLEYNNELNLNILIDALYKAELINEDVLDYIQESIDKSQEEPKNEEEKGEQE* +>MW460250_1_58 # 57813 # 58265 # 1 # ID=1_58;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.331 +MEKNISTHTKGISQADMEKWIEAVVQGTVDGKQVDEKTAKQLDRIGSRSVSLEEATRIAK +VLNAVTAQEVTGDFNDAFNAIDLMMIIMEDELGVTQEKVGKAKDKLNEKREAYLKEKQEE +LRQKQQEEAQKKTESDSNEKVIQLKKNDEQ* +>MW460250_1_59 # 58252 # 58860 # 1 # ID=1_59;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.328 +MTNSKKKGDTFERKIAKELTAWWGYQFNRSPQSGGASWGKDNNAVGDIVVPQEANFPLVV +ECKHREEWTIDNVLLNNREPHTWWEQVINDSSKVNKTPCLIFTRNRAQSYVALPYDEKVY +EDLRNNEYPVMRTDFIIDNIRKDKFFYDVLITTMNGLTSFTPSYIISCYDKKDIKPYKKV +ESNLSEVSKHEDELINDLLSDI* +>MW460250_1_60 # 58877 # 59269 # 1 # ID=1_60;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.293 +MTSKERPLIVYFSGTGQTERLVNKININNSFETFRVKSGKEKVNKPFILITPTYKKGAIP +KQIERFLEINGSPKEVIGTGNKQWGSNFCGASKKISEMFKIPLIAKVEQSGHFNEIQPIL +EHFSNKYKVA* +>MW460250_1_61 # 59284 # 61398 # 1 # ID=1_61;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.312 +MATYGKWIELNNEITQLDDNGKNKLYKDQEALDEYLKYIEDNTRKFNSEVERIRVLTKEG +TYDKIFDNVPDTIIDEMTKLAYSFNFKFPSFMAGQKFYESYASKQYDENKKPIFVEDYEQ 
+HNVRVALYLFQNDYVKARELLVQLMEQTFQPSTPTYNNSGQANRGELSSCYLFVVDDSIE +SLNFVEDSVANASSNGGGVAIDLTRIRPKGAPVRNRPNSSKGVIAFAKAIEHKVSIYDQG +GVRQGSGAVYLNIFHNDILDLLSSKKINASESVRLDKLSIGVTIPNKFMELVKEGKPFYT +FDTYDINKVYGKYLDELNIDEWYDKLLNNDSIGKVKHDAREVMTDIAKTQLESGYPYVFY +IDNANDNHPLKNLGKVKMSNLCTEISQLQEVSEIYPYSYSNQNVINRDVVCTLGSLNLVN +VVEKGLLNESVDIGTRALTKVTDIMDLPYLPSVQKANDDIRAIGLGSMNLHGLLAKNMIS +YGSREALDLVNSLYSAINFQSIKTSMLMAKETGKPFKGFEKSDYATGEYFVRYIRESNQP +KTDKAKKVLNKVYIPTQDDWDELAKAVKVHGLYNGYRKAEAPTQSISYVQNATSSIMPVP +SAIENRQYGDMETYYPMPYLSPITQFFYEGETAYKIDNKRIINTSAVVQKHTDQAVSTIL +YVESEIPTNKLVSLYYYAWEQGLKSLYYTRSRKLSVIECETCSV* +>MW460250_1_62 # 61412 # 62461 # 1 # ID=1_62;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.320 +MDITQKVKQHNKNAVLKATNWNIEDDGMSDIYWEQGISQFWTPEEFDVSRDLSSWNSLTE +SEKNTYKKVLAGLTGLDTKQGGEGMNLVSYHEPRPKYQAVFAFMGGMEEIHAKSYSHIFT +TLLSNKETSYLLDTWVEENDFLKVKAQFIGYYYDQLLKPNPTIFDRYMAKVASAFLESAL +FYSGFYYPLLLAGRGQMTQSGAIIYKITQDEAYHGSAVGLTAQYDYNLLTEEEKKQADKE +TYELLDILYTNEVAYTHSLYDPLELSEDVINYVQYNFNRALQNLGREDYFNPEPYNPIVE +NQTNVDRLRNVDFFSGKADYEKSTNIKDIKDEDFSFLDSKEYSTAKEFL* +>MW460250_1_63 # 62479 # 62808 # 1 # ID=1_63;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.306 +MDRKEAMDLLSKAEILFKKHDEFSCVSDINDPMKLFSNSKDAKADDTSNSFQLEFMHDMT +MYTLSYGSGQLKLIDLAEGYEAQKATIVNSFPEIIKTLEKDDSEDGKNE* +>MW460250_1_64 # 62792 # 63112 # 1 # ID=1_64;partial=00;start_type=ATG;rbs_motif=AGGA/GGAG/GAGG;rbs_spacer=11-12bp;gc_cont=0.293 +MEKMNSLVDLNTAIRQKKDVIVMITQDNCGKCEILKSVIPMFQESGDIKKPILTLNLDAE +DVDREKAVKLFDIMSTPVLIGYKDGQLVKKYEDQVTPMQLQELESL* +>MW460250_1_65 # 63319 # 63915 # 1 # ID=1_65;partial=00;start_type=ATG;rbs_motif=3Base/5BMM;rbs_spacer=13-15bp;gc_cont=0.283 +MDELISKSRRYIMRDEKHYMLFNEKYNNDRLIEKVCKHGGKVTYYTDSVLPYYVLKDLSS +HPDSEVVYRMRNGFTAKEVDNIALSFMGTKVIIDISVVFPYVNPYDIIRSLHDIKTNVDE +VHLSFPRILGVDEKQEKFYFFDGEAYDLKPEYKVDFADKIRVSLSVWKMYIYILTSSRDF +EDVDNVITKLKQQRKIKI* +>MW460250_1_66 # 63925 # 64230 # 1 # 
ID=1_66;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.307 +MSTANRRDIARKISENTGYYIQDVEEILSAETDAISDLLEEGYTKVKNHKFMQIEVIERK +GKKAWDGLNKEYFHLPNRKAIKFKPLKELEEVIDRLNEEEK* +>MW460250_1_67 # 64306 # 65178 # 1 # ID=1_67;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.310 +MKVLILFDHIREEHFSVSKDGSVKSNVLNTPNGKTLKKLLEKCSNLKRDKTNRDYDIDFL +YNAVPTPIRNDYGKIIKYQDVKQAEVKPYYERMNNIIIDNSYDMVIPVGKLGVKYLLNVT +AIGKVRGVPSKVTIENGTSSHDVWVLPTYSIEYTNVNKNSERHVVSDLQTVGKFVEQGEE +AFKPKEVSYELVDNIERVREIFNKEVKNDNYDGVDITAWDLETNSLKPDKEGSKPLVLSL +SWRNGQGVTIPLYKSDFNWENGQDDIDEVLELLKNWLASKEDIKVAHNGK* +>MW460250_1_68 # 65344 # 65856 # 1 # ID=1_68;partial=00;start_type=GTG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.285 +MYRLNRGGTVKKDYMTSVKNNKKVCRRCNEELDLSNFKTYKKNDKTYYQSMCIPCRKEYN +KLDKTKNTIKKCYEKNGDKYRRQSNEYNTSDRGRELNKNRSRKYRENNSLKSKARSSVRT +ALRNGSLIRPDKCSECNKDCIPEAHHPDYTKPLEIKWLCKSCHEDTHHKK* +>MW460250_1_69 # 65992 # 67335 # 1 # ID=1_69;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.303 +MSTENFKDFESIQDTKVGWYLAVTQEVKESLRLSDLAYEVTDVGGYDKPLEDFKLWFVTK +LLRFFSDKIKEIQKENKKIAKKEYDVKAPEYKEWLENKLNETVVELDDTEKKFRVSELEK +KYIQLGLSPEIVNMNLVMDNDEFINIAEQSPEYMGLSDYAKSYTLNTAINLINEYRDVKD +VVNDIDGGNFNYDWFPIELMHPYASGDTDVCRRIYCDVIKKLKEQDRPKSMHLLEVNYPR +LTKSLARIESNGLYCDLDYMKENDESYESEMAKNHATMREHWAVKEFEEYQYNLYQMALE +EHEKKPKDRDKDIHQYRDKFKDGKWMFSPSSGDHKGRVIYDILGIQLPYDKEYVKEKPFN +ANVKEADLTWQDYKTDKKAIGYALDNLELKDDVKELLELLKYHASIQTKRNSFTKKLLNM +INKQKRTLHGSFSETGTETSRLSSSNP* +>MW460250_1_70 # 67603 # 68310 # 1 # ID=1_70;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.275 +MKEIWKKVVGFENYEVSNKGKVRNIKTNYILKPWIINSGYEQVSIGIANVLVHRLVAMTF +IPTDSYSIVNHIDNNKLNNCVENLEWVSYKGNSAHANKQGRLNTYSAREKLSSVSKKAIY +QKDMEGNIIKLWDSPSEAEKESNGYFKSTKISSVAHGKRKHHRSYTWEYVYKDSKRSLNK +SINMYDLNNNLLYEDLTMNKIMGILEMNNHKTLRDKLRNTDDFVEYRGYKFKNNN* +>MW460250_1_71 # 68544 # 69404 # 1 # ID=1_71;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.304 
+MRIIGLFTKDPDMLQSFLNGEDIHKATASIVYNKPVEEVTKEERQATKAVNFGLAFGESP +FSFAGKNNMEVSEAEEIFEKYFQTKPSVKTSIDNVHEFVQQYGYVDTMHGHRRFIRSAQS +TDKKIKNEGLRQSFNTIIQGSGSFLTNMSLTYLDDFIQSRNLKSKVIATVHDSILIDCPP +EEAKIMAKVTIHIMENLPFDFLKAEIDGKEVQYPIEADMEIGLNYNDMVEYDEEEIDTFN +SYQGYIKYMMNLQTLEDYKESGKLTDEQFEKATNVVKSEKHIYQEI* +>MW460250_1_72 # 69473 # 69715 # 1 # ID=1_72;partial=00;start_type=GTG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.337 +MNTGEIRFNRSMDEWIITSMYQDELGGMNIVVTFYNREENKHGSTVLPTESSTGEVTEEL +ASLEEEYPLALPLSSISVNI* +>MW460250_1_73 # 69732 # 70214 # 1 # ID=1_73;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.300 +MEIHIDSLDFTNFTIKDRNGNSQEFDITDELRITEYTIQEDFMQQSAKYAFWASILEKVR +AYSEMEQRNLETIGSKLNLTIRQEYEQQGKKPTKDMIESSVYIHDSYQQQLKVVEAWNYK +VKQLQYVVKAFETRRDMMIQLGAELRQTNKNGGITNPFSH* +>MW460250_1_74 # 70301 # 71572 # 1 # ID=1_74;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.355 +MDFNQFINNEASKLESNNSSFNNNVESYKPKNPVLRLGNIKDANGNKVVKENAFVRVLPP +AQGTNVFFKEFRTTGINYSKKDGSQGFTGLTLPAEEGSSVLDPYIQDWITNGVQFSRFPN +KPGVRYYIHVIEYFNNNGQIQPKTDAQGNVMIQPMELSNTGYKELLANLKDTMLKPSPNA +PHSFISATEAFLVNIVKAKKGEMSWKVSVYPNAPLGALPQGWEQQLSDLDQLAKPTEEQN +PNFVNFLINNVNNTELSHDNFKFNRETNVLGEEPSEPKQAPTQQDVDSQMPSNMGGQPNQ +PQQGQVGQYAQQGQSNGQGQQLQGTQQPINNTQFGQGTPSGQQPSNTGSVDWDNLAQQQS +QPDSNPFNDFDVSSVDDSQVPFETQPQNTQQAPEPQQTTQEPPKQKQTQSIDDVLGGLDL +DNL* +>MW460250_1_75 # 71632 # 71856 # 1 # ID=1_75;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.333 +MARAKKGKEVDLTDLNTIDLGKELGLTLLSDTNRADIKNVIPTMVPQYDYILGGGIPLGR +LTEVYGLTGSGCLK* +>MW460250_1_76 # 72201 # 73169 # 1 # ID=1_76;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.268 +MVKRVWTNEEKQDIVKSFQEGKTFKELQDKYNAHYFTIKKILDEFNIDTNKKRRWTNKQK +QDILRMYTKESMTIAEIKKVYTTHAREIGRILKDFGVDTSYYQTRSVNRNINRNFFEVID +TEEKAYILGLLMADGCVRYRREGQCYLTLELIDKEIVKRVQKELNSDSKIYESHRKRDYI +KNEKQTYTFSVTDEKLCNDLAKYGIVPMKSKKTECLTQDIPYDLRKHYLRGLFDGDGSIG +YYNNRWFITLINNHPEFLKDVGTWINDLLGLKCPKVSKTSTSYRIGYTGKKAKELMKLLY 
+QDNNIHIDRKQKLADQAIQDIV* +>MW460250_1_77 # 73317 # 74264 # 1 # ID=1_77;partial=00;start_type=ATG;rbs_motif=AGxAG;rbs_spacer=11-12bp;gc_cont=0.344 +MEQLGVDVSKLFSIQSGEGRLKNTVELSVEQVGKELEYWIDTFNEKIPGVPIVFIWDSLG +ATRTQKEIDGGIDEKQMGLKASATQKVINAVTPKLNDTNTGLIVINQARDDMNAGMYGDP +IKSTGGRAFEHSASLRIKVHKASQLKQKSELTGKDEYHGHIMRIETKKSKLSRPGQKAEA +DLLSDYMVGKEDDPILLNGIDLEHTVYKEAVERGLITKGAWRNYVTLNGEEIKLRDAEWV +PVLKDNKELYLELFSRVYGEHFPNGYSPLLNNKVIVTQLEEYQALENYYKEWATDNKQEE +QEEELKGESQEKDSE* +>MW460250_1_78 # 74268 # 74621 # 1 # ID=1_78;partial=00;start_type=ATG;rbs_motif=4Base/6BMM;rbs_spacer=13-15bp;gc_cont=0.314 +MDNLIDKNMNQVKESLGNANSSDVLPLPYKDIAKKFEEVKEKGESIIIEEGGFPYTDSTV +MYIEHVTDRWAGGYSLIRHEGEEVKVPKTIHFSDIYVKDKSHKVRIIFEGANPYEES* +>MW460250_1_79 # 74608 # 75270 # 1 # ID=1_79;partial=00;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=5-10bp;gc_cont=0.275 +MKKANNGNRYVIDIDGIPVDFERDLDSLLNRYKNLRWSLYHRYAGILSNDFERQELREYI +DEQFIKLVKEYNIRSKVDFPGYIKAKLTLRVQNSYVKKNEKYKRTEIIGKKDYTVESLTE +DLNEDFEDNQIMSYVFDDIEFTEVQSELLKELLINPEREDDAFIVSQVAEKFDMKRKEVA +SELTELRDYVRFKINAYHEYYAKKELNNHRVNTENHIWEN* +>MW460250_1_80 # 75398 # 76021 # 1 # ID=1_80;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.361 +MAKKNVNDVLQQESVTVADKYLQVKVNRDGYTRTHEGQYAYKVVSEGEELFLYPVQTDGK +GTLNVMKKSPIAYTDGDNIHFVVNTVVDPYNHSFIRTEDIKGLDKGKQLIQAFLAFVEDR +FKFGVYNVFVANNKEDVLSIVDPTDNDADEVKDSLEHAHEDVIADFPASPARKDVKGVDS +GEGQGDTSEPSAPKNVQVTPKEDVSAE* +>MW460250_1_81 # 76044 # 76556 # 1 # ID=1_81;partial=00;start_type=TTG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.365 +MAKLNLYKGNELLNSVEKTEGKSTITIENLDANTDYPKGTFKVSFSNDSGESEKVDVPQF +KTKAIKVISVTLDVDSLDLTVGDTHQLSTTITPSEASNKNVSFESDKSGVASVTSEGLIE +AVSAGTANVTVTTEDGSHTDIVVVTVKEPIPEAPADVTVEPGENSADITV* +>MW460250_1_82 # 76571 # 76798 # 1 # ID=1_82;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.355 +MEKTLKVYSNGEVVGSQVANNDGATTVSITGLEAGKTYAKGDFKVAFANDSGESEKVDVP +EFTTKTPTEEPSGDA* +>MW460250_1_83 # 76894 # 77154 # 1 # 
ID=1_83;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.295 +MDIPTILFRNPYDYTKVKKLMENKEQYIVVKFDSVSVHNLNVQGMMNVIQDYLHIYGYRV +KEYGQENSSKDDERDVKGYLYERVGE* +>MW460250_1_84 # 77158 # 77913 # 1 # ID=1_84;partial=00;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=5-10bp;gc_cont=0.279 +MGIIVNSNHIQSDTLYEYDSFFDIEKVDTFEEGLLSIQDEPTVLAGFIYDDITFNKVINS +NSDIDDYIKNNDIYYVSDIGLLPDTFITVDSDRKYYSLLQQITELSKDPFPKWVEDDAKG +LTKYYNFQDFEDVFDLNSFYKKEVDMVREKCYNNGNVYLLYEVLPDYKLPLAYSLLSNKE +HGIVIIGSQTRSNNDILTFYVKGMDAKAIASMFNVEHDYDSNIFHTFVNSHINILGNQIT +KFIREKGSSYE* +>MW460250_1_85 # 77906 # 79156 # 1 # ID=1_85;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.289 +MSNYKTIEEVQAVIIGVLFKDEGKIVTSKFNKITKEFGLDRIGKDDLKEIVEDIRQDAYL +NELKNKAIKGKVTLGDLKDVADNQVFEGNNYHEEVSTYVVAKEKELSHLREQRKHNRHTA +YPQIMFDELKEHMVKELQGETLVEHHGSKANINDTELIVLLSDFHIGSIVSDMTNGKYDF +EVLKSRLNHFINTTVKEIEDREISNVTVYFVGDLVEHINMRDVNQAFETEFTLAEQISKG +TRLLIDILNVLSNVVSGELRFGIIGGNHDRMQGNKNQKIYNDNIAYVVLDSLLLFQEQGL +LNGVDIIDNREDIYTIRDTFGGKSIIINHGDGLKGKGNHINKFILDSHIDLLITGHVHHF +SVKQEDFNRMHIVASSPMGYNNYAKELHLSKTKPSQQLLFVNKENKDIDIKTVFLD* +>MW460250_1_86 # 79170 # 79538 # 1 # ID=1_86;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.312 +MDTIFIIGVAFITFATFNIVFRLFDLWTTEKKMVSQGQPPLSNFEYYHVIVPYLVGVIVI +ILSIIFRDSLYSAQSGFGVIITSFIYMLVYVIIGLVGSFVLTIFQARKARQYQTQEDNNE +VQ* +>MW460250_1_87 # 79525 # 79836 # 1 # ID=1_87;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.272 +MKFNDIYEQLIKNDTVQNIHESQDDKGNIYTIQFDKGNDKYLFNVINDGFLKEMTNGMVD +HPEGQPYSVSLINKETPSMSVKQYLTDVEDIVPTIRKMEKDFL* +>MW460250_1_88 # 79900 # 80436 # 1 # ID=1_88;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.333 +MDFNFSAFDNSSLAMRISEGVYYFNDTPYYFIEHVEEEMSEYVIVYDIHDREEKENPQKK +YRIEPYQRTIPGGTPLSNLIKSMMPQRKYPKKVTEDPIFVANVIPLGTDTVTGKTGKGFF +ERDKDRTIYSQKEPTKVVHGQYTGVFIGLTSVKWNRTYTPLESVVEYYKRVKGDRLNV* +>MW460250_1_89 # 80429 # 81196 # 1 # 
ID=1_89;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.303 +MSNDVVKFYEKDIKDLIRTKKHMFKDDEITSDINDIRIFNEKVICQGKCRTDCLVLDRNG +TVMGIEIKTERDSTQRLNNQLKYYSLVCKYVYVMCHDKHVPKVEQILKRYKHNHVGIMSY +ISFKGKPVVGKYKDATPSPHRSPYHTMNILWKTNLMTILRLIRDPHTYRTGYSYNVSGRY +SGGEGNFSQTTQSKRMKKPAIINQIIHYVGVDNTYKLFTRGVIYGYNNRWEVIEEDFFNT +MKNGVRVINEQRQTK* +>MW460250_1_90 # 81174 # 81620 # 1 # ID=1_90;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.295 +MSKDKPNRRKEIQHQPVNFAPTNTLTGANNSFFAKKPSEPKDATSVIEYRILFIKRFDNV +TSTDVKLQKKYALNLISEALDVKETYLSLKQKGKKTESILHTDRVYYVHRGKKLIGKCSI +REQRTFKGKHLIFIFKTRHRVKAERKDK* +>MW460250_1_91 # 81620 # 82483 # 1 # ID=1_91;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.302 +MLKGFSEHVDKPTTIKTLYKTLTSGKVELLGVSYDSDYFPSGVTVQSYIEDIGNEDEGLQ +FVNKVNVVESMKQAVVGMNNQLGSSGLGYVRTEQLKKELEETGLMTDLLARGTNLTSTKK +VDIVSTFIEPEVTYQNITIAKDIKLRLYKVEEESPLNGYTHIVYLLTTEKLYDGQTLFGM +LSKKDKLSKGDTDKLLAFFRNNSLISKSVFCVKLLSKDYYFNLYNTHETGIFFLEDTDVI +TIACGQSYVKVNTKDIKSSYVKIEDKTHKLTELVINLKGDDTLTILF* +>MW460250_1_92 # 82855 # 83586 # 1 # ID=1_92;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.276 +MARKKNLRNKNSDIKVVPDKEKESILSKLYHNKLLRSKVDNALDEDMSYDDIIELCKEYD +LELSKSAITRYKSKRKEAIENGWDLGELIDKRKKTSVKDIKEKETPILEEEQLSPFEQSK +HHTQTIYDDIQVLDMIISKGAKGLEFVETLDPALMIRAMETKDKITGNQLKGMSFIGLRE +LQLKQTAQDTAMSEVLLEFIPEEKHEEVLQRLEELQNEFYKNLDLDEESRKLKEALDRVG +YTI* +>MW460250_1_93 # 83604 # 84062 # 1 # ID=1_93;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.296 +MADEISLNPIQDAKPIDDIVDIMTYLKNGKVLRVKQDNQGDILVRMSPGKHKFTEVSRDL +DKESFYYKRHWVLYNVSVNSLITFDVYLDEEYSETTKVKYPKDTIVEYTREDQEKDVAMI +KEILTDNNGNYFYALTGETILFDENKLNKVKD* +>MW460250_1_94 # 84127 # 84570 # 1 # ID=1_94;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.277 +MFISLNQEEKELLTKEESKYTPLETSREFNTPKEEFIVTSYNEGKPLDYIAKEAKVSMGL +IYTVLNYYKVGKRNKKSPVEERIAHILKDKNLVKEIIKDYQYMNLQDIYSKYNLHKNGLY +YILDLYHVERKSELKDKALEEDNIVVE* +>MW460250_1_95 # 
84587 # 85291 # 1 # ID=1_95;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.306 +MRNKKSFQEQLNDMRNKEKWVSEEEFTEEVAPPEEPEVEEEKLYTLNELKESLLDAQGLK +DVVADFPASKDLYEPNKLYICTIPKGYQSTEVQPGQYIGISTGLLSESEDFSHLRGQMPR +NLYETSHVLKPLIRINNTNIEYQQHELLEDIKDDKKIYDVELEDLRLATGEEVSHLEIVD +NKFFESRINEVLDRYTELTDSNDLLKYYSKLRELVGSDKMIYCSLLDKCVKIID* +>MW460250_1_96 # 85353 # 85751 # 1 # ID=1_96;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.321 +MSRKASIFYILVVIVLAFSISSYYISSFMYHDKAKNEVSTELSNTGKIKEEKNVEFVGDY +TLKKVEDNKAYFMETLPTYLPGRTGDNSIDMRYYKTSRFKEGVNFKLIRVYTEDGEDNPI +HKYRFEAVPTKK* +>MW460250_1_97 # 85898 # 86140 # 1 # ID=1_97;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.333 +MEMADLERFDAFVRLISDDELSEERILELSVDLLNPILEGGTAYKAKKRIKSKFGKLEAK +NFKRNYKFLLKSIAQIDQRR* +>MW460250_1_98 # 86145 # 86309 # 1 # ID=1_98;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.279 +MTEREKLIKDIEEANRDIQLQLKEVDNYKDSIRSKGTRNYISTKVLDSIMVGFI* +>MW460250_1_99 # 86511 # 86687 # 1 # ID=1_99;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.294 +MVIPSIKAQNKFKNELEYYKQGHISESKMLELAFDYIQELEQNNEYVTNLLEEERYGE* +>MW460250_1_100 # 86677 # 87210 # 1 # ID=1_100;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.287 +MVSKFIGVYLFNLLIAIILTLTLIGTITDSIESTLAQIIVGMFIIITIYGILSALIPILV +HKAVSPGWSYTEWNESYYIRLPGEENYKYYSKWYLDLLGVKEFYYKRDNGEEVKEKNISW +AFQAEVKRPEDVNHWKNQLLTNRPLTILEYKKLKKLDKESEIRKQEDLEEYKQYNSN* +>MW460250_1_101 # 87225 # 87473 # 1 # ID=1_101;partial=00;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=5-10bp;gc_cont=0.273 +MISSFDSILLVIYIIIAFAVAMAIIYLVFKGMTILLDKLMMLLLSKTTLDVEACSMIMAV +ISTIVFGIIVLLIWLAVNNILL* +>MW460250_1_102 # 87485 # 87661 # 1 # ID=1_102;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.288 +MDFNDFINSESDRVGKPKQKKKVENKLPSSTPIEDKEKKLKEIRKKSLYIDLRRKRND* +>MW460250_1_103 # 87654 # 87950 # 1 # 
ID=1_103;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.303 +MTKETNVLYKDKYRDYTIVVRLAGNIIVTEVDKKHKTAFTPIIFDNGVEGVELVMRIGSV +ELNMTDLREFTKEVSTAQKALEYFNKKLYIKGLTDEAF* +>MW460250_1_104 # 87998 # 88180 # 1 # ID=1_104;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.317 +MLLGILWFIWGFVSYFVLMFGIEFWKDRWMPGVIGAGTLLLFLFWIMKSIHNAMTVVYLY +* +>MW460250_1_105 # 88193 # 88561 # 1 # ID=1_105;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.279 +MDILIIHYKETNKRVLKETIQTIQNHLNDEHGLVKMTATKLSRENIEKRFNNYNIVIAED +DPDNSYHYGEAVEDADFIIDIPISYLDIHAGIEWDVDNPVDMLDRNPDFIEAVNKLNEDL +ML* +>MW460250_1_106 # 88574 # 88921 # 1 # ID=1_106;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.313 +MLNEKLKNLEDTKVYMINSIASLLSASTGKSSKVFFDEGTIKIVSGETKAVEVIDNLVHP +HSGRLPIKTTERIALGRLTDSLQFVISEIEVVKDQIIDEENEAYIDFVMEDWNWD* +>MW460250_1_107 # 88921 # 89199 # 1 # ID=1_107;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.308 +MPMDLLTIASVAFIAVVIVDLINDDMSYMLTGTAILINIWAGFYGWFFLLQAGMLLFLLL +ARKVKDDKESILYSSASLICALGMIINLLSFS* +>MW460250_1_108 # 89269 # 89574 # 1 # ID=1_108;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.252 +MSKETIRRQFSNAIEIMATTKEWWNFPKSFDTNKEFKIKTFKNDTLVFEVREGSRNLGSF +VVFTNIDFDYDKLEGTSTQYMINYFAKKLTKDMFNYHKLQL* +>MW460250_1_109 # 89589 # 89939 # 1 # ID=1_109;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.313 +MREELKPFNRKQVNVKGYLDDVKYSKRRRHKGNQHGCVKITVTDVKINGIPIDHVNIEVG +ISFYEKLKELQGKRIQFVGTVYKYVKHARGRKGRIKGFYKEDYSVTLDKKLQKEEK* +>MW460250_1_110 # 89939 # 90541 # 1 # ID=1_110;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.300 +MIKRRKHLDHSLQPEKGWRTVPFNGYYEAHPTGLIRNKVTKKLIKGTQTRKNHPKWTAHE +IVYLINPKKTSYSRGVVIAHTFPEMISQSRGDLKNGHVCFKDGDRSNCHVDNMFIGKGNV +NKNIYKLNDSYLTRKDIEEDVNNLVNERLFSQLELLIKKNEPERITPSNHFIKRDNNVFS +ITDLSKNSLVEFELEIKNIK* +>MW460250_1_111 # 90555 # 90734 # 1 # 
ID=1_111;partial=00;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=5-10bp;gc_cont=0.261 +MNEWYALCYYNKIGKKKIPRQIKAHRDVSVLEDLKDRLEEQNPKEEYKIKTTKEFDKER* +>MW460250_1_112 # 90961 # 91362 # 1 # ID=1_112;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.286 +MKLEDKVLERIDSLGNKAGNLSNQVMESLVKYQITYGIIDIVVSILVIALTIFLGKVYLK +EYKKVKMDLKESLLYDDYDDLSGIGWCYTILLILLTLFSLYAIVAGIPTDIMRLINPEVY +AVKDLIEQVKGGN* +>MW460250_1_113 # 91364 # 91624 # 1 # ID=1_113;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.341 +MKQRDFEFEEDFVLTYECEDCKHFEDWGHDEEPEECSECGSSDLISIIQVMKILSVICVE +GILICGKMDIDIWEIIKSILKKRNQV* +>MW460250_1_114 # 91676 # 91963 # 1 # ID=1_114;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.323 +MNKAVEQASNALGQGFSAMVWHQVLVGLGFILLGLVLSLLVWVLVKKFHVPFNHPTAFVV +YSIMLVSIVASFIWGGLHVINPEYYAILELKGFIK* +>MW460250_1_115 # 91974 # 92090 # 1 # ID=1_115;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.316 +MTKEELEQKVKELEAENKELKKQIERFEDEGGKTKDEQ* +>MW460250_1_116 # 92080 # 92343 # 1 # ID=1_116;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=11-12bp;gc_cont=0.280 +MNSREKKILTLTVNNFLMLALDIVALVRYKKGKIKQENYNTGQISRTIVTTANSLGILYL +EEQERKEKKSVKIGTLESGTLRGFKNK* +>MW460250_1_117 # 92420 # 92599 # 1 # ID=1_117;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.283 +MKHFILILGIVILVIALGIVLPAWILQLVLSAFGVKVSIWVCIGIFILISAIGSMFSRN* +>MW460250_1_118 # 92614 # 92877 # 1 # ID=1_118;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.330 +MAKYESNINGENYIATPSQALREALAKLITEEKSFAEYQTKGEEQYESQLQLRHFDTMIS +QYEEAIRVLEDKYRPQIFIPKDNKEEN* +>MW460250_1_119 # 92880 # 93197 # 1 # ID=1_119;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.283 +MKAESIARFFNDKVLQIEGYKVRFLQASSSYILDIDTIDESVLFLEAQVSTLSGKHLLDT +AITIERPETLSAKELYTEISNKLQAIVGDQTKTTIELSRYFKEEK* +>MW460250_1_120 # 93198 # 93878 # 1 # ID=1_120;partial=00;start_type=GTG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.267 
+MSNKTITNYLLNLEGIKGETYSIIAHINKQTGWGDKGDYFEISISYKADKDPRTTRYITT +EIFVDYGSNNPKEILLQLRDKIFSIVEEQVETDNDFIESIKEINSTKELEKLKPYINNEY +YSMFKSSIEKEIPVALSSEVLNRCTGKTSTLAYLALEKDLPLVVSNEPMRKMLKNKFPHL +RVASAEDYSNYDIKGEIVLIDEVDIDQLYSADKVSVDALLVGIIKN* +>MW460250_1_121 # 93967 # 94125 # 1 # ID=1_121;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.277 +MIPVIVILIGLILFLSSGYKLVLGKYYDDVDLKILFTIFGVGIALLLGGFIL* +>MW460250_1_122 # 94160 # 94360 # 1 # ID=1_122;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.318 +MNYRDFITDCISGGYNVHISVTEKRVHIISEMTSASYPKKEINLDELQAYVYYMNNFGSQ +ITTEGL* +>MW460250_1_123 # 94361 # 94651 # 1 # ID=1_123;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.278 +MELVINIVAVLVGMYAIYFYVTKFSTGLSGILIVLGMAIGLYFYLDYLNVRENVIRLVSV +MFGAFLFSIEMIYNKIMFEIKKSNVQKTVRVYDKEQ* +>MW460250_1_124 # 94743 # 95051 # 1 # ID=1_124;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.272 +MYPEIDVEELAYKLKSTREYLESITTKEVEIYEIYHLKTGKLVFKGEYIEVKELLRKMYK +ENLTLVDVDTMLSIGKGFIDVIKNISAENVFQITYKKELSTK* +>MW460250_1_125 # 95048 # 95956 # 1 # ID=1_125;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.285 +MIKIFSEVDKEYKPIITEKFPNGEINFKYDDLKYLVEEDLRFDVFFKWENDADLMHLYMF +TKYLEQLGIKDKAEFLEIAYLPYSRMDRVEEGHNNMFSLKYITEFINNLNYKSVWVAEPH +SPVTEELLTNSFAIDVTLKLLNQYIEMSEEPVTIVLPDKGAYDRYLFDVERILMESNIES +YSIVYGEKKRDFETGKIKGIKIIKDKNTLYDNCIILDDLTSYGGTFVGCKKALDKLKVSS +VSLILTHAERAFAEGALLSSGFKDIIVTDSMFPKNNWEKAIAKHRARINGTELQIKDIER +YL* +>MW460250_1_126 # 95974 # 97443 # 1 # ID=1_126;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.323 +MLNPTLMCDFYKLSHREQYPEGTEIVYSTLVPRSNKYYEHSDNIVVFGIQSLVKKYFIDM +FNKEFFNRPKEEVINEYKRTVKFTLGQENPDAKHLEQLHDLGYLPIDVRALKEGTVVHPN +TPVMTIENTHSDFFWLTNYLETIISTQTWQAMTSATLAYDMRKMLDKYAMETVGNIEAVD +FQGHDFSMRGMSSLETAQLSSAGHAISFKGSDTVPVVDFLESYYNADVEKEMVVASIPAT +EHSVMCANGNYETMDEYETYKRMLTEIYPTGIFSIVSDTWDFWGNMTKTLPRLKDIIMER 
+NGKVVIRPDSGDPVKIICGDPDADTEYERKGAVEVLWDTFGGTETEKGYKVLDEHVGLIY +GDSINYERAQQICEGLKEKGFASINVVLGVGSFSYQFNTRDTHGFAIKATYAKIKNEEKL +IYKNPKTDSGKRSHKGRVAVYKDGSWEDNLTLHQWLNKQNVNQLERVFEDGKLYRDQSLS +EIREIIKNN* +>MW460250_1_127 # 97522 # 97767 # 1 # ID=1_127;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.305 +MIYKISKHNYYSRFEHSTYPPDEGFAYVDYVDVILIGVDNPRKRKIITLKVNEFNPDDYR +VGHKYNIIKILWFEKWEWLKP* +>MW460250_1_128 # 97787 # 98179 # 1 # ID=1_128;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.272 +MIIDKLNGVKLEIGGHVVSFSVRKFNTINGERQLIDYHHIKRNRQQYFRTTEEFYNEYKE +IKPDKNEIDEMFESLGYVDTELDDVVRNQEKVTEILGVSEQYLNQLSYKAIEEYVDKVVT +LEIKELKGEK* +>MW460250_1_129 # 98181 # 98402 # 1 # ID=1_129;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.342 +MNNNWEKEGVNYWEKEGVNYWENEDCPREYLEKAFIDLVEYVEGVTVPPKDVKQLREDKL +REDIGFYEYVADK* +>MW460250_1_130 # 98468 # 98779 # 1 # ID=1_130;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.317 +MKKLIVLLTITISLLLGGCSPDNHEGKVVGVGEYREPTTYIKSGSVTVPVIGEMKYYVDL +ETDKGEDRVYLNKEVYHKFDKGDDFSNVGEKVYKNDELIYKGD* +>MW460250_1_131 # 98782 # 99291 # 1 # ID=1_131;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.269 +MYLNDYVGKFIKEDNYYGYQSTDLVSNYVQRLTLGRYKTKLNANKMKYERLPSSWKIIKA +KDLLRTDDYREGDIFVSERISVFGFNGIIVYNHDFNNVTVITQNRDGKATNPVEEHLYPK +KDIDYIIRPIERDYREYFKKSDSKEKVTLSKQEYKKLLEAYNKMKEVFK* +>MW460250_1_132 # 99293 # 99622 # 1 # ID=1_132;partial=00;start_type=ATG;rbs_motif=4Base/6BMM;rbs_spacer=13-15bp;gc_cont=0.294 +MNSTKLVEYFTNKQGKSLILPDENKVELYRVDVTPYTMRLNFTYNTEVVAIDIDKLHSDS +IEMHIPQGLYITTVVKITSTQSISSVLHKVLEEWVRQVQNDGIFGFVWE* +>MW460250_1_133 # 99628 # 99822 # 1 # ID=1_133;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.226 +MISIEHDYTIRTVDNRKYTYYSKYESLVTLYENIMSKDCIEVTKYGKDKKVIIDTRHIVS +IERW* +>MW460250_1_134 # 99846 # 100160 # 1 # ID=1_134;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=11-12bp;gc_cont=0.273 
+MINAGHAKYLSEIYEDDVHYETIDSIVEDILDNINDGIIEEAMKGNTSYQYVLRDLRVDN +EVEYRVIEELTNQGYSVNHISNDIEYPSISTNNLAGLDYLNIKW* +>MW460250_1_135 # 100175 # 100342 # 1 # ID=1_135;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.238 +MINKYKKLWDEITQQIVNVEIINFKNETVTIESTDDSGLSEIRGFEEVEFIDYYG* +>MW460250_1_136 # 100379 # 100480 # 1 # ID=1_136;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.284 +MDLFAKIIIMSIGVVPLLTIIVAQLITDYHDNH* +>MW460250_1_137 # 101353 # 101652 # 1 # ID=1_137;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.290 +MIDIYLGEGYNKEYLSKALRLINDHAPRELSYDFNNVEADVNIHTMLYVKPEDRFIYKDI +SYYFPGDLIICIVDDDAIVYHQGEQISGISILRILEEIF* +>MW460250_1_138 # 101668 # 101853 # 1 # ID=1_138;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.247 +MIGITILITIMSISTISMYIYFLVDLIQSIRYNSFDKVINVITFVLMTVIIASGILAILG +I* +>MW460250_1_139 # 101960 # 102250 # 1 # ID=1_139;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.309 +MIHIFVKEDYNKETLRSLLEYINDTVGRELTYGINTDYDKDVVIETDDPIDEEDTIELSG +TNMFKDDLCILIEELYCKAFVNGEPVIIRKYVEEML* +>MW460250_1_140 # 102250 # 102537 # 1 # ID=1_140;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.281 +MIIIFLTEKYDAKALKKVLEHIDNCSSRGLSYLMGKGEADVCIEKNVFRERDDVRINSNI +IDEGKLCILINRHGLECSYYRGISCNIGSFVKERL* +>MW460250_1_141 # 102537 # 102830 # 1 # ID=1_141;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.327 +MIEIYLSENYDKNLLKAELKWIKETASRELTYDVNRSPGLDVYVNPYRCTKDEVEEWSTL +PPFEDDILVFIAETWIHEYLKGESIGVDSMEEYVKEM* +>MW460250_1_142 # 102834 # 103091 # 1 # ID=1_142;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.283 +MFKVYYTVYHRGSMKTIKDKLDRSSLIYFLYDTWYKDISNVFPNHYNKEFGSKSDDIDID +KLIEAVNEEGILLINRGNYVTIREW* +>MW460250_1_143 # 103169 # 103408 # 1 # ID=1_143;partial=00;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=5-10bp;gc_cont=0.312 +MDTLTYTIIHKESDRVIASGLNETETMNLVQRMINTNLVTDISLDDYKRRPHGKIDVVNL +LVDIRRQGVFDFNHIWHVG* +>MW460250_1_144 # 
103419 # 103766 # 1 # ID=1_144;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.305 +MIVIYTDVSKDYLKDEFLPWLNERDRYLEYYKDELPEDIDSSYIVSVVYCKDMEGLLERK +DIVLDNSYNEPVALLGVPEFFGNYSNYFYYRGESISKHDLGEIVRLKAWQRMGGD* +>MW460250_1_145 # 103975 # 104313 # -1 # ID=1_145;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.198 +MKINYIPMWDNEDVLQYAKSQLLVNELETKEIIFKNYQISDDLDGGTDKKYYEIYESKFY +VDEETTKEEFNNLIIENEKLIKEYKTQNGLIKNLIKSQHEVNEFEYNVINIL* +>MW460250_1_146 # 104624 # 104932 # 1 # ID=1_146;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.366 +MDVKEIANTIMELWQMDGYRCAEPPLYESTLNHTRTHTALIVSINGNYDTVQMFRKTPIM +SMRGQSQPASMLVNVIDDVIIIVYENVVYGVQNKEIKFIEEI* +>MW460250_1_147 # 105138 # 105422 # 1 # ID=1_147;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.337 +MTNKNYLYEETHTVQGQDITAFRIPNDANGNPRYVVHFMDLDIKLADYDNINKLYGFKKY +TAKWFGGGVVFQSYNIADTLEYAYTQVKTNRISQ* +>MW460250_1_148 # 105497 # 105688 # 1 # ID=1_148;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.177 +MKFKIEKNNSDIKTLWNLAKNGYMSYQTVHNIFKNESDEFIIFNSKQTYNKFMKLRYNRS +AIQ* +>MW460250_1_149 # 106005 # 106493 # -1 # ID=1_149;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.276 +MVENKINEIWKPIKKEYFNKYNFYVSNLGRVKINDRLSKVHQDRDGYLTVRVNNKKHMVH +RLVYEYFGNDFIKSNHVHHIDGNKQNNCIDNLECISPSEHNKRHHKDNTFNRYNRGYALT +EDERKAIASKYKPRKYTQPMLAKEYNISEITVRRIIKKYKKD* +>MW460250_1_150 # 106661 # 106819 # 1 # ID=1_150;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.245 +MIKFKWKNKTIKSTQKTDNILLLIIGGLVATVTPKLVNWFLLLQDNINIFLR* +>MW460250_1_151 # 106889 # 107020 # 1 # ID=1_151;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.242 +MKKITTTLNLIGMKNNERFTEELKNYRQDVTFLKANKIVKYSK* +>MW460250_1_152 # 107188 # 107424 # 1 # ID=1_152;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.350 +MKLINRDNEIVISIATLESVKQALIWEYIDHLDNNILDKEIHDQEAVVITSDTLQSLKFA +DTMEELEEYVNDIGWKLV* +>MW460250_1_153 # 107504 # 
107974 # 1 # ID=1_153;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.380 +MTNTIQAFLQGQEASTVKDVATHGVQSGAIGKLIYTSDVVNFFDSYEQDIEAVITEYIEE +VTGQQYYDLLNYELMRDLENYANVEFEDEDEYNNIQFDLAENIASDEVEGFEDMDEADRA +EAIYEAMDDVELELQETDKVQYVNLAVEIVAQRMAL* +>MW460250_1_154 # 108004 # 108129 # 1 # ID=1_154;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.310 +MNNNTTSYSNSPYGSLEELREAYDLSSLSTGEIKELIQTFV* +>MW460250_1_155 # 108214 # 108393 # 1 # ID=1_155;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.256 +MRNLLEQEQLEKDVKDIIWVLDRMIAKGEQYTEAYDILVNKLERQEKRIVEIKKQNGIF* +>MW460250_1_156 # 108727 # 108963 # -1 # ID=1_156;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.287 +MKLYQVEHDNCEPYEDNFHFREDKIYTDKENLIKRIKEEGYKEETNHRGEQEFIKGDPRD +FYGMDMITIHELEFVNNT* +>MW460250_1_157 # 108965 # 109450 # -1 # ID=1_157;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.300 +MNKEQAKLKLETSIINYENQIKFLDPASMYTRGLIDAKGYSKLALKELEDTGKHSYEDTT +WKDSYAKVFTDEEILEFLLSKPRVTFKGNQEKLDEIKKEREKIQKEATKDLPKGSPLGDL +SKENYEKFWGALQWSREEREKLTQESRAYYENYLKKIKENK* +>MW460250_1_158 # 109463 # 109870 # -1 # ID=1_158;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.282 +MGLDFEVIGVTLSNRKVEQKGLQHFINNARYRHILEKNYYKGFNFEDDFRKPGYFMDLLL +RDAETYYDEFEEWCEGVFVLTKDKLVNLMKNEFNEKTFKGTHDAEYYYRLMSHIYNVEQY +EGKFYDFYLIMSVNV* +>MW460250_1_159 # 109870 # 110301 # -1 # ID=1_159;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=11-12bp;gc_cont=0.315 +MENYKNFIIEEMNKAHILVTKAEQIKRNRKLAETELEEVYKKAEAFDEIVNELLYQLQNL +ESWDTLDQKDCQTLKQILEENIKEEKQLKRYKVKRTITTEEVRYIDAETEEDAWYSVEYE +DEGADTAHYNAEYGTWSYEEEEK* +>MW460250_1_160 # 110304 # 110495 # -1 # ID=1_160;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.328 +MIEISISWTYLISFLLLWSAGILYINYLVYRIRLTNKERKEMSKEHHRNREEIKQRIENR +RDK* +>MW460250_1_161 # 110492 # 110977 # -1 # ID=1_161;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.288 
+MNKTFFKFLGKNTLEYSKQGLGFLVALPIMLIIFSVFLAFIIGIPAVIIYALHALNVDND +FIIQLVPVMWFIILYGIVRTGEHKKPFVKLKLKDYLLSILYLTTITAISVLENYLLFQSL +PFTGDVRAVITLLSFIVFVAVNRGICKIAIKSYKEYKEDSQ* +>MW460250_1_162 # 110970 # 111401 # -1 # ID=1_162;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.282 +MNIKYIDLVLENCDVVRLEPKDVSRFHISGITEGIDYYGTYKGTSSISRTRHCTYFGILI +DKPMEIPQVGFAYPDNTNAYEMITAYSDITAIDIIYENDANEYIYVDFNEYNDNYNINQK +NDYYNNMLEITITESNSKEEEDE* +>MW460250_1_163 # 111415 # 111957 # -1 # ID=1_163;partial=00;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=5-10bp;gc_cont=0.243 +MDKINLNKKHEGSTVVNISNNITLKIQCTDLRKECDDSEAPTTYTHFKAYIVYNIFIVVN +DRKQKKKVKYDCYNDHVGRGNVKDLLKVKDVIFQLSTQLNTNEIIKISGADERRYKIYKY +FIEKDIRFEDNMYYSKSNIWIINNFSLLQKFQWNTVVTKDGDYNKKELKKVDKEWKELLI +* +>MW460250_1_164 # 111969 # 112457 # -1 # ID=1_164;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.315 +MRETREYIMFWGKEDIYSNFYPIKFKHQGRTFNNSEQAFMWRKARYFNDFQIAGEILNAK +NPNHAKSLGRKVRNFNEEQWNKVRYNIMVEVVKDKFMTTHLKQRILDTDVRKDFVEASPY +DKIWGVGLKANDPKILEQSNWKGQNLLGKVMEDVRVHCIYNK* +>MW460250_1_165 # 112470 # 112868 # -1 # ID=1_165;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.251 +MKKKYFKGLKLNDFEKEVFGIKKNKKYKKMKKKLGRNEPKYWNYDMSFFIQLYADLNAFI +ESSNHVDMEYHTFVDVDGKERTQIDMIKHILSLIKYYHKEMDDFDMDKYDELEQVQSKIL +DNFKIVLPSLWN* +>MW460250_1_166 # 112865 # 113572 # -1 # ID=1_166;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.297 +MAIYVVPDIYGEYQKLLTIMDKINNERKPEETIVFLGDYVDRGKRSKDVVNYIFDLMSND +DNVVTLLGNHDDEFYNIMENVDRLSIYDIEWLSRYCIETLNSYGVSTVTLKYSSVEENLR +NNYDFIKSELKKLKESDDYRKFKILMVNCRKYYKEDKYIFSHSGGVSWKPVEEQTIDQLI +WSRDFQPRKDGFTYVCGHTPTDSGEVEINGDMLMCDVGAVFRNIDFPFIKLEVKK* +>MW460250_1_167 # 113672 # 114226 # -1 # ID=1_167;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.283 +MMVNVLPSVYDAEKGEWVTLLAKPIAEEVLKIMKADYLEHKGNIGFFISKYKDGDSSIEQ +PNVVVFYNEKDYDTMELTESELTNALNEYIDYTLDGKYKPFSLNNFINYLEDYGYRLPVN 
+FEVDVTIILSDGQKFTYPRTSSITNNASIVDALKSEDQYIEVKYIYNDHAIDDKKLAHGN +DTLK* +>MW460250_1_168 # 114242 # 114559 # -1 # ID=1_168;partial=00;start_type=GTG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.283 +MERTLNLYDSKGKLLKSSEKITGASAKIIIEKLTPNTVYSQGSFKISWTINGKESILTDV +PEFTTKSNEDKQEIVFNTLNIDSNSFVVSETEPSDKSKLWFKPIN* +>MW460250_1_169 # 115545 # 116093 # -1 # ID=1_169;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.237 +MKKIYILEEEIEEMDYDLWEEDTVYTTSYEILGYTDSLEDAEYIRDNYGTSNPIFINEYP +YITKEKLIEEQRYFRYNSYIELKRVNGYFEISEINDLQVTEDFSINKDDKNFDSPFSINM +FSHNRNSIGIEFIMFSEYDDKEDIIEKEKNSFLMKLKYLLKHSKEADIRSTSKIIDSIDK +LT* +>MW460250_1_170 # 116097 # 116315 # -1 # ID=1_170;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.260 +MKNIINFLVDYNINFSYSEDSLNVMNNSYLVDKHGTQDYEIVGNYEHITGVFSYQTEEEV +IAKLKNLIGVWE* +>MW460250_1_171 # 116316 # 116510 # -1 # ID=1_171;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.272 +MRDKRIHSELLYDIIGKHIQEEENITPYIEAIYVDMMNIIVVEYTFYNENGTRMLGQYPI +GEVM* +>MW460250_1_172 # 116500 # 117237 # -1 # ID=1_172;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.255 +MNLEKSFLLSTIEFGSTYQGTSDEHSDKDYMSLVVQPLSDTIFRNNEKASKHTEVSRYYA +VERFISLVLKSGFDNVLNLCAQLEQAKNTRFNKTVLDLFYDDFIFLTYVRANFKPIAYSV +IGNINNILKKGELTGKDLVKFYTFYNHLEYYNDLLDDLDNLNVSYKDFAKVKYMPKEVLD +NKRSNVSIEKKKDLVNKVEPLIQEVKDKLKSNESNIKHYKDAMELVEKSLKDKTVEFLTE +VYNER* +>MW460250_1_173 # 117300 # 117404 # -1 # ID=1_173;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.238 +MKYILGLITLGIILFKVYEHFKYKQDEVDTEEDI* +>MW460250_1_174 # 117416 # 117655 # -1 # ID=1_174;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.279 +MDFYQFLNHENVRVNSITPSQKNFIRENLELTNLEDTDIDFISSKQAKEEIEKIIRIKNE +EEYDIAMDALAGWVTKHGY* +>MW460250_1_175 # 117657 # 118046 # -1 # ID=1_175;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.318 +MFKKAPQYIMEKVEKENNILGEDLSLDIYYKGVKLTVKRHPETGHLNGYITLPSDINEKE 
+YDSLERRAHRGITYDDYDYEGKRVLGFDCAHAWDMTPYAIIGSLDDQYRDLEYVLSILKD +MAEYVKKDE* +>MW460250_1_176 # 118145 # 118318 # -1 # ID=1_176;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.328 +MEKVNHEFLAELAKSNSPVLNSKPLQDGDYNIEFDYDGFHFEFSQKNGYWRWSYNAK* +>MW460250_1_177 # 118359 # 118841 # -1 # ID=1_177;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.290 +MANEKEIIRMVNYLIDNMSMWHINYARAVLIPSEVEKIIKEHEKFDDLLKKRGEWLVKGS +DTDNIDDLETYNQIMNNQKDEMMIQEIDIYTQGKTITIDNEHYSSDDLGEVLNKLEQSED +IKIKSNYKSLYVGYTNVVGYEVTYASSYEETFKNDLEKDL* +>MW460250_1_178 # 118891 # 119433 # -1 # ID=1_178;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.276 +MDRIIGKHNLTQDLRLGDKVEVYDAHKFKENEDGTIELGDKITEGIVVDYKGDFTGNTSG +LVTLDSSEKELIIGEYNFKLIEEGNLQAVYDSVSKNKVESLSEDYDMYRKLLGVKSGELA +GIEDELEYLVRQYNSKVDNYNGLLTLSKEKARELSLLTGDKKMIPHMKNRRLELGTEADF +* +>MW460250_1_179 # 119433 # 119966 # -1 # ID=1_179;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.270 +MVYDSIISRTMAVSILNKWIAELITDVDLDKCKFTEEEYGKVVTNSINKIQDVLIEKNYE +VTDGELYDIVCTELINPIKNNTEEEKHNEKNDLLEHLEDLAFRHDIDLGYVSDGSYNLTV +THWLMQDEFTDVNIKVNNDEDFYTVTIPESKYFWLPITKENLEMFLTQDPINKGEVK* +>MW460250_1_180 # 119969 # 120133 # -1 # ID=1_180;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.267 +MKNLIKLLSMVVVTILTFSLTYVILKKETNNKRNGVAPFDFSLEDHIHLNKEIK* +>MW460250_1_181 # 120136 # 120411 # -1 # ID=1_181;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.264 +MANNIWAVVLSIVILLIILLILWFLFRKKVNGSSKNVEIQKAEEDNDNKEQEVEEAQYRE +LNEEEKEKNENSSKDYKYDKEKVKNKLKELE* +>MW460250_1_182 # 120411 # 121256 # -1 # ID=1_182;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.307 +MGRRLIDNSELNVIKYDGLPDFFSALKKNRVSGRDNSSDTGSYDFTGTHSFQEAYNLMVK +GDRESYDMVVKLKKMTDALFRMDKSVKRKPVVAPEGYQPHVPNAIKGLPNSMMSQQRVKA +EKKVIDVFYNSSISWMEDPENLAYRGAIMLSAIQTLETKGYSINLYLGKLSNSGYEDKLT +GFVVNIKHSYQRLNVFKSSFYLVNPSFLRRISFRVLEVEPDMVDLTNHGYGSVVSKSSYG 
+NKLTEHILDNAVIFDSSVGIDINNDSSENLRAVKKLFGGRL* +>MW460250_1_183 # 121268 # 122386 # -1 # ID=1_183;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.323 +MAKQDTIERLERLVEQQMETTKDLADKLGEKNSNPYEQAIVDAIVEKAGTESREIIITDV +KKQIEEYVEEQLSNLPVKIELQQEGKTIKDISGIFHYRYQDILKLVNQNIPVFLKGGAGS +GKNHVLEQVAEALDLDFYFSNAITQEFKLTGFIDANGKFHETQFYKAFTKGGLFFLDEMD +ASIPEVLLILNSAIANKYFDFPIGRVTAHEDFRVVSAGNTMGTGADHIYVGRQQLDGATL +DRFAQVEFDYDTKVEHQLSSNEDLVNFVQQLRHENDEKGLPYVFSMRAIINGSKLDGVME +DEFVVESIIFKSVPKDEINQFISSLPEGNRYTEATRKLLGMQQEPKQEPRKSDSTSKDSM +DFDTIMDKLGLE* +>MW460250_1_184 # 122540 # 122866 # -1 # ID=1_184;partial=00;start_type=GTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.254 +MSKRTDNFIYFCKYYFSEYLPSLGVEVLNHNETSHGTMEGVRKYYIANILYEGQELTVTI +DLEEFNNATSMHNMLEIMNNHTYNCMFMYDMDTHETKDIDDFFKLMYF* +>MW460250_1_185 # 122859 # 123275 # -1 # ID=1_185;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.273 +MNAKEFMKTQAQVEDYLDKLKVTIIEDALSVSKEWSNDSNDLGYALSSLGESIGLLEDYY +NIQVDAHLPEHYKGSKDVISFLEEHFSYDGFVDSMIFNIVKYTTRLGRKDAVDKEVQKIK +TYYVRLERNIKYGDSTRV* +>MW460250_1_186 # 123409 # 123711 # -1 # ID=1_186;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.300 +MEKVELIKQWAKDRNLQTGKPEGQMLKLLEEAGELASGIAKSNDHVTRDSVGDIFVVLTV +LCLQLDIDIEECIDMAYDEIKDRKGKLINGVFVKEEDLKK* +>MW460250_1_187 # 123711 # 123899 # -1 # ID=1_187;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.254 +MEKFQEDYVNIDIRVKAYVRVGYRYEEDITNNLHELVEDNLNVTSDSDNLIIKDTEIKGD +IE* +>MW460250_1_188 # 123943 # 124104 # -1 # ID=1_188;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.290 +MVKPVITLEPEDVKVLLDYLSFLEDDMRNYEGMRELYEELHKKYQLAKGNYSD* +>MW460250_1_189 # 124104 # 126152 # -1 # ID=1_189;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.257 +MAITYKQKGLTEQEIINLPKVNKGCIYIGEEDVFLKKKKNNIINLGSKELFRDIHNIFSF +DTATEIHLFLALCGNKEVTNFEGNPYETVEKLVEGVVDDNKGRSYKEYIESNREERKDFP +VYGYKSRRRIQSKGYVEEKIKELEGNDHLWRNESRQLEEYKKVVDSLNNDIMDVLDQGKY 
+GLIKSSIIVMNEDIEKGSSEYYSAMTDELYSRVWYMHPSTENYSSFGLKVKHIRDKHNMG +NKWVLENKSSFDVKTGEVKVFLTDSLVNKEITLNLYKDDISKSEYKNELTLSVLLNVILK +NYAQPNLTRGIIIKIIEQTLEHHNFDFSSWCPDNTDVYGHINYRGDKYRIFIGENSTSNY +LITLTDIVKNIDKINNLEEFGLFERNALLFHIPKKPKWKVHEAFNLTKQTYKKLLTLNKF +EQGNYLRFANILYKHYNHLHNEVNLHQLFDDTFLMVRDSRDVTDALKVKPIVNQILSISF +ANYKKMTHYLDVDAQDRQRITGYALDNYYLDYLHDLSILIREGYRTLESVSLTPFSLKLE +HDIVTDEKQSIQQQLDDAELKAKYDNKLEKIIDKTYKLKDGRKVKFLPADTVSKLKDEGK +MLSHCVGGYANRILKNSCLILLARLEEDLDNSWFTVEIRITDNGYVLGQQQSIDAYKLPN +ELKEALEKDIKKINKEEFKEVA* +>MW460250_1_190 # 126230 # 126493 # -1 # ID=1_190;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.258 +MSIEKKEEVIAHNEVVFRSLTQGLYVKEVDIYSDVVSYTKDVDEALAMPNTINFKNSRKY +KKLIMNLDLEPLNKIQKVIYETHLEGL* +>MW460250_1_191 # 126510 # 126683 # -1 # ID=1_191;partial=00;start_type=TTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.270 +MNDLIKEGNKYYHKVRAGETLWTISKNYDVEIKKLQELNNIKSVSLTNLEYVLVCVE* +>MW460250_1_192 # 126690 # 127268 # -1 # ID=1_192;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.273 +MDNLSHYLSILYAILITVGYIPGLVALVKAESVKGVSNYFWYLIVATVGISFYNLLLTDA +SVFQIVSVGLNLTLGIVCLLVASYRKKDYFSIPFIIVFSLLLFLLSDFTALTQTVATITI +ILAYVTQITTFYKTKSAEGTNRFLFLIIGLGLASLIVSMVLTHTYVHIIATEFVNFVLIL +ICYLQANYYSRG* +>MW460250_1_193 # 127261 # 127887 # -1 # ID=1_193;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.308 +MGNKIKDKVIYMGGHILNEAMVDYRDKQHKEVDGIVGVTPYSPHKDKSINDKANAEQTKL +AERILTNDFKAMQESDIFVFDILNEGLGTIAELGILLGMKHQAEETINHIYDNGEEYFNY +FTNKFETSLNTEEELIVDKLENIVNKPVLIYCSDIRQGHGKPYNDPDRAEFSTNQFVYGM +VLELTDGEGFISWEEVINRLEKLGEQDG* +>MW460250_1_194 # 127880 # 128776 # -1 # ID=1_194;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.265 +MKSYTKVKNKGIVLDKFKERGLVVQEKLDGSNASFTVENGELVCFSRRKKLNENETLNGF +YDWVHENINVRNTYVSALEKYIIFGEWLVKHKIQYKEEFYNNFYVFDVYDKENEVYLSVE +DMNVIAHHLGLKTVKTLLVSKPSHYLNDLKPEEIQELVGKSDMTVKPDKGEGIVIKYLDG +KSEYDDYFKLVSNEFKEFSRQKMKTEVKKNESVADYAITRARMEKMIFRAIEEDRLSEDD 
+LELENFGLIMKQVGQNFVDDIMEEEKENILKIVDKQIKKKMPHILREILEEKGDTIDG* +>MW460250_1_195 # 128776 # 129000 # -1 # ID=1_195;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.244 +MNYLAKVFINNNWLVKLITIVLLTLFLSGLVYVISAISLFLSTVLNLPGLVVLAFLASVS +LILFSIVHNSKEDN* +>MW460250_1_196 # 129069 # 129809 # -1 # ID=1_196;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.306 +MAIQLKELDFKLKDYPNVRYNMGEHLVFNEFLEKATTEQLDFCEDFFNDNVEILWNESQA +GTGKTMCSVACAYADYLNKNRKLVFIISPVSEDLGSRPGNQTEKEMAYFMGLHDALIELN +MNPEQQITEMLMMEDNVKEDKLGDCWVSQISHLFLRGGNLRDATIIINEAQNFKRSELKK +VLTRVHTKNSTVIVEGNFKQIDLKNESKSGFGDYMEYFKNYEGAVFHNFTVNFRSKLAQY +ADNFKW* +>MW460250_1_197 # 129861 # 130475 # -1 # ID=1_197;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.289 +MKKINSVIKGEGKKVQTADVRKISYYVKDYNPCMTVDDANDYNATSQYLVSDNGKFIAKY +NKDMNAVGFYEESGDTVKHLTHTTPERLEGTVFTIEEETEIDLINDTLPQGDILIKFSDG +SIYLPDNESVLDSVNYLADNDWDSVDDIIYTGLSKGNSENCIVDFNYNNYDIGYDDVEDE +DVCDNYPECECSNYCSSTGEYIGN* +>MW460250_1_198 # 130491 # 130916 # -1 # ID=1_198;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.298 +MQDSVNIYTDGSSSYNKGKVGSGAVLVSKEGNIISEISKSVDKPGLIKYNNVAGEILACC +YGIEEAIKLGYNQAIVYIDYIGLIHWYEGTWSARNILSKTYINMIREYQKVIDINFVKVK +SHSNDKWNDYADNLAKKSIDI* +>MW460250_1_199 # 130906 # 131097 # -1 # ID=1_199;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.281 +MKKGVFTVIADGFKFNVIAKDKKEVQEHCFKCFDFNYISVSFCREVYSDCEFPQFMEDYK +YAG* +>MW460250_1_200 # 131120 # 131761 # -1 # ID=1_200;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.255 +MENNNLVNFLMTTDDIDDTIEMVDSFELQDINKVLGEDTFLTIMEITDSLPDNQYKIVLL +SSLDKLLNTDRKELVEYDEEFPTIRKHNVSELKRDTVNSVIDSYMNTNVEILYTEYPTIS +NYSVVVDSVKVLNTLYLIESKNGKIEATLSEDGEDLHEYISEEGYSVTDILNKFDDVEDL +FDEDDSLINFFSDIDEGKNKTIKSFIELVINLK* +>MW460250_1_201 # 131751 # 131981 # -1 # ID=1_201;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.329 +MDEKKESKPLNLQKIRVEKGHTLRSLASEIGVHYSLISYWEYGKKKPRSANLMRLEKALN 
+TPGKELFKELEEDDGE* +>MW460250_1_202 # 131984 # 132211 # -1 # ID=1_202;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.263 +MNKFKRWFRINVLKKETLLFKVYWRYESPSLKKPHVFHIELYAKSKAEARNKSQEYILKN +AKASEDFKFLKVEEK* +>MW460250_1_203 # 132321 # 133013 # -1 # ID=1_203;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.348 +MKKTIFATLALGTAITFGGIATNEASADEIDYNKLAEQAKSNSAEVNTKPIQAGNYDFSF +SDGEFTYHFYNYNGNFGYEYHSGSTQVDNTVSRLAGEEQTPEQKVDQQQAQFDTQNKQDT +KKEVQTTSAPVQKETKQPTQSTSSTGGSVAEQIRQAGGDEAMIEIAMRESTMNPNAVNAS +SGAQGLFQGLGKSWSGGSIAEQTKGAKQYMIDRYGSTSGALAYHNAHNSY* +>MW460250_1_204 # 133200 # 133835 # -1 # ID=1_204;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.280 +MIGETINKLKVIKESSKRDKSRCKMYECLCECGEVIIVRSSTLRQGKIKSCGCESNKIHS +ELMRERNTTHGLSSNPMYQRWLGMKQRCYDVNAINYKNYGGRGIEICEEWKNDFKKFYDY +MGDPPNENYQIDRINNDGNYEPGNVKWSTRSENSTNIRKKSTHNIYKKSNNVYNIQIVRK +NKVKYFSAKSLEEAIELRDNVINKYNETGEW* +>MW460250_1_205 # 133902 # 134693 # -1 # ID=1_205;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.298 +MRKSVVISGVLGFLAIIGFIILLMCITKIPQGHVGVVYSVNGVKEDTKSPGWHLTAPFDK +VNKYPTKTQTHKYKDLNVATSDGKNIKLDIDVSYKVDATKAVNLFNRFGSADIEELEKGY +LRSRVQDNVRQAISKYSVIDAFGVKTGEIKQDTLNKLNDNLEKQGFIIDDIALSSPTADK +NTQKAIDERVKANQELERTKVDKQIAEENAKKKEIEAKGEKKANDIRSESLTEEVLQQQL +IEKWNGKQPISIGSDSVITNLNK* +>MW460250_1_206 # 134693 # 135001 # -1 # ID=1_206;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.275 +MALLLTYFAIFIVFLVLVGFGISYLFDFLSMKEKKSNIRKQYRELVRQGTLDEYGLEQYV +KYKKQFLNDRRQSIVTRADKQEIDQEEKALNSLIKEIEKGEM* +>MW460250_1_207 # 135114 # 135743 # -1 # ID=1_207;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.379 +MSASDAQFLKNEQAVFQFTAEKFKEWGLTPNRKTVRLHMEFVPTACPHRSMVLHTGFNPV +TQGRPSQAIMNKLKDYFIKQIKNYMDKGTSSSTVVKDGKTSSASTPATRPVTGSWKKNQY +GTWYKPENATFVNGNQPIVTRIGSPFLNAPVGGNLPAGATIVYDEVCIQAGHIWIGYNAY +NGNRVYCPVRTCQGVPPNQIPGVAWGVFK* +>MW460250_1_208 # 136014 # 136514 # -1 # 
ID=1_208;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.283 +MEKKLNEIPGLEIYENYTITDKGEVISYKGKEPKKLKLQKNNKGYLFVRLRYHSPKIHRL +VAMAFIPNPDNKEQVNHLNGKNDNSVGNLEWVSNSENREHAIKTGLKNEINYNIAQYDLE +GNLLNVFYTAQEALEFLGISNKRSGNIGRCIKGERKTAYGYIWKQY* +>MW460250_1_209 # 136674 # 137477 # -1 # ID=1_209;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.353 +MAKTQAEINKRLDAYAKGTVDSPYRVKKATSYDPSFGVMEAGAIDADGYYHAQCQDLITD +YVLWLTDNKVRTWGNAKDQIKQSYGTGFKIHENKPSTVPKKGWIAVFTSGSYEQWGHIGI +VYDGGNTSTFTILEQNWNGYANKKPTKRVDNYYGLTHFIEIPVKAGTTVKKETAKKSASK +TPAPKKKATLKVSKNHINYTMDKRGKKPEGMVIHNDAGRSSGQQYENSLANAGYARYANG +IAHYYGSEGYVWEAIDAKNQIAWHTGK* +>MW460250_1_210 # 137477 # 137980 # -1 # ID=1_210;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=11-12bp;gc_cont=0.349 +MANETKQPKVVGGINLSTRTKSKTFWVAIISAVALFANQIIGAFGLDYSAQIEQGVNIVG +SILTLLAGLGIIVDNNTKGLKDSDIVQTDYLKPRDSKDPNEFVQWQANANNTSTFEIDSY +ENNAEPDTDDSDEVPAIEDEIDGGSAPSQDEEDTEEHGKVFAEEEVK* +>MW460250_1_211 # 138065 # 138250 # -1 # ID=1_211;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.258 +MASAKQLYYTESLVGKAIINNKVSNKEEVWDKLELLPETKLEDLDNKQMSEVIKKLNQIN +E* +>MW460250_1_212 # 139797 # 140015 # -1 # ID=1_212;partial=00;start_type=ATG;rbs_motif=AGGA/GGAG/GAGG;rbs_spacer=11-12bp;gc_cont=0.274 +MKRQKMFYSSLICKECGNVFKVPRKRANKREEGHIKDIYCIKCCKTTKHIEDNRSEAERR +WDAIQEELTKDN* diff --git a/tests/test_data/overall/Standard_examples/SAOMS1_Output/prodigal-gv_out_tmp.fasta b/tests/test_data/overall/Standard_examples/SAOMS1_Output/prodigal-gv_out_tmp.fasta new file mode 100644 index 0000000..9d5684e --- /dev/null +++ b/tests/test_data/overall/Standard_examples/SAOMS1_Output/prodigal-gv_out_tmp.fasta @@ -0,0 +1,2102 @@ +>MW460250_1_1 # 183 # 392 # -1 # ID=1_1;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.267 +ATGTCAAAACATATTGAAATAACAATGTCAAGTGGAGCTAAATACTTTTTAGTATCTACTGATGAAAAGA +GTTACAATAGGCAAGATATAGATTATATGTTAAGAGGAATGGATGAAACCTCTATAAAAGTATATACTGA 
+AAGTGCTATAACTTCACCTCAAGTATATATTAATCCAAACAGAATAGAATCTTTTAAAATAGTATTCTAA +>MW460250_1_2 # 405 # 737 # -1 # ID=1_2;partial=00;start_type=TTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.306 +TTGGATAAGGAGATAAACAACCTTGTTTCTCAAGTAGAAACTATTAAGTCTAAGATACAAGAAGGTAATT +ATATAGATAGAGGAACTTTCAAAGATTTAGAAGTTGAGGTAGCTGAACTAAGAAAAATGATAGTTAGTAT +AGATAAAGATGTAGCTGTCAATAGTGAAAAACAATCAGCAATTTACGTCCAATTAGAAAGACTTGATGAA +AAGATTTCAGAACTTGCTGAAAGTACAAAAACTAAGGATACTGAGAAAAAGGATACAACTGAGAAGGTTC +TTTTACTTGTTCTAGGAGCTATATTATCCTTTGTCTTTAACAAATTTGCATAG +>MW460250_1_3 # 750 # 1076 # -1 # ID=1_3;partial=00;start_type=TTG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.306 +TTGATTAAATATAAGGATATCCTTAAGTTAGAGTTTAAAGATGCACTAGCCCATTTTAAAAGAGACAGAC +GCTATTTCCACGTCTATAGAATAGACAGAGTATTAATAAATGGCTCAATTATATACTTCGACTATTATTA +CCTTCCTTCTGATGACCCTAATATTGTAATAAAAGAATTAGACCTCCAAAGTTTTGGTAAGCTTAGGTTT +GAAATTGATACAAAAACATCTTACGGAAAAGTCGTTACGGATAATTATATGGAGATTATCAATGATTTCT +TAGAAAATTATGATATCCATTCTGAATCTGAAACGGTTAGACCTTAG +>MW460250_1_4 # 1636 # 1902 # 1 # ID=1_4;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.292 +ATGACATACTTTATCTTTGGTGGTATTGTTTCTAATGTCGCACTTACTGTAACAGATAAGTTCTTACTGA +AGAAAGAAGACCCCCTACCTGAATATGTTCTTAAAAAAGTAGAGATAAATGATAAAGAAATAAGAATAAT +CAAGAAAATAATAGAAAGTAATTATGGTATAACAGCAGAAGAGATAAAAGTTAGGGCTAAAGCACAAAGA +AGAGTAGAGGAAGATAGTAAAAAGGAAGATTACAATGAAAACAAAGAAAGAAATTAA +>MW460250_1_5 # 1880 # 2158 # 1 # ID=1_5;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.294 +ATGAAAACAAAGAAAGAAATTAAAGAACAAAGGAAAGAACTTAAGGATGGTGCTACATCTGTTTCTTTAG +TAAAAAAGGGAGATAAGAGAATAGCTAGCCCTAGTAGAATCTGTAGTCTATGTGGTCAGCAGTTATCAGG +TATGAATTACACTAAAGGAAAAGCATTATCAAAAGTTAATCATTTTCATTTACAGTATTCTAAGTATATT +TATTTTGATATTTGCGCAGATATCAACAATTGTTATAAAAATTTAAGAAAACGAGGTGAAATGGATTGA +>MW460250_1_6 # 2155 # 2565 # 1 # ID=1_6;partial=00;start_type=TTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.299 +TTGAGTGCAGAAAATATTAGAGATATAATTAACAAGAAAAAGTTAGAAGAAGAGGATACAAGAAAATATA 
+TAGCTGATGGGTTTATGAATGGTATCGGTAAATTAATGTACGAATTCAATAAGAAAGTAGATAACAAAGA +AATAGAAGTTAAAGACCCGAATGATTTATACAAATTATTTGTGATATTCTCTCAAATGCAAAATATGGTC +AATGAAACTTCTGAAGGAGGAGCAATACCTCAACTATCTAGACCTCAACAGGAATTATTTGATGAGATTA +CAACAGAAGATAGTAATGGAGAATCTACAGTTGATTTACAGAAGATATCAGAAATGTCAGCGGAAGATAT +TACAGCAATGATTTCTGAAAAGGAAAAAGTAATGAATGAGGAAAATTCAGAAACATTCTAA +>MW460250_1_7 # 2580 # 2777 # 1 # ID=1_7;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.323 +ATGGATGGAAAAGAACTAATTAAGATAGCACAAGAAACATTTCAAACTGAAAAAATAACAAGAGAACAGA +TAGACCATATAATCAATATGCTAAATCCTTCTACCTATATGCTTAAGTATCATACACTGAGAGGGCATCC +TATAACTTTTAGTATTCCTAATAGAGATAGAAGTAAAGCACAGGCTCATAGACCGTAA +>MW460250_1_8 # 3071 # 4042 # 1 # ID=1_8;partial=00;start_type=TTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.256 +TTGACATTAGAAAAGAGAAAACAAGAATATCTTAAGAAGTTAAAGCAAATTAAAAATGATGAGTTTGAAT +TGCTAGGAGGATTTACTAAAACAAGAGAGAAAGCTTTATTTAAACATAAAGTTTGTGGTTATGAATGGTA +TACTACTCCTTATAATTTATTGAAGTCTAAAGGGACAGGATGTCCTAAATGTCAATACAGAGATAAATCA +TATACTACTGATGAATTTAAAAAGAAACTTAAAGATAAATTTGGTTATGAATATGAGTTAATAGAAGGAC +AAGAGTATAAAAATAGTAGAGAAAAATTATTGTTTATTCATAATAAGTGCGGTACTGAATTTAAAATTAC +AAGTGATAGTTTATTTCGAAGTAAAGTACCTTGTCATAAATGTTCTAAAGAAAATAGAAAAACTAAAAAG +AAAACAACAGAACAGTTTAAAAATGAATTGTATAATAAACATAAAGATGAATATATACTTGTTGAAGGGT +CAGAATATAAGACAGCTCTTGAAAAGGTTAGAATAATTCATACGAAATGTGGATATACATGGGATGTTAG +AGCCTCACATATTTTGCATACTAGTAAATGTCCTAATTGTAATGAGTCTAAAGGTGAGAGTTTAATTAAA +GACATTCTTGAAGATAATAATTTTTCTTATATAAGAGAGTATACTTTTGAAGATTTAAAGAATGTTAAAA +AATTACCTTTTGATTTTGCACTATTCATTGATAATGAATTAGTAGGTTTAATTGAATATGATGGTTCTCA +ACACTTTATTCCTTTTGAACACTTTGGAGGTAAGGAAAAATTAAGAAAAACTCAATATAACGATAGAAAG +AAAAATGAGTATTGTGATAAGAACAGAATTCCGTTAAAGAGAATAAAGTATGATTTAGATGAAAAAGAAG +TAATAAGAGAAATAGAAATGTTTTTAAATAGTATAGTTAAAAGTAAAGCAGAGAGTTATTAA +>MW460250_1_9 # 4183 # 5730 # 1 # ID=1_9;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.326 +ATGGGTGTAATGGAAATGGTTCATTTTGCAGATATGCATAGTTATGCTAACGCAAAGTGTCTGTATACAT 
+TCCCTACAAATGAACAAATGAAAAAATTTGTTCAGTCACGTTTGAACCCTGTTTTAGAGAAAGAATATTT +TAGAGACATTGTTGATTGGGATAAAGACTCGTTAGGTTTTAAAAAGATAAGAAACTCTAGTTTATTCTTT +AGAACAAGTTCTAAAGCAAGTACCGTAGAGGGTGTGGATATTGATTATTTATCTTTAGATGAGTATGACA +GGGTAAACTTATTAGCAGAATCGTCTGCATTAGAATCAATGTCTTCATCACCTTTTAAGATTGTGAGAAG +ATGGAGCACACCTTCTGTACCTGGGATGGGTATACACAAATTATACCAACAATCAGACCAGTGGTATTAC +GGTCATAGATGTCAACATTGTGATTACTTAAATGAAATGAGTTATAATGATTACAACCCTGATAATCTTG +AAGAAAGTGGAAATATGTTATGTGTTAATCCTGAAGGTGTAGATGAGCAAGCTAAAACAGTACAAAATGG +CAGTTACCAATTTGTTTGTCAAAAATGTGGTAAACCACTAGATAGATGGTATAACGGTGAGTGGCATTGT +AAGTACCCTGAGCGTACAAAAGGTAATAAAGGGGTACGAGGGTATCTAATAACACAAATGAACGCTGTAT +GGATTTCTGCTGATGAATTAAAAGAGAAAGAAATGAATACAGAATCTAAGCAAGCATTCTACAACTATAT +TTTAGGTTATCCTTTTGAAGATGTTAAACTTAGAGTTAATGAAGAAGATGTTTATGGTAACAAATCACCT +ATTGCAGAAACACAATTAATGAAACGAGATAGATATTCTCATATAGCTATTGGTATAGATTGGGGAAATA +CTCACTGGATAACTGTTCATGGTATGTTACCTAATGGTAAGGTAGACTTAATACGATTATTCTCTGTTAA +AAAAATGACAAGACCTGATTTAGTTGAAGCAGATTTAGAAAAAATAATTTGGGAAATATCTAAGTACGAC +CCTGATATTATAATTGCAGATAACGGGGACTCAGGTAATAATGTTTTAAAACTCATTAATCATTTTGGAA +AAGATAAAGTATTTGGATGTACTTATAAATCTTCTCCTAAATCTACCGGACAATTAAGACCTGAATTTAA +TGAGAACAATAACAGGGTTACAGTGGATAAATTAATGCAGAATAAAAGATACGTACAAGCACTTAAGACA +AAGGATATAAGTGTTTATAGTACAGTAGATGATGATTTAAAAACTTTCTTAAAACATTGGCAAAATGTTG +TTATTATGGATGAAGAAGATGAAAAAACTGGAGAAATGTACCAAGTTATCAAACGTAAAGGTGACGACCA +CTATGCACAAGCAAGTGTCTACGCCTATATAGGATTAACAAGAATAAAAGAACTTCTTAAAGAAGGAAAC +GGTACAAGCTTTGGTTCTACATTTGTTTCTACTGATTACAATCAAGAAGGAAATAAACAATTCTACTTTG +ATGAATAG +>MW460250_1_10 # 5723 # 6544 # 1 # ID=1_10;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.279 +ATGAATAGAGGTGAAATAGACTTGACAGATAAATTATTTTATGGTACAATTAGTAATGAAGAAATTAATA +AAAGTGTATTGAATTTGTTATTGGGTGAGGAATTATCCTTAGATTATGTTTCTAAAAATAGTGATATTTT +AGATGTTAAATATGAACATGTTTATAAATCTCTAGGATTCGATAATTTCTTTGATTGTTTTTTATATGCT +AATAGAGAGCCTGAAATAGTCCATAAAGGTGGAGATAAAAATCTTGGTGGACTAAATAAGGTTAAACGTA +CTGTTATTCGTAATGGTAAAGAAATGGAAATGACAGTTTACGAAGATGGTAATAAAGAGAACGATAGTAA 
+AGAAAAACAAGAAGGAAAAGAAGAAGTTAGTAGAAGTGCAGTAGGAGCAAGAGCTATTTCTAATGGTGAA +GAAGGAAAGGTAAACCCTAAAAAAGTAGCAAATTCATTATCTAATTTAAGTAAAAAAGGTGTAGATGTAT +CCCATATTAATACAAACTCATCATTGTATAAAGAGTTTGTTGATGATAACGGTGATACAATAGGAATTAC +ATCTTTTAAACGAACTGAAAATGATATAATATTAGAATCTTATGCAAGTTCACCTGATTCAGATGGTGTA +GGAGCAAGAGCTATTATGGAATTATTACGTTTAAGTATTAAAGAAAATAAAAATGCAGTTGTGTATGACA +TAGAATTACCTGAAGCAATAGAGTATTTAAAAACTTTAGGATTTAAACCTAATAAAGATGGATACATCTT +AAGAAAAAAAGATGTAAAACAATTCTTAGGTGATTATAGTGATTTTATTTAG +>MW460250_1_11 # 6531 # 6704 # 1 # ID=1_11;partial=00;start_type=GTG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.213 +GTGATTTTATTTAGCACTATAGTCATCTATTCTATTGTATTTATTCTATATATTGTATTAAAAACAATTT +ATATAAAGTCTAATATGAGTAGAATAGATAACACAACTGAATTATTAAAAATATTACAGGAAGATATTGA +AGGTAAGATAAAAAAGGAAGGAAGAAATAAATGA +>MW460250_1_12 # 6701 # 7180 # 1 # ID=1_12;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.292 +ATGACTTTAGAAGAAAATAAATTAACATTAGAAGAATCAATAACTCCACTTAGCAAAGAGGAGAAAGAAG +ATAGTATTAAAGAATTTAGCAGTTTATTATGTGAAATGGTAAATAGACTATATAAGTCTTATAATGTATT +TAGACAAGACCCTATGGATGAAACTCAACGTCTAGATGGCTCTTTAATGGTCTTTCAAAGTAGATTAAAT +GACCCTTTAACAGGAGATTTACATGATAAGATGTATAAACTTGCTTTTTCAAAACGTATTGATATTTTCG +AAGCTAATAAGCAATTTAGAAAAGATGTAGAAGCAGGTAAAGCAATTGAGTTAGGTGATGTAGCTATTAT +AGATACAGCATTAAGTAACATCCTTTCAGGCAATGAGTTCCAAGGAAGTATTTCATTTATGCTTAGAAAA +GACTTTGAAGAAAAAGAACGAATTAGAAAAGAAGAAGAAGAGAAACTTAATAACTTATAA +>MW460250_1_13 # 7222 # 8415 # 1 # ID=1_13;partial=00;start_type=TTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.353 +TTGAAAAAGAAACCACAAGGCAATGAGGTAATCATAACCATAATAACGGTTATGATAGCAGTATTTGTAG +TCATTATGACCATATTTTTTAATAAATATCAAGATGCTAAAGAAGATAAAGATAGATATCAAAGATTAGT +AGAGATTTATAAAAAAGCAGATGATAATGATGGTGAGACTAAAAAGAAATATGTTAAAAGATTAAATAAG +GCTGAAGAAGAACTTAAAAAAGTAAAAAAAGAAACAAATTATAAAGATTATAATAAGAAGTCAAGTAAAG +AAAGACAAAAAGAAGATAAAGAAACTAGAGAGAAAATATATGATGTAACTGGTGATGATGACTTAATATT +AGTAAAAAATAATATTGAGTTTAGTGATAAAGTAGACAAGCCCGAAATACTTATTAGTGAAGATGGAATT 
+GGTACGATAACTGTTCCTGTAGATAGTGGGTATGAAAAACAAACAGTAGGTTCTATTATTACTAGTGTAT +TAGGTTCTCCTTTCCTATCACCTGGTTCAAATAGTATAGATGGTTTAAGTGTTATTAACGATAATGTTTA +TCCAAATACAGTAGATAGCATAGTAGAAGATACAAAACCTTCTATTAACTTACCAACGGATAATCCTATT +ATAACAAATCCAGTTGAACCAACTATACCTTCAGATATTATACCTCCTATTGATAATCCTTCAGTTCCGA +TATCTCCTGAGAACCCAGGAGATAATAATCAAGGAAATACAGATAATCCAAATCCTCCCCCTCCAGGGTA +CACAGATGAAGATGGTGGAAGAGGCTCCGGTGGTGGAGGAAATTCTGAACCACCATCAACGGAAGAACCT +TCGGATAATGGTAACACCGGAGGAGGAGATTGGGAAGAAAAACCTGACCCAGGAGAAGAACCTTCAGATA +ATGGTAATACAGGAGGAAATGGTGGAGAAGTTACGCCTGAACCTGAACCTGAACCTGAACCTGAACCTGA +ACCTGAACCTGAACCTGAACCATCTGAACCGTCTGACAATCCTGATGAAAATGGAGGATGGGAAACGGAA +CCAACTGAACCTGAGTCACCTTCAGAGCCGGACGATAAAGTGGACGAAGAGGATAAAAATGAAGATACTA +CAGATGATAAACAGTCCACTGAACAACCGGACGATAACAACATAGATAATGAAGATAAAACTGAAGAGGA +GTAA +>MW460250_1_14 # 8492 # 8842 # 1 # ID=1_14;partial=00;start_type=TTG;rbs_motif=GGxGG;rbs_spacer=5-10bp;gc_cont=0.234 +TTGTTAGGAATGAATATTATAACGTCACTATCAGTAGTATTTACCTGTTTAAGTCTTTTAACTTTAATGA +TTTTTGTTCATAGTAAGTTCTCTAGTAAAAACGTTTTTGTTTTGTATGTAATTTATGCTATAATAGGAAT +AGGTACATACATAGTTTTAACTATGTTTCAAACAACATCTGTACTTATTAAGAATGATGTAATAGATTCC +ATAGAAAATACTGAACATTATATTGTATTCAATGACCCTATAATTATATTTATTATAAGTTTTATAGGTG +CAATACTTGGAGGAATTTGGTACAAGATGATGAAAATTATTAAAAAAAGTAACTTTAAAGATAAAAAATA +A +>MW460250_1_15 # 8851 # 9231 # 1 # ID=1_15;partial=00;start_type=GTG;rbs_motif=None;rbs_spacer=None;gc_cont=0.281 +GTGAATAGGTTGATATTCTCTAAAGATAAAAAATGGGATGAAGCAAAAGATTTCATCAAAGGTCAAGGTA +TGCAAGATAATTGGATAGAGATTGTAGATTATTATAGACAGATAGGTGGAAAACACGTAGCTGTTTTTAT +TGCTTTAAACAAAGTAAAATACATGATTCTAGAAGCAACAAAAGACAATAAAGTAATATTAGTAGATAAA +GATAATAATATACTATTAGAAGATTATGATATTGTTATGGAAAGTAAGAAGATGTTTTATTACATTGAAG +AACCGTTTGAGGTTAAAATAAATATCCCTCAACATATTAGAGATGTAACTTATAATAATACTGTTGTATT +AACTACAGTAAGAGGGAGTAGAGGTGACTAG +>MW460250_1_16 # 9235 # 10926 # 1 # ID=1_16;partial=00;start_type=TTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.315 +TTGGCAGATTTATTTAAGCAATTCAGATTAGGTAAAGACTATGGTAATAATAGTACCATTGCTCAAGTTC 
+CTATTGATGAAGGATTACAAGCTAACATTAAAAAAATAGAACAAGACAATAAAGAGTATCAAGATTTAAC +TAAGTCTTTATACGGACAGCAACAGGCTTATGCAGAGCCATTTATAGAAATGATGGATACGAATCCTGAA +TTTAGAGATAAGAGAAGTTACATGAAAAACGAACATAACTTACATGATATTTTGAAAAAGTTTGGTAATA +ACCCTATCCTTAATGCTATCATACTTACACGTTCAAATCAAGTAGCTATGTATTGTCAACCTGCAAGATA +TTCAGAGAAAGGTTTAGGTTTTGAGGTAAGATTAAGAGACCTAGATGCGGAACCCGGTAGAAAAGAAAAA +GAGGAAATGAAACGCATAGAAGATTTTATTGTTAATACAGGTAAAGATAAAGATGTAGATAGAGATTCAT +TTCAAACTTTCTGTAAGAAAATTGTTAGAGATACTTATATCTATGACCAAGTTAACTTTGAAAAAGTATT +TAATAAGAATAATAAAACTAAATTAGAAAAATTCATAGCAGTAGACCCTTCTACTATTTTTTATGCAACA +GATAAAAAAGGTAAAATTATTAAGGGTGGTAAGAGATTTGTTCAAGTAGTAGATAAAAGAGTAGTAGCTA +GTTTTACTTCTAGAGAGTTAGCTATGGGTATAAGAAACCCTAGAACTGAATTATCTTCTTCAGGATACGG +ATTATCAGAAGTAGAAATAGCTATGAAAGAATTTATTGCCTACAATAATACGGAATCATTTAATGATAGA +TTTTTCTCACATGGTGGTACTACTAGAGGTATTTTACAGATACGGTCAGACCAACAACAATCACAACATG +CATTAGAGAACTTTAAGCGTGAATGGAAATCTAGTTTATCAGGTATTAATGGTTCATGGCAAATTCCAGT +GGTAATGGCAGATGATATTAAATTTGTAAATATGACACCTACTGCTAACGATATGCAATTTGAGAAATGG +TTAAATTACCTTATCAATATTATATCTGCTTTATATGGTATTGACCCTGCAGAAATTGGTTTCCCTAATA +GAGGAGGAGCTACAGGTTCTAAAGGTGGTTCTACTTTAAATGAGGCTGACCCGGGTAAAAAACAACAACA +ATCTCAAAATAAAGGTTTACAACCTTTACTTAGATTTATTGAAGACTTAGTTAATAGACATATTATATCA +GAATATGGAGATAAGTATACATTCCAATTCGTAGGTGGAGATACTAAGAGTGCTACTGATAAACTTAATA +TTCTTAAACTAGAGACTCAAATATTTAAAACAGTTAATGAGGCTAGAGAAGAGCAAGGTAAGAAACCTAT +TGAAGGTGGAGACATTATTCTAGATGCTTCATTCTTACAAGGAACAGCCCAATTACAACAAGATAAACAA +TATAATGATGGTAAACAAAAAGAACGTTTACAAATGATGATGAGTTTACTAGAAGGAGACAATGATGATT +CTGAAGAAGGGCAATCAACAGATTCTAGTAATGATGATAAAGAGATAGGAACAGATGCACAAATAAAAGG +TGACGATAATGTTTATCGTACTCAAACATCTAATAAAGGTCAAGGAAGAAAAGGAGAAAAATCTTCTGAC +TTTAAACATTAA +>MW460250_1_17 # 11120 # 11893 # 1 # ID=1_17;partial=00;start_type=TTG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.323 +TTGGAAGAAATAAAATTTAATGCTTTTGTACCTATGGATTTGAAGAAATCTGTATCAACAGCTTCTGATA +CTAATGAGTATTCTATCGTTTCAGGATGGGCTAGTACTCCAAGTATGGATTTACAGAATGATATAGTTAA 
+TCCTAAAGGAATAGATATAGAGTATTTTAAGTCACAAGGGTACATTAATTATGAGCATCAAAGTGATAAA +GTTGTAGGTATACCTACAGAGAATTGCTATGTGGATATAGAAAAAGGTTTATTTATTGAAGCAAAGCTAT +GGAAGAATGACGAAAATGTTGTTAAGATGCTTGATTTAGCTGAGAAATTAGAAAAATCAGGTAGTGGAAG +ACGTTTAGGTTTTTCTATTGAAGGTGCAGTTAAAAAACGTAATATAAATGACAATCGAGTTATTGATGAA +GTTATGATAACCGGAGTTGCATTAGTTAAAAACCCTGCTAATCCTGAAGCAACATGGGAAAGCTTTATGA +AATCATTTTTAACTGGTCATGGTACATCACCTGACACTCAAGTTGATGCAGGAGCTTTAAGAAAAGAAGA +AATAGCATCTAGCATTACAAATTTAGCTTACGTCACTAAGATTAAAGATTTAAAAGAGTTTAATGATGTA +TGGAATGGCGTTGTTGAAGATTTGAGTAAATCTAATAGTATGGGATATGAGGAATCAGTCCTTACGTTAC +AACTAGCTAAAGGTTTATCTCGTAAAGATGCAGAACTAGCAGTAATGGATATAAACAAACAAAAACTAGA +ATAG +>MW460250_1_18 # 11912 # 12868 # 1 # ID=1_18;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.322 +ATGAGTAAAGAAATGCAAAATATTTTAGAAGAGTATGATAAGTTAAATGCTCAAGAGGCAGTTTCGAAAT +CTGTAGAAGATGATGAAAAGAATACAGTAGAATCTACCGAAGAGCAAGTAGCAGAAACAACTGAAGAACC +TGCTAAAGAACCTGAAAAAGTATCTGAGGAAGATGCTAAAGAAGCACAAGAGCAAGGTGAAAAAGTTGAA +TCTGAAGAGGTAGCAGAAGGCAATGAAGATGAGGAAGTTGAAAAATCAGCTAAAGAATCAAAAGACCCTG +TAGACCAAAAAGATACTAAGACAGAAAATAAAGACAACGAGAAACGTAAAAATAAAAAAGATAAAAAAGA +AGATTCTGATTCTGACGATGAAGATAAAGATACTGACGATGATAAAGATAAGAAAGAAGATAAGAAGGAA +AAAACTTCTAAATCAATTTCTGATGAAGATATCACAACAGTATTTAAATCTATCTTAACATCTTTTGAAA +ACTTAAATAAAGAGAAAGAAAACTTTGCTACTAAAGAAGATTTAAGTGAAGTTAGTAAATCTATTAATGA +GTTATCAGCAAAAATTTCTGAAATCCAAGCTGAAGATGTTTCTAAATCAGTAGACACTGATGAAGAAGCT +GTAGAAAAATCAGTAACATCTACAAACGGAGAGCAAGAAAAAGTAGAAGGTTACGTTTCTAAATCAGTAG +ACACTGAAGAACAAGCTGAAACTGGTGAAGCAAAATCAGAAGAAGCTGAAGAAGTACAAGAAGATAACAC +ATTTAAAGGATTAAGTCAAGAAGAACGAACTAAGTTCATGGATTCTTACAAAGCACAAGCTAAAGACCCT +AGAGCTTCTAAACATGACTTACAATCAGCTTACCAATCTTACTTGAACATTAACACTGACCCTACTAATG +CATCAGAGAAAGATATTAAAACTGTAAAAGACTTTGCACAAATTTAA +>MW460250_1_19 # 12984 # 14375 # 1 # ID=1_19;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=11-12bp;gc_cont=0.372 +ATGACTATCGAAAAGAACCTGTCAGACGTTCAACAAAAGTACGCTGACCAATTCCAAGAAGACGTAGTAA 
+AGTCATTCCAAACTGGTTATGGAATCACTCCTGATACACAAATTGACGCAGGAGCTTTACGTAGAGAAAT +TTTAGATGACCAAATCACAATGTTAACATGGACTAATGAAGACTTAATCTTCTATCGTGATATCTCACGC +CGTCCTGCTCAATCTACAGTAGTAAAATACGACCAATATTTACGTCATGGTAACGTAGGTCACTCTCGTT +TCGTTAAAGAAATCGGAGTAGCACCAGTATCTGACCCAAATATCCGTCAAAAAACTGTATCAATGAAATA +CGTTTCTGATACTAAAAATATGTCAATTGCATCAGGTTTAGTAAATAACATTGCTGACCCATCACAAATC +CTTACAGAAGATGCTATCGCAGTTGTTGCAAAAACAATTGAGTGGGCTTCATTCTACGGTGACGCTTCAT +TAACTTCTGAAGTTGAAGGTGAAGGTCTAGAGTTTGATGGTTTAGCTAAATTAATTGACAAAAATAACGT +AATTAACGCTAAAGGTAATCAATTAACTGAGAAACACTTAAATGAGGCGGCGGTACGTATCGGTAAAGGT +TTCGGTACAGCTACAGATGCTTACATGCCTATCGGTGTACACGCAGACTTCGTTAACTCAATCTTAGGTC +GTCAAATGCAATTAATGCAAGACAACAGCGGTAACGTTAACACTGGTTACAGCGTAAATGGTTTCTACTC +ATCTCGTGGATTCATTAAATTACATGGTTCTACAGTAATGGAAAATGAACTAATCTTAGATGAATCATTA +CAACCATTACCAAATGCTCCACAACCTGCTAAAGTTACAGCTACTGTTGAAACTAAGCAAAAAGGTGCTT +TTGAAAATGAAGAAGACCGTGCAGGATTATCATATAAAGTAGTAGTTAACTCAGATGACGCTCAATCAGC +TCCTTCTGAAGAAGTAACAGCTACAGTATCTAACGTAGACGATGGTGTTAAACTTTCAATTAATGTTAAC +GCTATGTACCAACAACAACCACAATTCGTTTCTATCTACCGTCAAGGTAAAGAAACAGGTATGTACTTCC +TAATCAAACGTGTACCAGTTAAAGATGCACAAGAAGACGGAACAATCGTATTCGTAGATAAGAACGAAAC +ATTGCCTGAAACAGCAGACGTATTTGTTGGTGAAATGTCACCACAAGTAGTTCACTTATTCGAATTACTT +CCAATGATGAAATTACCATTAGCTCAAATTAATGCTTCTATTACATTTGCAGTATTATGGTATGGTGCAT +TAGCATTACGTGCTCCTAAAAAATGGGCTCGTATTAAAAACGTTCGTTATATCGCAGTTTAA +>MW460250_1_20 # 14467 # 14763 # 1 # ID=1_20;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.310 +ATGTTATACTATAAGAAACTATTAGATAAAAAAATGGCTACTGTTTATGGTACAGTGGAGATTGACAAAG +ATGGAGTAGTCAAAGGATTAACTAAAGAACAAGAAAAAGAATTTGCCAATGTTCCAGGTTTTGAATTTGA +AGAAGAAAAGAAAACTACTAGAAAACAATCAGCTTCTACTAGTAAAGAAGAAGAGCCTAAGGAAGAGGAA +AAGAAAGCCTCTACTAGAAAAACTACAAATACTACTAGAAAATCTACAGCACGTAAAACAACAGCCAAAA +AAGATGAAAATAAGTAA +>MW460250_1_21 # 14776 # 15684 # 1 # ID=1_21;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.355 +ATGGTTAACTCAATGTTTGGAGGGGACTTAGACCCTTATGAAAAATCATTAAACTATGAATATCCTTATC 
+ATCCTAGTGGTAATCCTAAACACATAGATGTAAGTGAGATAGATAATTTAACATTAGCTGATTATGGATG +GTCACCGGATGCAGTTAAAGCATATATGTTCGGTATTGTAGTTCAAAATCCTGATACAGGACAGCCTATG +GGTGACGAGTTCTATAACCATATATTGGAAAGAGCGGTAGGTAAAGCTGAAAGAGCATTAGATATATCTA +TACTACCTGACACTCAACATGAGATGAGAGATTATCATGAGACAGAGTTTAATAGTTACATGTTTGTACA +TGCTTACAGAAAACCTATATTACAGGTAGAGAACTTACAGCTACAGTTTAATGGTAGACCTATATATAAA +TACCCTGCTAACTGGTGGAAAGTAGAGCATCTAGCAGGACATGTTCAATTATTCCCTACAGCACTTATGC +AAACAGGACAATCAATGTCATACGATGCAGTATTCAATGGATACCCTCAATTAGCAGGTGTATACCCACC +ATCAGGAGCAACATTTGCACCTCAAATGATACGATTAGAATATGTATCAGGTATGCTTCCACGTAAAAAA +GCAGGAAGAAATAAACCTTGGGAAATGCCCCCTGAGTTAGAACAGTTAGTTATAAAATATGCATTGAAAG +AAATATACCAAGTATGGGGTAACTTAATTATTGGTGCCGGTATTGCTAATAAAACATTAGAAGTAGACGG +TATTACAGAGACAATAGGTACTACTCAATCAGCTATGTATGGTGGAGCTAGTGCTCAGATACTTCAAATA +AATGAAGATATAAAAGAACTATTAGATGGTTTAAGAGCTTACTTTGGATATAATATGATAGGATTATAA +>MW460250_1_22 # 15698 # 16576 # 1 # ID=1_22;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.319 +ATGGAAAAACCGTATATGATAGGAGCTAACTCTAACCCTAATGTTATTAATAAGTCAACAACATATACTA +CTACAACACAAGCAGATGAACAAGATAAACCTAAGTATACTACTAGACTAGAGTTTGATACGATTGACAT +GATTAGGTTTATTAATGACCGAGGTATAAAAGTACTATGGGAAGAAGCATATTTCTGTCCTTGTCTTAAT +CCTGATACAGGACATCCTAGAGTAGATTGCCCTAGATGTCATGGTAAAGGTATTGCATATCTACCTCCTA +AAGAGACGATAATGGCAATACAGTCTCAAGAGAAAGGAACTAACCAGTTAGATATAGGTATATTAGATAC +AGGTACTGCAATAGGTACCACTCAATTAGAAAAGAGAATTTCCTATAGAGACAGGTTTACTGTTCCTGAG +GTATTGATGCCCCAACAAATGATTTATTTTGTGAATAAAGATAGAATTAAAAAAGGTATACCTTTATACT +ACGATGTAAAAGAAATAACTTATATAGCCACTCAAGACGGTACAGTCTATGAAGAAGATTATGAAATCAA +GAATAATAGATTGTATTTAAATGAAAAATATGAGAATCATACAGTAACTTTAAAGATACTTATGACTTTA +AGATATGTAGTATCAGATATACTAAAAGAAAGTCGTTATCAATATACTAAGTTTAATCAACCTAAATCAA +AATTTGAAAACTTACCTCAAAAATTACTTCTTAAAAGGGAAGATGTCATTGTACTACAAGACCCTTATAA +AGTTAATGATGGTATAGAAGAAGACCTAGAAATTCAAGTAGATGACCCTAAGGCTTCGGCATCTAATCCT +AGTAATTTAGGTGGATTCTTCGGAGGTGCATTTAAATAA +>MW460250_1_23 # 16576 # 17196 # 1 # 
ID=1_23;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.320 +ATGCCAGTTCATGGAAAGAGACCTAATTTATTTAAAAATAAAAACTATAAGCAGGTAGGTAAGAGAACAA +TTGATGGTATGCGTTCAGAAGTTCTTGATAAATTACAAGCAACAGCACAGCAAGTAGAGAATACTAGTAT +TAAACGTATGCCTACTTATCTACAAATAACAGAGAAAAAGCTTGAAAAAGAAGGAGTAGTAGACCTTAAA +AAAGCTTTTGCTCACTCATCTAAAAAGAAAACTAGTAAAGATGGCGGATGGTATTTAACTGTACCAATCC +GCATCAAAACTAGTAGAATGAATAACAGTACTTATCAAGATATGAGAACTTTAAAAGTAGATAAAGGAAC +AGGTTCAGTTTCGAAGATAACTGATTACCTAGAAGGACGTAGGAAGAATGTAAGCCACCCTTCAATGAAG +CCTGAACCTATGACTCATAATATGACTAAAGTTAAAAGAGGAAAGCAATCTTCTTACTTTATATTTAGAA +CTGTTTCTAGTAAGTCACCTGCTAGTTCTTGGATACTTAACAGAGATAAAGTTAATGAAGATAACTTCTC +TAAAACAACTCTAAAAACTGTTAAGCAATTAATGAACTGGAAGATGAAAAATTTAAATTAA +>MW460250_1_24 # 17215 # 18051 # 1 # ID=1_24;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=11-12bp;gc_cont=0.302 +ATGGCAATAACATCAGTTGATTCATATTTATTATCAGAAATAAAGCCTAGACTTAACACTGTGCTAGAGA +ATTGTTATATTATAGATGAAGTTTTAAAAGACTTTGATTATCAAACTAGAGAGAGCTTTAAAGAAGCTTT +CTGTGGTAAGAATGCACAACATGAAGTAACGGTAGGATTTAACTTCCCAAAATTTAAAAATAACTATGAA +GCTCATTACTTGATACAATTAGGTCAAGGACAAGAGACAAAAAACTCTTTAGGGAGTATTCAGTCATCTT +ACTTTGAGGCAACAGGAGATACTTTAGTCGAATCTTCTACAGCAATAAGAGAAGATGATAAGTTAGTTTT +TACTGTTTCTAAACCAATAGGAGAGTTAATAAAGGTAGAAGATATAGAGTTTGCTAAATACGATAATCTT +CAGGTTGAAGGTAATAAGGTATCATTTAAGTATCAAACAAATGAAGATTATGAGAACTACAATGCTAACA +TTATATTTACCGAAAAGAAAAATGATTCTAAAGGTTTAGTAAAAGGATTCACAGTTGAAGAACAAGTAAC +AGTTGTAGGTCTTTCATTTAATGTAGACGTTGCAAGATGTTTGGATGCTGTACTGAAAATGATTTTAATA +TCTATGAGAGATAGTATAGAAGAGCAACAAACATTCCAATTACAGAATTTGTCTTTTGGTGATATTGCAC +CAATAATAGAAGATGGTGACTCAATGATTTTTGGTAGACCAACAATTATTAAGTACACAAGTTCTCTAGA +TTTGGATTATACTATTACACAAGATATTAATAAACTAACTTTTAAAGAAAGAAAGGATTGGAAGTAG +>MW460250_1_25 # 18053 # 18268 # 1 # ID=1_25;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.306 +ATGGCTAGAAAAAAGACACCTGAAAATAACACTCCTAAATTTAATGGTTATGTTCATATAGATACATTCC +TTGATACTGCAAAAACCCTTTTTAATATGAGGGATTCACAAGTAGCAGGATTTAAAGCTTATATGGAAGG 
+TAGTCATTATTTGTTTAGTGAGCAAGAATTCTTACCATCATTAGAGAAGTATCTAGGTAGGAAATTAGAT +ATATAA +>MW460250_1_26 # 18295 # 20058 # 1 # ID=1_26;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.362 +ATGGCAGTAGAACCATTCCCAAGAAGACCTATTACCCGTCCTCATGCATCTATTGAAGTAGATACTTCAG +GTATCGGTGGCTCAGCAGGTTCAAGTGAAAAAGTATTTTGCTTAATCGGTCAGGCTGAAGGCGGAGAACC +AAATACAGTTTATGAATTACGTAACTATTCACAAGCTAAACGTTTATTCCGTTCAGGAGAATTACTTGAT +GCAATAGAATTAGCATGGGGTTCTAACCCTAACTATACAGCAGGACGTATTTTAGCTATGCGTATAGAAG +ATGCTAAACCTGCTTCAGCGGAAATTGGCGGATTAAAAATAACATCTAAAATCTACGGTAATGTTGCTAA +CAACATTCAAGTAGGATTAGAAAAGAATACACTAAGTGATTCATTACGTTTAAGAGTAATATTCCAAGAT +GACCGTTTCAATGAGGTTTATGATAATATCGGTAATATCTTCACAATCAAGTACAAAGGAGAAGAAGCTA +ACGCAACTTTCTCTGTAGAACATGATGAAGAAACTCAAAAAGCAAGTCGTTTAGTATTAAAAGTTGGAGA +CCAAGAAGTTAAGTCATATGATTTAACTGGTGGAGCTTATGACTACACTAATGCTATTATTACAGACATT +AATCAATTACCTGATTTCGAAGCTAAATTATCACCTTTCGGAGATAAGAACTTAGAATCTAGCAAATTAG +ATAAAATTGAAAATGCAAATATAAAAGATAAAGCTGTATATGTAAAAGCAGTTTTTGGTGACTTAGAAAA +ACAAACAGCTTACAATGGTATCGTATCTTTCGAGCAACTTAATGCAGAAGGAGAAGTACCAAGTAATGTA +GAGGTTGAAGCAGGAGAAGAATCAGCTACAGTAACTGCTACTTCACCTATTAAAACTATTGAACCGTTTG +AGTTAACTAAGTTAAAAGGCGGTACTAATGGTGAACCACCTGCTACATGGGCAGACAAGTTAGATAAATT +TGCACATGAAGGCGGATACTACATTGTTCCATTATCATCTAAACAATCAGTTCATGCAGAGGTAGCTTCT +TTTGTTAAAGAACGTTCTGATGCAGGAGAACCAATGAGAGCTATTGTTGGTGGAGGATTCAATGAATCTA +AAGAACAATTGTTCGGTAGACAAGCATCATTATCTAATCCACGAGTATCATTAGTAGCTAACTCAGGTAC +TTTTGTTATGGATGATGGACGTAAAAACCACGTACCTGCTTACATGGTAGCCGTAGCTCTAGGTGGTCTT +GCAAGTGGTTTAGAAATCGGTGAATCAATCACATTCAAACCACTACGTGTAAGTTCATTAGACCAAATCT +ATGAGTCAATAGACTTAGATGAATTAAATGAAAATGGTATTATTAGTATAGAGTTTGTTCGTAACCGTAC +TAATACATTCTTCAGAATCGTTGACGATGTAACTACATTCAACGATAAATCAGACCCAGTTAAGGCTGAA +ATGGCTGTTGGGGAAGCTAATGACTTCTTAGTAAGTGAGCTTAAAGTTCAACTTGAAGAACAGTTTATTG +GTACTCGTACTATTAATACAAGTGCTTCAATCATTAAAGACTTTATCCAATCTTACTTGGGTCGTAAGAA +ACGTGATAATGAAATTCAAGACTTCCCTGCTGAAGACGTACAAGTTATTGTTGAAGGTAACGAAGCAAGA +ATTTCAATGACAGTTTACCCAATCAGAAGCTTCAAGAAAATCTCTGTTAGCTTGGTTTACAAGCAACAGA 
+CATTACAAGCCTAG +>MW460250_1_27 # 20131 # 20559 # 1 # ID=1_27;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.354 +ATGGCATCAGAAGCTAAACAAACCGTCCATACTGGTAATACCGTCCTACTTATGATTAAAGGTAAACCGG +TAGGAAGAGCACAATCAGCATCAGGTCAACGTGAATACGGTACAACTGGTGTATACGAAATCGGTTCTAT +CATGCCTCAAGAACACGTATACTTACGTTATGAAGGTACAATTACAGTAGAACGTTTACGTATGAAAAAA +GAAAACTTTGCAGATTTAGGATATGCTTCACTTGGTGAAGAAATTCTTAAGAAAGACATCATTGATATTT +TAGTAGTAGATAACTTAACGAAACAAGTTATTATCTCATATCATGGTTGCTCTGCAAATAACTACAATGA +AACTTGGCAGACAAATGAAATTGTAACAGAAGAAATCGAGTTTAGTTACTTAACAGCAAGTGACAAAGCA +CGTACTTAA +>MW460250_1_28 # 20656 # 20796 # 1 # ID=1_28;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.241 +ATGGCAAATAAGAGAAAAACAATAGGAAAAATGAGTAACACAAGAGCAACATGGAATATTAATCCGGTAA +CTAAAGTTAAAAAAGATAAAACAAAATATTCTAGAAAAAATAAACATAAAGGTCTTGACAATTATAATTA +A +>MW460250_1_29 # 20839 # 21297 # 1 # ID=1_29;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.272 +ATGAGTACATTTTGGTCAGAAAGAAGAACAACTAATAAAGATAGGCAAGTTAAAAAACATTATACTCAAA +TGAGTATGTATGAAAGAAAGAAATGTGTAGAGTTATTACAAGAGACAATTACTGAAAATAGAATTATTAA +TTTTACACGACATAGTGCAAAAAAGGTTAAAGGTAAACCAACAACAAATATACCTAAATTAATAGGTTTT +ATTTTTAAAAATAAGTTTGCCTACGAAAATATCATAGAGTACAATAACACAGATTATAATGGTAATATTG +AGAGGAGAATTGTTGTTAAACATCCTAAAGTTATAACTGTAGAAGGAAAACCTAGCTATCAGTTTTTGAC +AATTAGTCTTGAAGATGCTAGAGTTATTACGGTGTGGTATAACAGTGTAGATGATACACATAGAACACTA +GATTTAAATTATTATAGTAAAGACTTGACAATTCAATAA +>MW460250_1_30 # 21310 # 21504 # 1 # ID=1_30;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.246 +ATGGGTATAACAATAGTAAATAGTTATTTTATTCTGTCTAGCATTTTCCTCATCATATTAACCATATTAA +ATGGTAAGGGTACAGTTACAAGGGAATCATTAACTATGAGTAAAATATTAGTAGTAATAACATCAATTCA +ATTTTTAGCATGTTTAATTATTAATGGTATTTATTGGTCACTAAAATTTATGTAG +>MW460250_1_31 # 21586 # 21897 # 1 # ID=1_31;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.260 +ATGTCACAAGATAAATTAAGAGCAATTTACACAGAAATGAAAGTAGAATTACACAAATTTCCTAAAGAGG 
+TAGATATAACAAGTAAATCAACTGCAATTGCAATCAATCAGATTTTAGATAAATTCAAAACATTAACAGA +ACAAGCAGGAAAGATTACTAGAAAATATTTAGAAGGTCAAGAAATATTAACTATTGATTATGAGTATTAT +GATTCATTACAAGAATACTATATTTACCTACTTAGAAATAGTGAAAAGATTGAACAAAGTTTACAAGAAA +TTACTAAGCGTACAGGTGAATATGTAAAGTAA +>MW460250_1_32 # 22029 # 22487 # 1 # ID=1_32;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=11-12bp;gc_cont=0.305 +ATGGCAGAAGAAATTAAAAAGGAACAAGATGTACAAGAAACAACTAAAGAAGAAAAAAAAGATGTTAGTA +AAATGACACCGGAAGAAATAGATAAATTAAAATATCAAGACAAACAAGAAAAAGAACAAGTTATTAACAA +AGTTATTAAAGGCGTTAATGATACTTGGGAAAAAGAATATAACTTTGAAGAACTAGACTTAAGATTTAAA +GTTAAGATTAAATTACCTAATGCACGAGAACAAGGTAATATCTTTGCGTTACGTTCTGCTTACTTAGGTG +GTATGGATATGTACCAAACAGACCAAGTGATTAGAGCATATCAAATGTTAGCTACCTTACAGGAAGTAGG +TATTGAAGTTCCTAAGGAATTCCAAGACCCTGACGATATTTATAACTTATATCCTTTAACTGTTATGTAT +GAAGATTGGTTAGGATTCTTAAACTCCTTTCGTTACTAA +>MW460250_1_33 # 22531 # 23067 # 1 # ID=1_33;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.337 +ATGGAATCAATTGTTAAACAACCTTTATCTAGAAATCTATGGGCTATTATGAAAGAGTTTAATGTTTTAC +CTACCGAGCAAAGATTTAAGGACTTAGACGATTATCAGATAGAGTTTATTATTGGGAATATGAATAGAGA +TGTTTATGAACATAATAAACAACTTAAACAAGCTCAAAAAGGTGGAAAATTCGATAGTCAATTCGAAGAT +GATGATAGTAGTTGGTGGAATGAATCTCATGAAGACTTTGACCCAGTACCTGATTTCTTAGATGCTGATG +ATTTAGCACAACAGATGGAAGCTAAATTATCCGATAGAGATAAGGAAGAAAGAGCTAAGAGAAACGATGC +AGAGTTAAATGATGAAACAGAAGGACTTACTACACAACATCTAGCTATGATGGAATACATCAGACAGAAA +CAACAAGAATTAGATGATGAAGTAGGAAATGGTAAGACTAGTGAAGATGACGCTACTATATCACAAGATA +GCGTTAATAAAGCACTAGAAGACCTAGATGATGACTGGTATATGTAA +>MW460250_1_34 # 23120 # 27178 # 1 # ID=1_34;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.372 +ATGATGGCAATGAATGACGATTATAGATTGGTCTTGTCCGGTGATAGTTCGGATTTAGAGAATAGTCTAA +AGGCAATAGAACTTTATATGGATTCTTTAGAGTCTAAGAATATTGATGCTCCTTTAGATAATTTCTTAAA +AAAATTAAAAGTAATTGCTAAAGAAGTTAAAAATGTACAGAACGCAATGGATAAACAAGATGGTAAATCT +GTTATATCTTCTAAAGACATGGATGAATCTATTAAATCCACTCAATCTGCTACAAAGAATATAAATGAAT 
+TAAAGAAAGCTTTAGATGACCTTCAAAAAGAGAATATATCTAAAGGTATTGCACCTGACCCTGAAGTTGA +AAAAGCATATGCTAAGATGGGTAAAGTTGTAGATGAAACTCAAGAAAAACTTGAGAAAATGTCTTCACAA +AAAATAGGTTCTGATGCCAGTATTCAAAATAGAATTAAGGAAATGAAAACCTTAAATCAAGTAACTGAAG +AATACAATAAAATAAGTAAAGATTCTAGCGCAACTAAAGATTATACAAAACGATTAAGAGCTAATCGTAA +TATGACTAGAGGTTACATGGAGCGTTCAGAAGGAACAGGACGATTAACATATGACCAAGGTGCACGAGTT +AGAAGTGAGCTAGGTAAAATAAGTTCTTATGAGAGCCAAAGAAAACAAAACCAACGTAATTTAGGACAAG +CAAGAGAGCAATATAGCAACTATAGAAATCAACAACAAGACTTGACTAAACGTAGAGCTAGCGGTCAAAT +TAATAAGGAACAATATGAACAAGAGTTAGCTTCTATTAAACAGGAAATGAAAGCTAGAGAAGAACTTATA +TCTAACTATGAGAAATTAGGAGCAGAACTTGATAAAACAGTTCAGTATTATAAGGGTTCAGTTCAAAAGG +ATTTCCAATCTAGAGACGTAGACCAACAAAGAGGAACATTTGGTAGAATGGTTCAAGAACGTTTGCCATC +TATTGGTTCTCATGCTATGATGGGTACTACAGCTATGGCTACAGGTTTATACATGAAGGGTGCCTCATTA +AGTGAAACTAATAGACCGATGGTTACATCATTAGGTCAAAATTCCGATAATATGGATATAGATTCTGTAA +GAAATGCATATGGAGACTTGTCAATTGATAACAAATTAGGTTATAATAGTACTGACATGTTGAAAATGGC +TACTTCATATGAAGCATCAGTAGGACATAAAAGTGATGAGGACACAATGGCAGGAACTAAACAGCTTGCT +ATTGGAGGACGTTCTTTAGGCATTAAAGACCAAGAAGCTTATCAAGAGTCTATGGGTCAAATCATGCATA +CCGGTGGAGTAAATTCTGATAACATGAAGGAAATGCAAGATGCATTCTTAGGTGGTATTAAACAATCAGG +CATGGTTGGTCGTCAAGATGAACAACTTAAAGCACTAGGTTCTATAGCGGAACAATCAGGAGAAGGAAGA +ACTCTAACTAAAGACCAAATGAGTAACCTTACTGCCATGCAATCTACTTTTGCAGAGTCAGGAAGTAAAG +GATTACAAGGTGAACAAGGTGCCAATGCTATTAACAGTATAGACCAAGGACTTAAAAATGGTATGAATAG +TTCTTATGCTCGTATAGCAATGGGATGGGGAACGCAATACCAAGGTCTTGAAGGTGGATATGATTTACAA +AAACGTATGGATGAAGGTATATCTAATCCTGAAAACTTGACAGATATGGCTGATATAGCTACTCAAATGG +GTGGCAGTGAAAAAGAACAAAAATACCTATTTAATAGAAGTATGAAAGAAATAGGCGCTAACCTAACTAT +GGAGCAATCTGATGAAATATTTAAAGATGCTCAATCCGGAAAACTATCTAAAGAAGAGTTAGCTAAGAAA +GCTAAGAAAATGGAAAAAGAAGGTAAAAAAGAAGGAGAAGATAACGCCACTGATTATAAAGAATCTAAAT +CAGGAAAAAATGACCAAAATAAATCGAAGACTGATGATAAAGCAGAAGATACTTATGATATGGCTCAACC +ACTAAGAGATGCTCATAGTGCTTTAGCAGGTCTTCCTGCCCCTATATATTTAGCTATAGGAGCTATAGGA +GCATTTACAGCTTCACTAATTGCATCTGCAAGTCAATTTGGAGCAGGTCACTTAATTGGTAAAGGAGCCA 
+AAGGACTTAGAAATAAATTTGGTAGAAATAAAGGCGGTAGCTCCGGTGGTAACCCTATGGCAGGTGGAAT +GCCTAGTGGTGGTGGTTCACCTAAGGGTGGAGGCTCACCTAAAGGTGGGGGCACTCGTTCTACTGGAGGA +AAAATACTTGATAGCGCTAAAGGTCTTGGAGGATTCCTAGTAGGTGGCGCAGGATGGAAAGGTATGTTTG +GCGGGGAGTCTAAAGGTAAAGGCTTTAAACAAACATCTAAAGAAGCCTGGTCAGGTACTAGAAAAGTATT +TAATAGAGATAATGGTAGAAAAGCCATGGATAAATCTAAAGATATAGCTAAAGGTACCGGTAGTGGTCTT +AAAGATATCTATAATGATAGTATATTTGGTAAAGAAAGAAGACAAAACCTAGGAGAAAAAGCTAAAGGTT +TTGGTGGCAAAGCTAAGGGTCTCTATGGTAAGTTTGCTGATAAGTTTGGTGACGGAGGTAAAAATGGTAT +TCTTTCACAATCACCAAAAGCAGGTGGAAGTGGCATAGGGAAACTTGGAAAACTTGCAGGTGGACTTGGA +AAAGGAGCCGGAGTTTTAGGTGTTGCTACGTCTGCCTTATCATTAATACCTGCTTTAGCTTCCGGAGATA +GTAAAGCTATCGGCGGAGGAATAGGCTCTATGGGTGGAGGAATGGCAGGTGCATCAGCAGGAGCTTCTAT +AGGAGCTTTATTTGGTGGTGTAGGTGCAATACCTGGAGCTTTAATAGGTGGAGCTATAGGTTCCTTCGGA +GGAGGAGCTGTTGGTGAAAAAGTCGGAGACATGGCTAAAAAAGCTAACACTAAAGAAGGATGGAACCTAG +GATGGACTAACGGAGATAAGGATGGTAAGAATAAATTCCAAGATTCTTTATTAGGAAAACCTATATCTAA +AGCATGGAGCGGTATAACAGGTCTCTTTGATAATGACGCTGAAGCATCCGAAGAAGATAGTAAAGATAAG +AAAAAAGGTGTTAAAGGCGTTAAAGGAGATACTAAGAAGAAAGAAAAAATGACAGCAGAACAACTTAGAG +AAAAGAATAACCAATCTGAAACTAAGAATCTTAAAATCTATAGTGATTTACTTGACAGAGCTCAGAAAAT +TATTGAGAGTGCTAAAGGTATTAATATAGATGGAGGAACTTCTGATAGTGGTTCTGATAGTGGAGGCTCT +GCATCTGATGTAGGTGGAGAAGGCGCAGAGAAGATGTACAAGTTCCTTAAAGGAAAAGGACTATCTGATA +ATCAGGTAGGAGCTGTTATGGGGAACTTACAACAAGAATCTAATCTTGACCCTAATGCTAAGAATGCTTC +TAGTGGAGCATTTGGTATTGCTCAGTGGTTAGGGGCTAGAAAAACAGGATTAGAAAATTTTGCTAAATCT +AAAGGTAAAAAATCTAGTGACATGGATGTTCAATTAGATTACCTATGGAAAGAAATGCAGTCTGATTATG +AAAGCAATAATCTTAAAAATGCAGGTTGGAGCAAAGGTGGAAGCTTAGAGCAGAATACAAAAGCATTTGC +TACTGGATTTGAACGTATGGGAGCAAACGAGGCTATGATGGGTACTCGTGTTAACAATGCTAAGGAATTC +AAGAAGAAATACGGAGGCTCCGGTGGCGGAGGTGGTGGAGGAGCCCTATCCTCTACTTACCAAGAAGCTA +TGAGTAATCCTGTATTAACTACTGGTTCTAATTATAGGGGCTCTAATGATGCTTCTAATGCTTCTACAAC +TAACAGAATAACAGTCAATGTTAATGTTCAAGGTGGAAATAATCCTGAAGAAACTGGAGACATTATCGGA +GGAAGAATTAGAGAAGTTCTAGATAGTAATATGGATATCTTTGCAAATGAACATAAGAGAAGTTATTAG +>MW460250_1_35 # 27257 # 29683 # 1 # 
ID=1_35;partial=00;start_type=ATG;rbs_motif=AGGA/GGAG/GAGG;rbs_spacer=11-12bp;gc_cont=0.331 +ATGCGTAGAATAAGAAGACCTAAGGTAAGAATAGAAATCGTTACAGATGATAATACATTTACATTGAGAT +TTGAAGATACACGTGACTATAATGGTGATGAGTTTGGAGCTAAACTTTTAGGATTCCAAACTAAAAACTC +TATGGAAGATGATAGTTCAGTTTTCCAAATAAATATGGCAGGAGATACTTATTGGGATAAGCTAGTTATG +GCTAATGATATCATAAGAATATTTATTACACCTAATGATGACCCTAACGATAAAGAAGGAAAACAAGAAC +GACTTATCCAGGTAGGTATGGTTTCTCAAGTATCAAAAGTAGGTAGTTACGGTAATGACCAAACTCAATT +TAGAATAACAGGTCAATCTTTTGTAAAACCTTTTATGAAATTTGGATTAGGCGTTATTCAGGAAGTTCAA +GCTGTATTACCTGAAGTAGGTTGGCTTATTGATGGTGATGGAGATAATGAAGTAAAATTTACTGGTAGCT +CAGCTCATGAAGTAATGACTGGTATTATACGTAGATTTATACCTTATATGAAATATAACTATACTGAAAA +AACATATAATACAATTGATAACTATCTTGATTATGATGATTTAAGTAGTTGGGATGAGTTTGAAAAACTT +ACAGAAGTTTCAGCCTTTACTAATTTTGATGGGTCATTAAAACAGTTAATGGATATGGTAACAGCTAGAC +CTTTTAATGAGTTATTCTTCAAAAATTCAGAAAAAACACCTGGAAAGGCTCAACTTGTATTAAGAAAGAC +CCCTTTTAATCCTACTGAGTGGAGAGCTTTAGATATGATTAAAGTACCTACTGAGGATTTTATAGAAGAG +GATGTAGGTAAAAGTGATGTAGAGACATATTCTATATTTACAGCAACACCTGCAGGTATGTTGAAAGAGC +TTAACGGTGATGTATTTTCTAAACCACAATTCCACCCTGAATTAACTGATAGATATGGTTATACTAAATT +TGAAGTAGAAAATATTTATCTTAGTACAAAATCAGGTTCAGCTACTGAGGATTCAGATTCTTCAGGTGAT +GATAATGGCACAGAACGAGGAACTTATTCTAAAATTATGAAAGATTTAAGTAACTATGGAAGAGATAATA +TATCTAAAGGTATAGATAAGTATACAAGTAAATTATCTTCAAAATATAAAAACTTAAAAAAAGCCCAAGC +TAAAAAAATTATAGAGAAGTTTGTTAAAGAAGGAAAAGTAACAGAAAAAGAATATGAAAAAATAACAGGT +AATAAGGTAGATGATGAATTAACATCAGATAACAGACCGAAGTTGACAAAAGATAAATTAAAGAGTATAC +TAAAAGAGAAGTTTAAAACACAAGATGATTTTAATAATTCTAAGAAAAAGAAAAAAGCTAAGACAGATGC +ACTTAAAGAATTGACAACTAAATATCGTTTTGGTAATAAAACACATGCTACAACTTTATTAGATGAATAT +ATTAAATATAAAGGAGAGCCACCTAACGATGAGGCTTTTGATAAATATCTTAAAGCTATTGAAGGTGTTA +GTAATGTAGCTACAGACACAGGTTCAGATGCAAGTGATAGCCCTTTAGTTATGTTTTCTAGAATGCTATT +TAATTGGTATCATGGTAACCCTAACTTCTATGCAGGAGATATTATTGTTTTAGGAGACCCTAAGTATGAC +CTAGGTAAAAGATTATTTATTGAAGATAAGCAACGAGGAGACACTTGGGAGTTCTATATTGAATCTGTAG +AACATAAATTCGATTATAAACAGGGGTATTATACAACTGTAGGAGTAACTAGAGGTTTAAAAGACGCTAT 
+TCTAGAAGATGGTAAAGGTAGTCCGCATAGATTTGCAGGATTATGGAATCAATCATCAGACTTCATGGGA +GGTCTTATGGGTGAAGATACTTCTAAAGAACTTAAAGAAAAAGGTGTAGCAGAGAAACAAAGTAGTGGAG +ATAAAGATGGTGGTTCTGATAGTGGTGGAGCTCAAGATGGTGGCTCTTTAGATTCACTTAAAAAATATAA +CGGCAAACTTCCTAAGCATGACCCAAGTTTTGTTCAACCTGGTAACCGACATTATAAGTATCAGTGTACA +TGGTATGCTTATAATAGAAGAGGTCAATTAGGCATACCTGTGCCTTTATGGGGGGACGCCGCCGACTGGA +TAGGTGGTGCTAAAGGAGCAGGTTATGGTGTAGGTAGAACACCTAAACAAGGTGCTTGTGTTATATGGCA +AAGAGGAGTTCAAGGAGGTAGCCCACAATATGGTCACGTAGCGTTTGTAGAGAAAGTATTAGATGGAGGT +AAAAAAATATTTATCTCTGAACATAACTATGCTACCCCTAATGGATATGGTACTAGAACGATAGATATGA +GTTCAGCCATAGGTAAGAATGCACAATTCATTTACGATAAGAAATAA +>MW460250_1_36 # 29697 # 30584 # 1 # ID=1_36;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.282 +ATGGCAACAGATAAAGAAGCTAAAGATGTTATTGATAAATTTATAGACAATGTATTTAATTTTGATGTAC +TTACAAAAGAAAGAATAAAAGAAAAAGATGAAGAAATTAAAAAAATAACTACAGATGATATGTATGAAAA +GGTTGTGTATATACGACCTTATGTTGGAGTAATACAAAGCCTTAACCCTCAGCATGTTCAGTATGAATCA +TTTTCTAATAATGGTTATGATATAGAGGCAGAATTAAGTTTCAGGAAAGTAAGTTATTTAGTTGATAAAG +GGTCTATACCTACAGATTCTTTATCTACTTTAACAGTTCATTTAGTAGAACGAAATCAAGAACTATTAAT +AGATTACTTTGATGAGATACAAGATGTGTTGTATGGAGAATATATGGAAGAAGAATATGTATTTGATGAA +GATGTACCATTAAGTACGATACTAGCATTAGACTTAAATGATAATCTTAAATCCTTATCAAATATAAAGT +ATATGTTCAAAGGTGCTCCTAAAGAGAATCCATTTGGAACAGATAAAGATGTTTATATAGATACTTATAA +CTTATTATACTGGTTATATTTAGGTGAAGATGAAGAGTTAGCATATCCTATGAATATTAACTACTTCTTT +ACAGAGGGAAGATTCTTTACTATATTCGGTAAAGGACATAAGTATAAGGTAGATGTTAGTAAATTTATAG +TTGGAGATATATTATTCTTTGGTAGAAGTGATACTAATATAGGTATTTATGTAGGAGATGGGGAGTTTAT +ATCTATGATGGGTAAATTCCCTAAAGATGAAACACCTATAGGAAAATATAAACTTGATGATTACTGGAAT +GAATTTAACGGAAGAGTTATGAGATTCGATGAAGAGGTGTATATTTAA +>MW460250_1_37 # 30584 # 33130 # 1 # ID=1_37;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.311 +ATGGTAGTAAGATTCCAATCTTCCATGGGGAGAAGTTTAAAAAGAGTAGATTCGGATGATTTAAATGTAA +AAGGATTAGTTTTAGCTACAGTTAGTAAAATTAATTATAAATATCAATCAGTAGAAGTTAAAGTTAACAA +TTTAACTCTAGGAAGCCGTATAGGTGATGATGGTAGCTTAGCTGTACCTTATCCTAAATCTTTCATAGGA 
+AGAACACCTGAAGGAAGCGTATTCGGTACAAAACCTCTTATTACTGAAGGTTCTGTAGTATTAATAGGGT +TTCTAAATGATGATATAAATAGTCCTATTATTTTAAGTGTTTATGGTGATAATGAACAAAATAAAATGAT +TAATACCAATCCTCTAGATGGAGGTAAGTTTGATACAGAAAGTGTTTATAAATATAGTAGTTCACTATAT +GAAATTTTACCATCTTTAAATTATAAATATGATGATGGAGAAGGAACAAGTATTAGGACTTATAATGGTA +AATCATTTTTCTCTATGACATCAGGTGAAGAAGAGAAACCTCAGGCAACAGATTTTTATACTGGAACTGA +GTATCAAGATTTATTTACTTCTTATTATGGTAATAAGACATTAATTGAGCCTAGAATACAAAAGGCTCCT +AATATGTTATTTAAACATCAAGGAGTTTTTTATGATGATGGCACGCCGGATAATCATATAACTACTTTAT +TTATATCTGAAAGAGGGGATATAAGAGCCTCAGTTTTAAATACAGAAACACAGAAAAGAACTACACAGGA +AATGTCAAGTGATGGGTCTTATAGAGTTATCAAACAAGATGACGATTTAATGTTGGATGAAGCTCAAGTT +TGGATTGAGTATGGTATTAGTGAAGATAATAAATTTTATATTAAAAATGACAAGCATAAATTTGAATTTA +CTGATGAGGGAATCTATATAGATGATAAACCTATGTTAGAAAACTTAGATGAGAGTATAGCAGAGGCTAT +GAAGAATTTGAATGAAATACAAAAAGAACTCGATGATATAAACTACCTTCTCGAGGGTGTAGGTAAAGAC +AATTTAGAAGAATTAATAGAGTCTACAAAAGAGTCTATAGAAGCTTCTAAAAAAGCAACTTCAGATGTCA +ATAGACTTACAACTCAGATAGCAGAAGTGAGTGGTAGAACTGAAGGTATTATAACACAGTTCCAAAAATT +TAGAGATGAGACTTTTAAAGATTTTTATGAAGATGCTTCTACTGTTATTAATGAAGTAAATCAGAATTTC +CCTACTATGAAAACAGATGTTAAGACCTTAAAGACTAAAGTTGATAACCTAGAGAAAACTGAAATACCAA +ATATTAAAACTAGATTAACAGAACTAGAGAACAATAATAACAATGCTGATAAAATAATCTCAGATAGAGG +AGAACATATAGGTGCTATGATACAGTTAGAGGAAAATGTCACAGTACCTATGAGAAAATATATGCCAATA +CCATGGAGCAAAGTTACTTATAATAATGCAGAGTTTTGGGATTCTAATAATCCTACTCGATTAGTAGTAC +CTAAAGGAATAACAAAAGTAAGAGTTGCAGGTAATGTTTTGTGGGACTCTAACGCCACAGGACAACGTAT +GTTGAGAATATTGAAAAATGGTACTTATAGTATAGGATTACCTTATACAAGAGATGTAGCTATATCTACA +GCACCTCAGAATGGTACTAGTGGAGTTATTCCTGTTAAAGAAGGAGATTACTTTGAGTTTGAAGCTTTCC +AAGACTCAGAAGGTGACAGACAATTCAGAGCAGACCCTTATACATGGTTTAGTATTGAAGCTATAGAATT +AGAAACTGAAACTATGGAGAAAGACTTTATGCTTATAGGACATAGAGGAGCAACCGGATACACAGATGAG +CACACGATAAAAGGATATCAAATGGCTTTAGATAAAGGTGCAGATTATATAGAATTGGATTTACAATTAA +CAAAAGATAATAAGTTATTGTGTATGCATGATTCTACTATAGACAGAACAACAACAGGAACAGGTAAGGT +AGGAGATATGACCTTATCTTATATACAAACTAACTTTACATCTCTCAATGGTGAGCCGATACCATCTCTT 
+GATGATGTACTAAATCATTTTGGAACAAAAGTTAAATATTATATAGAAACTAAACGTCCGTTTGATGCTA +ATATGGATAGAGAATTATTAACTCAATTAAAAGCAAAAGGATTAATAGGAATAGGTTCAGAGAGATTCCA +AGTAATTATTCAATCATTTGCTAGAGAATCTTTAATTAATATTCATAATCAATTCTCTAATATACCTTTA +GCTTACCTAACAAGTACATTTTCTGAAAGTGAAATGGATGATTGTTTAAGTTATGGTTTTTATGCTATTG +CGCCTAAATATACAACTATAACTAAAGAATTAGTAGATTTAGCTCATAGTAAAGGGCTTAAAGTCCATGC +ATGGACGGTAAACACAAAAGAAGAAATGCAAAGCTTAATACAAATGGGTGTAGATGGATTCTTTACAAAC +TACCTAGATGAATATAAAAAGATTTAA +>MW460250_1_38 # 33237 # 34028 # 1 # ID=1_38;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.332 +ATGCCACAATCAGATGGAATAAGTAATCTTCATAGAATAGCTTTACGCTTCCCTAAAGAAGGCGGTGGTT +ATGATATGTATAGATTTAAAGTTAACCCTGAGAACTACACAATAGATTCACCACAACGTACGACAGCAAT +TAAAACAAAATCAGATATTGTAATAGAAGATTATGGTAAAGACATAGAAGTTATTAACTTCACAGGTACA +ACTGGTTTTAGACCTGTTAGAGAAGCAGATGGATTAAAAACAGGTAAGCAGAAAATGGAAGAGTTACAAA +GTAGAGTTAGTGAATATGCTATGCAAGGTGGCAGTGGTAATGTAAGTGGTTCTTACTTACAATTTTTTAA +CTTTACAGATGATAGTTATTATAAAGTTCATTTAGCTCCTCAGGGGTTAAAGATAACTAGGTCTAAAGAT +GAACCATTACTTTTTAGATATGAAATAACATTAGTAGTTATTGGTTCATTAACAGAAGCAGATAGAAGTG +CTGTAACAACAGAAGAGTTTGGTAACGTTAAACCTAATGCTTCTCAAAGAGTAGATGAGGGTATAAAAGA +ATTAGATAAAAATGCTCGTAAAACGAGAGATAGAAACAATCAAGAAATATCTAGAAGAGAAAATACAATA +CCTAAATCTACAGGAGATAATACGAACGAGGGTAATAGACTTAAGCAAAGCTTCCCTAGTAGTTCTATAT +ATAATCCTAGACAATCTACTAACGGATTAAAAGGTAATATTGACAATATGGCGCTGATAATAGGTTACGG +TGATGGAGGTGTATCTAGCTAA +>MW460250_1_39 # 34028 # 34552 # 1 # ID=1_39;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.290 +ATGAATAATTTTATACCACAACCTCAAGGTCTACTTAGATTTTTAAATGCCCTAGATACAGATTTAACTT +CTTCTCATATGAATTTACTGGATGAAGAGGTATCATTTGTATCTAAATTTTATACACCACAGCTACAATT +AAGTGAATTAGCAAAAAAAGTATTGACAAATATAAAGACAGATGATATACCTGTATTAGAAAGAGAATTT +AATGATAATACAATTATCCATAAAGCTAACGATACATTACTAAAAGTACAGGCTCCAAGAATGTATATGA +TTCTACAGTCGATTGTACTTGAAGCATATGCTATTGTTAATTGCTTTGTAGAAAATCCGAGCTCTTTAAA +ATACTTAACTGAAGAAGATGTTAGTATAACACGGGAAAATTTAAATTATGTAGCTGACTACTTAGGTAAC 
+TATGATGACTATAATAGTGTTGTCTTAGACTTAAGAGATTTAGACTTATGTTTTAGTGCTATAGAATTAC +AATTACCTCTAATCAAAAAGGAGGCTAACGTATAA +>MW460250_1_40 # 34552 # 35256 # 1 # ID=1_40;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.308 +ATGAGATTTAAGAAGCACGTAGTTCAACATGAAGAAACGATGCAAGCAATAGCACAGAGATACTATGGTG +ATGTGAGTTATTGGATAGACCTAGTAGAGCATAATAATTTAAAGTACCCCTATTTAGTAGAAACTGATGA +AGAAAAAATGAAAGACCCTGAACGATTGGCTTCTACAGGTGATACACTGATTATACCTATAGAATCTGAT +TTAACAGATGTATCAGCAAAAGAAATTAATTCTAGAGATAAAGATGTACTAGTTGAATTAGCTTTAGGAA +GAGATTTAAATATTACTGCAGATGAAAAGTATTTTAATGAACATGGTACTAGTGATAATATACTAGCATT +CAGCACAAATGGTAATGGAGATTTAGATACTGTAAAAGGCATAGATAATATGAAACAGCAATTACAGGCA +CGTTTATTAACTCCTAGAGGTTCTTTAATGCTACATCCTAATTACGGTTCAGATTTGCATAATTTATTTG +GTCTTAATATACCTGAACAAGCTACATTAATAGAAATGGAAGTATTGAGAACATTAACATCAGATAATAG +AGTAAAATCTGCTAATCTAATTGATTGGAAAATTCAAGGTAATGTTTATTCAGGTCAATTTTCAGTGGAA +ATAAAATCTGTTGAAGAATCAATAAATTTTGTCTTAGGACAAGATGAGGAAGGAATTTTTGCTTTATTTG +AATAG +>MW460250_1_41 # 35271 # 36317 # 1 # ID=1_41;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.312 +ATGAAAACTAGAAAATTAACTAACATACTATCAAAATTAATAGATAAGACAATGGCAGGTACAAGCAAGA +TAACAGACTTTACTCCTGGTTCAGCTTCTCGTTCATTATTAGAAGCTGTATCATTAGAGATAGAGCAATT +CTATATTCTAACAAAAGAAAATATTGATTGGGGTATACAAGAAGGTATCATTGAAGCTTTTGATTTTCAA +AAAAGACAATCTAAAAGAGCTTATGGTGATGTTACTATTCAATTCTACCAACCCTTAGATATGAGAATGT +ATATACCCGCAGGAACAACTTTTACTTCAACACGACAAGAATACCCTCAGCAATTTGAAACATTAGTTGA +TTATTATGCAGAGCCTGATTCTACTGAGATTGTTGTTGAAGTTTATTGTAAAGAAACAGGGGTTGCAGGT +AATGTTCCTGAAGGAACAATTAATACTATAGCATCAGGTTCTAGTTTGATTAGAAGTGTTAATAACGAGT +ATTCTTTTAATACAGGAACTAAAGAAGAGAGCCAAGAAGACTTTAAGCGCAGATTCCACTCTTTTGTAGA +ATCTAGAGGTAGAGCAACTAATAAATCAGTAAGATATGGTGCACTGCAGATACCTGATGTAGAAGGTGTT +TATGTTTATGAAGAAACAGGACATATTACAGTATTTGCTCATGATAGAAACGGTAATTTATCAGATACCT +TAAAAGAAGATATAATTGATGCTTTACAAGACTATAGACCAAGTGGTATAATGTTAGATGTTACAGGTGT +AGAAAAAGAAGAAGTTAATGTTTCTGCTACAGTAACTATATCTAATAAATCTAGAATTGGTGATACATTA 
+CAAAAACATATCGAAAGTGTTATTAGAAGCTATTTAAATAATTTAAAAACTTCTGATGACCTAATAATTA +CAGACCTTATTCAAGCTATAATGAATATTGATGATGTATTAATATATGATGTGTCATTTGATAACCTAGA +TGAGAACATTATAGTACCGCCACAAGGAATTATTAGAGCAGGAGAAATAAAAGTAGAACTAAAGTAA +>MW460250_1_42 # 36338 # 39397 # 1 # ID=1_42;partial=00;start_type=GTG;rbs_motif=4Base/6BMM;rbs_spacer=13-15bp;gc_cont=0.288 +GTGGCTAATTTTTTAAAGAATCTTCATCCATTATTAAGAAGAGATAGAAATAAAAAAGATAATCAAGACC +CTAACTTTGCTCTGATAGATGCACTCAATGAAGAGATGAATCAAGTAGAGAAAGATGCTATAGAAAGTAA +GTTACAATCTTCTCTAAAGACATCTACCAGTGAATATTTAGATAAGTTTGGGGATTGGTTCGGAGTTTAT +CGTAAGACCGATGAGAAAGATGATGTTTATAGAGCAAGAATTATAAAATATTTACTCTTGAAAAGAGGAA +CTAATAATGCTATAATAGATGCTATAAAAGATTATTTAGGTAGAGATGATATTGATGTAAGTGTATATGA +ACCTTTTACAAATATTTTCTATACTAACAAATCACATTTAAATGGTGAAGACCACTTAATGGGATACTAT +TATAGATTTGCTGTTATTAATGTCTCTATAGGTGATTATTTCCCTGTAGAGATTATAGATGTAATTAATG +AATTCAAACCTGCAGGTGTAACTCTATATGTCACTTATGATGGGGCTTCTACTATTAGAGGTGGAGCAAT +TATTAAGTGGTTAGATGGGTTACCTAAAATAGAAACATACCAAGAGTTTGATAGATTTACAGGTTATGAT +GATACATTCTATGGTCATATTAATATGAATCAAAGTAAAGATACTGATAACAGTTCATCAGATATTTTTA +AAACAAACCATAGCTTAATTAATAGTTTAGATGTTTTAACAGGTTCATCTAGTGTAGGGAGACAGTATAT +TAACTACGGATATGTAACATCATATGTTTATAATCCAGGTATGACATCTTCTGTAAATCAAATAAGCGCT +AGTACAAAAGGTAGAGGTCAAGAAGTACCTACTGACTATTATATGTATACTAGTACTAAGAATAACAATA +CAGTAGAACTTAGTATGCAAACTACTTCCGGTGTGTCTTATTTATATAATAACTTTAATTTTAGGGACTA +TATGAGTAAATATAGACCTCAAGTAGATTTACAATCTGATGAGGCTAGAAGAATTGTATCTGATTATATA +AAAGAATTAAGTATTGATTACTATCTTAGTGCTGTGATACCTCCTGATGAAAGTATAGAAATTAAACTAC +AAGTTTATGATTTTTCTATTAATAGATGGCTTACAGTATCAATTAATAATTTATCTTTCTATGAAAAAAA +TATCGGGAGCAATATAGGATATATAAAAGATTATCTAAACAGTGAATTAAATATGTTTACTAGGTTAGAG +ATAAATGCAGGTAAAAGAGATTCAGTAGATATTAAAGTTAATTACTTAGATTTAATGTTTTATTACTATG +AACGAGGTATTTATACAATAAAACCGTATAAAGCATTAATAGAAAATTATTTAGATATATCTAGAGAGAC +TTATGTAGAAGCATTTAAAATAGCATCATTATCTAATGGAGATATTATAACTAAAACAGGTTTTCAGCCT +ATAGGGTATTTAAAACTAGTTGGTAATTATGAAAATACAATACCTAGCACAATAAATATAGTAGCTAAAG +ATACAGATAATAACCCTATAGAATCTAATGAATTAGATGTATATAATACAGTAGAGAATAGAAATTTATT 
+ACAATCTTATAAAGGTGTAAATACGATAGCTAGAGAAATAACTTCTACAAAAGAGTTTACTGTATCAGGA +TGGGCTAAAGAGATATACTCAACTAATTATCTTTCTAAAGTATTAAAACCAGGTAAAGTGTATACGTTAT +CTTTTGATATGGAAATAACAGGTAATGACCCAACTCTTAAATCTTATTCTGATAGTCATGGTATATATTT +ATACAGTAATACTAAGGGAATTGTTGTTAGTGGTGTTAAATCTATGGAACGTACTATAGGTAACAAAGTA +TCCGTAACTCAAACTTTTACAGCCCCTACTATTACTGACCATAGATTACTAATATATACTGGAAGATATA +CATCTGATGGTAAAGCATCAACTCCTCCAGTGTTCTTTAATACAGTTAAAATTACGGAATTAAAATTGAC +TGAGGGTTCTTCTAAGCTAGAGTACTCACCTGCTCCGGAAGATAAACCTAACGTAATAGAAAAAGGAATT +AAATTTAATAATATCCTAACTAATATACAGACTTTAAGTATTAATTCGGATACTATCTTAAAAAATGTAA +CTTTATATTATTCTTACTATGGTGATAGTTGGGTAGAACTAAAGACTCTAGGAAATATTAGTACTGGAGA +AACAACAGAAACCAATAACTTAATAGATTTATATGGATTACAGACAGTAGATTATTCTAATATAAATCCA +ATGTCTAAAGTATCATTACGTTCCATTTGGAATGTTAAGCTAGGTGAATTGAACAATCAAGAAGGTTCTT +TATATAATATGCCTAATGATTACTTTAATGCTGTATGGCAGGATATAGATAAATTATCAGATATTGAGCT +AGGTTCTATGAGAATGGTTAAAGACACTGAGGGCGGAGTATTCGATGGAGCTACAGGTGAAATTATTAAG +GCTACTCTATTTAATGTCGGTGCTTATACTGATTTAGATATGTTAGCCTATACTTTGACTAATTATACTG +AACCGTTAACGTTAGGCTCTAGTCGATTAATAATTGAGCTAAAAGAAGAACTACTAACATCAGAATCATT +TAATGTCGATAATAGAATTAAAGTAATTGACTCAATATATGAGGAGTTACCAAATACAAGCATTATTAAA +AATGGATTTGTTGAAAGAGAAGTTACAGGTTCTAAATATTTAGATTACGGTTTATATGAGCCTATAGAAG +ATGGTACTAGATATAAACTTATTGTCGAAGGAGAATTTAAAGATAATATAGAATTTATATCTTTATACAA +TTCTAACCCTAACTTTAATGAAACATTTATATATCCATCAGAGATAATTAATGGAGTTGCTGAAAAAGAA +TTTATTGCAAAACCATCTACTGAAGACAAACCAAGGTTAAATACAGATGTTAGAATATATATACGACCTT +ATGATTCAACTATCTCTAAAGTAAGAAGAGTAGAATTAAGGAAAGTTTAA +>MW460250_1_43 # 39508 # 40029 # 1 # ID=1_43;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.324 +ATGGCAATTGCAACGTATAATTCTCATGTTGAGTTAGCAAAATATCTAGTTAGTAAAGCTGATTCAGTTT +ACTTAACAATTGGAAAGAGCACACCGTGGTCTAATGAAACAAACCCACCGCAACCTGATGAAAATGCAAC +AGTATTACAGGAGGTTATTGGATATAAAAAAGCTACTAAAGTTACTTTAGTTAGACCTTCTAAATCACCT +GAAGATGATAATAAGAATTTAATTTCTTATGGTAATAAATCGTGGGTAGAAGTAACACCTGAAAATGCTA +AAGCTGAAGGAGCTAAATGGGTTTACTTAGAAAGTAGTATTGTTGGTGACGAACTACCTCTTGGAACATA 
+TAGACAAGTAGGATTTGTTATGGACTTAGTAGCAAAAAGTGGTATTAGTAAATTTAACTTAGTACCTAGT +GAAGTAGAATCAACTGGAACATTATTATTCTTTGATAATAAACAATTCCAAAATAGAAGTGAGCAGACAA +CTGCTAAAGAAAGATTTATTGTAGAAGTTTAA +>MW460250_1_44 # 40050 # 43508 # 1 # ID=1_44;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.342 +ATGGCAATTAATTTTAAAGGTTCACCTTATTTAGATAGATTTGACCCGTCTAAAGATAGAACAAAAGTAT +TATTTAATCCTGATAGACCTCTACAACAGGCAGAATTAAATGAAATGCAGTCTATAGACCAATATTATTT +AAAAAATCTAGGAGACGCTATTTTTAAAGACGGAGATAAACAATCAGGGCTTGGATTCACATTGTCTGAA +GATAATGTATTGACAGTAAATCCTGGTTATGTATATATCAATGGTAAAATAAGATATTACGATAATGACG +ATTCAGTTAAAATAACTGGCGTAGGTAAAGAAACTATTGGTATTAAATTAACAGAACGTATTGTTACACC +TGATGAAGATGCTAGCCTATTAGACCAAACTAGTGGAGTACCAAGTTACTTCTCTAAAGGTGCAGATAGA +TTAGAAGAAAAGATGTCATTAACAGTTAATGACCCGACATCAGCAACTATTTATACTTTCATGGATGGGG +ATTTATATATTCAATCAACTAATGCTGAGATGGATAAAATCAACAAAGTATTAGCTGAACGTACTTATGA +TGAGTCAGGTTCATATAAAGTAAATGGTTTTGAACTATTTTCAGAAGGTAATGCTGAAGATGATGACCAC +GTTTCTGTAGTTGTAGATGCAGGTAAAGCCTATGTAAAAGGTTTTAAAGTAGACAAACCCGTATCAACAA +GAATTAGTGTACCTAAATCTTATGACTTAGGAACAGCAGAAAATGAAAGTACTATCTTTAATAAGTCTAA +TAACTCTATTAGTTTAGCTAATAGCCCTGTAAAAGAAATTAGACGTGTTACAGGTCAAGTACTTATTGAA +AAAGAACGAGTTACAAGAGGAGCTCAAGGTGATGGTCAAGATTTTCTTTCAAATAATACAGCATTTGAAA +TTGTAAAAGTTTGGACTGAAACAAGCCCTGGAGTTACTACAAAAGAGTATAAACAAGGAGAAGACTTCAG +ATTAACAGATGGTCAAACAATTGATTGGTCACCTCAAGGTCAAGAACCTTCAGGAGGTACTTCATACTAC +GTTTCTTATAAATATAACAAACGTATGGAAGCCGGTAAGGATTATGAAGTAACAACTCAAGGTGAAGGGT +TAAGTAAGAAATGGTACATTAACTTTACACCTTCAAATGGTGCTAAACCTATTGACCAAACAGTAGTATT +AGTAGACTATACTTACTACTTGGCTCGTAAAGATTCAGTGTTTATTAATAAGTATGGTGATATTGCAATA +TTACCTGGTGAACCTAATATTATGAGATTAGTTACACCACCATTAAACACAGACCCTGAGAATTTACAAT +TAGGTACAGTTACAGTATTACCTGATTCAGATGAAGCCGTATGTATTTCATTTGCAATCACTAGATTGTC +TATGGAAGACTTACAGAAAGTTAAAACAAGAGTAGATAACTTAGAGTATAACCAAGCAGTAAATGCTCTA +GATGATGGTGCTATGGAAGGACAGAACCCTCTAACATTACGTTCAGTATTCAGTGAAGGTTTCATTAGTC +TTGACAAAGCAGACATTACACATCCTGACTTCGGAATTGTATTTAGTTTTGAAGATGCAGAAGCTACTCT 
+AGCTTATACAGAAGCAGTTAACCAACCTAAGATTATTCCAGGAGATACAACAGCTCATATTTGGGGTAGA +TTAATTTCAGCACCATTTACTGAGGAACGTACAATCTACCAAGGTCAAGCATCAGAAACATTAAATGTTA +ACCCTTATAATATTCCTAACAAACAAGGTGTGTTAAAATTAACACCTAGTGAGGATAACTGGATTGATAC +TGAAAATGTTACAATCACTGAACAAAAAACTAAAAAAGTAACTATGAAACGATTTTGGAGACATAATGAA +AGTTACTATGGTGAGACTGAGCATTACTTGTATTCTAACTTACAGTTAGATGCAGGACAAAAGTGGAAAG +GTGAAACTTACGCTTATGATAGAGAGCATGGTCGTACCGGTACTTTATTGGAATCAGGAGGACAACGTAC +TCTAGAAGAAATGATTGAATTCATTAGAATCAGAGATGTATCCTTCGAAGTTAAAGGACTAAACCCTAAT +GATAATAATTTATATTTATTATTTGATGGAGTAAGATGTGCTATAACACCTGCAACTGGCTATAGAAAAG +GCTCTGAAGATGGTACGATAATGACAGATGCTAAAGGAACAGCTAAAGGTAAGTTTACTATTCCTGCAGG +TATTCGTTGTGGTAACCGAGAAGTTACACTTAAGAATGCTAACTCTACAAGTGCTACAACTTACACAGCC +CAAGGACGTAAAAAAACCGCTCAAGATATTATTATCAGAACTCGTGTAACAGTAAACTTAGTAGACCCGT +TAGCACAATCATTCCAATATGATGAGAATAGAACTATATCATCATTAGGATTATACTTTGCTTCTAAAGG +TGATAAACAATCTAATGTTGTTATCCAAATTAGAGGTATGGGTGACCAAGGTTATCCTAATAAAACAATC +TATGCAGAAACAGTTATGAATGCTGATGATATTAAAGTATCTAATAATGCTAGTGCTGAAACTAGAGTAT +ACTTTGATGACCCTATGATGGCTGAAGGCGGTAAGGAGTACGCTATTGTTATTATTACTGAGAACAGTGA +TTATACAATGTGGGTAGGTACTAGAACTAAGCCTAAAATTGATAAACCTAATGAGGTTATTTCAGGTAAT +CCATACCTACAAGGTGTATTATTCAGTTCATCAAACGCATCAACATGGACTCCTCACCAAAACTCTGACC +TTAAATTTGGTATTTATACTTCTAAATTTAATGAGACAGCAACGATTGAATTCGAACCAATTAAAGATGT +ATCAGCGGATAGAATAGTTCTTATGTCTACGTACTTAACTCCTGAGAGAACAGGATGTACGTGGGAAATG +AAATTAATTCTAGATGATATGGCATCTTCTACAACATTCGACCAATTGAAATGGGAGCCTATCGGTAACT +ATCAAGACTTAGATGTTTTAGGTCTAGCAAGACAAGTTAAGTTAAGAGCAACTTTCGAATCTAATAGATA +TATCTCACCATTAATGAGCTCTAGTGATTTAACATTCACTACATTCTTAACAGAGTTAACAGGTTCATAT +GTTGGTAGAGCTATTGATATGACAGAGGCTCCTTACAATACAGTAAGATTTAGTTATGAAGCTTTCTTAC +CTAAAGGTACTAAAGTTGTTCCTAAGTATTCTGCGGATGATGGAAAAACTTGGAAAACATTTACTAAATC +CCCTACAACTACTAGAGCCAATAATGAGTTTACACGCTATGTCATTGACGAGAAAGTAAAATCATCAGGA +ACAAATACTAAACTACAAGTTAGATTAGATTTATCAACTGAAAATAGCTTTTTACGTCCTCGTGTTCGTA +GACTTATGGTTACTACTAGGGATGAATAA +>MW460250_1_45 # 43557 # 43715 # 1 # 
ID=1_45;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.283 +ATGCCTAGAGAAGTTAGAGACCCTTATTCTCAAGCTAAATTATTTATACCTACAGTTGAGGAAAAATCAA +TTAAGGAATTAGAAAAAACATACAAAGAAAAAATTGATGAAGCTACTAAGTTAATCAATGAATTAAAGAA +AGAGAGAGGAGAAAAATAG +>MW460250_1_46 # 43716 # 45638 # 1 # ID=1_46;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.316 +ATGGCATTTAACTACACGCCTCTTACTGAAACACAGAAGTTAAAAGATATGTATCCTAAAGTTAATGATA +TAGGTAACTTTTTAAAAACAGAAGTTAACCTTAGTGATGTAAAACAGATATCACAACCCGACTTTAATAA +TATTTTAGCATCTATACCTGATAGTGGTAACTATTATGTAACTAATTCAAAAGGTGCTCCTAGTGGAGAA +GCTACAGCAGGATTTGTAAGATTGGATAAAAGAAATGTAAATTATTATAAAATTTACTATTCACCATATA +GCAGTAACAAAATGTATATCAAGACTTATGCTAATGGTACTGTATATGATTGGATTAGTTTTAAATTAGA +TGAAGGTAGCTTATACAATGAAGGTAATACTTTGAATGTAAAGGAACTTACTGAATCCACAACTCAATAT +GCAACACTAGTTAATCCTCCAAAAGAGAACTTAAATACAGGTTGGGTTAATTACAAAGAAAGTAAAAATG +GTGTTTCTTCTTTAGTAGAATTTAACCCGGTTAACTCCACTTCAACTTTTAAGATGATAAGAAAGTTACC +AGTACAAGAACAAAAGCCTAACTTATTGAAAGATAGTTTATTTGTTTATCCTGAAACTAGCTATTCTAAT +ATTAAAACAGATAACTGGGATACGCCTCCATTTTGGGGATATTCTTCTAATAGTGGTCGTTCAGGAGTTA +GATTTAGAGGAGAGAATACAGTACAGATAGATGATGGGTCTGATACGTACCCTTCAGTAGTTTCTAATAG +GTTTAAAATGGGTAAAGAACTTTCTGTAGGTGATACTGTAACGGTATCAGTATATGCTAAAATTAATGAC +CCTGCTTTACTTAAAGATAACTTAGTTTACTTTGAATTAGCAGGATACGATACTGTAGATGATACTAGTA +AAAATCCTTATACAGGAGGACGTAGAGAAATAACAGCAAGTGAGATAACAACTGAGTGGAAAAAATACTC +TTTCACATTCACTATACCTGAAAATACAATCGGAGCATCAGGCGTTAAAGTTAATTACGTATCTTTACTA +CTAAGAATGAATTGTTCATCTAGTAAAGGTAATGGTGCTGTAGTATACTATGCCTTACCTAAATTAGAAA +AATCATCTAAAGTTACACCATTTATTACACATGAAAATGATGTTCGTAAATATGATGAGATTTGGTCTAA +TTGGCAAGAATTTATTAGTAAAGATGAATTAAAAGGTCACTCCCCTGTAGATATTGAATATAATGATTAT +TTTAAATATCAGTGGTGGAAATCTGAAGTTAATGAAAAGAGTTTAAAAGATTTAGCTATGACAGTACCTC +AAGGATATCATACATTTTATTGTCAAGGCTCTATTGCCGGGACGCCTAAGGGACGTTCTATTAGAGGAAC +CATTCAGGTAGATTATGACAAAGGTGACCCATATAGAGCTAATAAGTTTGTTAAATTATTGTTTACTGAC +ACAGAGGGTATTCCTTACACATTATATTATGGTGGTTATAACCAGGGTTGGAAACCCTTAAAGCAATCAG 
+AAACTTCTACTTTACTATGGAAAGGTACTTTAGATTTTGGGTCTACGGAAGCTGTTAACTTAAATGACTC +ATTAGATAATTACGATTTAATTGAGGTAACTTATTGGACTCGTTCAGCAGGACATTTTTCTACAAAAAGA +TTAGATATAAAAAATACATCAAATTTACTGTATATTAGAGATTTTAATATTTCAAATGATAGTAAAGGTT +CTAGTGTAGACTTTTTTGAAGGGTATTGCACTTTTCCTACTAGAACATCAGTACAACCTGGTATGGTAAA +ATCTATAACTTTAGACGGGTCTACAAATACAACAAAAGTAGCATCATGGAATGAAAAGGAACGTATACAG +GTATACAATATTATGGGAATTAATAGAGGATAA +>MW460250_1_47 # 45661 # 46035 # 1 # ID=1_47;partial=00;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=11-12bp;gc_cont=0.307 +ATGGCTGTTAAATATGATATAGGTAATAATGAGATAGTATTACATTTAAGAGAAGGTAAATATATAACAG +GGTTTACAACAGTAGGAGGGTATGATAAGGAGTTAGGACAAGTAAAAGTTAATAGAGAAATCTTACCTGC +TTACTTCTTTGATAATTTTGCCTATGAAAGATATTTGTATTATAGTAAACCTGAAGAGGTTATAGAAAAT +AAAAACTATGTACCACCACAAATCAATGATGATGATGAGGAATCCCAACAAATTACTGTACCTAAAGAAC +AATATGATAGTTTAAAAGAAGAACTAGAGCTTATGAGAAAACAACAAGAAGCTATGATGGAAATGCTTCA +AAAGCTCTTAGGTCAAAAGGGGTAA +>MW460250_1_48 # 46042 # 47418 # 1 # ID=1_48;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=11-12bp;gc_cont=0.328 +ATGGCATTAAATTTTACTACAATAACGGAAAACAATGTTATTAGAGACCTGACTACTCAGGTCAATAACA +TTGGAGAAGAATTAACAAAAGAAAGAAATATATTTGACATTACCGATGATTTAGTTTATAATTTTAATAA +ATCACAGAAAATTAAACTAACTGATGATAAAGGATTAACTAAATCTTATGGAAACATAACAGCCCTTAGA +GATATAAAAGAACCTGGTTATTACTATATAGGTGCTAGAACATTAGCAACATTATTAGATAGACCTGATA +TGGAATCTCTTGATGTTGTTTTACATGTAGTACCTCTTGATACTTCTAGTAAGGTAGTTCAACATTTATA +TACACTATCTACTAACAATAACCAAATTAAAATGTTATATAGATTTGTCTCAGGAAACTCTAGTTCAGAA +TGGCAATTTATTCAAGGATTACCTAGTAATAAAAATGCTGTTATATCAGGAACTAATATTTTAGATATAG +CTTCACCAGGTGTTTACTTTGTTATGGGAATGACAGGAGGAATGCCTAGTGGAGTAAGCTCCGGATTTTT +AGACTTAAGTGTAGATGCTAATGATAATAGATTAGCTAGACTAACTGATGCTGAAACCGGTAAAGAATAT +ACTAGCATTAAGAAACCTACAGGAACATACACAGCCTGGAAAAAAGAATTTGAGCTAAAAGATATGGAGA +AATATCTACTAAGTAGTATTATAGACGATGGTAGTGCATCATTCCCACTCCTAGTTTATACTAGTGATAG +TAAAACATTTCAACAAGCTATTATAGACCATATAGATAGAACAGGTCAAACAACCTTTACTTTCTATGTT +CAAGGCGGTGTATCCGGTTCCCCTATGTCGAATAGTTGTCGAGGGTTATTCATGTCAGACACACCTAATA 
+CTTCTAGTTTACATGGTGTTTACAATGCTATAGGTACAGATGGTAGAAATGTAACAGGTTCAGTGGTAGG +TAGTAATTGGACTTCACCAAAAACATCCCCTTCTCATAAAGAATTATGGACAGGAGCACAATCATTCTTA +TCTACAGGAACTACTAAGAATTTATCAGATGATATTAGTAACTACTCTTATGTAGAAGTTTATACTACAC +ATAAGACAACAGAGAAGACTAAAGGTAATGACAATACAGGAACTATATGTCATAAGTTTTATTTAGATGG +TAGTGGAACTTACGTTTGTTCAGGTACATTTGTTTCCGGGGATAGAACCGATACAAAACCCCCTATCACG +GAGTTTTATAGAGTAGGTGTATCTTTTAAAGGTTCTACATGGACTCTTGTAGATAGTGCAGTACAAAATA +GTAAAACTCAATACGTTACAAGAATTATAGGTATTAATATGCCATAG +>MW460250_1_49 # 47510 # 49258 # 1 # ID=1_49;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.310 +ATGAGATTAAGAATTAAGAACTTATATACCTATGTAGAATTTGAGGAGGATGATAAATACTTAAAAGATA +TATTTTTAAAGAGAGTCCATACGACTATAGGAGCAAGACAAGAAGGATTTCAGTACAGCCCTGCGTACAA +AAGAGGTAGTTGGGATGGTTATGTAGATTTTTATGTTTATGAGGAAGATAAATTCCCCACTGGACTTTTA +TTTAAAATTGAGTTATTATTAGGTGAGCTACAATCAAGGTATAATTTCCAGTTTGAAACAATTGATGAGC +GTGATGAAAGTTTCTTATCTGAAGAAGATATTGATGATGAGATAACATTGCTTGATAATAATGTCGGTCA +AATTACCTTAAGAGATTACCAATATGAAGCAGTGTACAATAGCTTAACATTTTACAATGGTATTGCTCAC +TTAGCTACTAATGGTGGTAAAACTGAGGTTGCTAGTGGTATTATAGACCAATTATTACCTCAATTAGAAA +AAGGTGAGAGAGTAGCATTCTTCACAGGCTCTACGGAGATATTCCATCAGTCTGCAGATAGGCTCCAAGA +GCGTTTAAATATTCCTATTGGTAAAGTAGGTGCAGGTAAGTTTGATGTTAAGCAGGTTACAGTTGTAATG +ATACCTACTTTAAATGCAAACCTTAAAGACCCAACACAAGGGGTAAAGGTTACGCCTAAACAAAATATTA +GTAAAAAGATTGCTCAAGAGATATTACCTAAATTTGAAGGTGGAACAAATCAAAAGAAATTACTAAAAGT +ATTACTTGATAACACAACACCTAAAACAAAAGTAGAACAAAATGTATTAAGTGCCTTAGAGATAATTTAC +CAAAATAGTAAGACAGATGCAGAAGTTTTATTAAACTTAAGAAATCATAATGCACATTTTCAAAAAATTG +TTAGAGAAAAGAACGAAAAGAAATATGATAAATATCAAGATATGAGAGATTTTTTAGACTCAGTTACAGT +TATGATAGTTGATGAGGCACACCATTCTAAATCTGATTCTTGGTACAATAATTTAATGACATGTGAAAAA +GCTTTATATCGAATTGCATTAACAGGGTCTATAGATAAAAAAGATGAATTACTTTGGATGAGATTGCAGG +CGCTATTCGGTAATGTTATTGCACGAACTACTAATAAGTTTTTAATTGATGAAGGTCATTCTGCTAGACC +AACAATAAATATTATACCTGTAGCTAATCCTAATGACATAGATAGAATTGATGATTATAGGGAAGCTTAC +GATAAAGGTATAACAAATAATGATTTTAGGAATAAACTTATTGCAAAACTAACAGAAAAGTGGTATAATC 
+AAGATAAAGGTACATTGATTATTGTAAACTTCATTGAACATGGAGACACAATATCAGAAATGTTAAATGA +TTTAGATGTAGAGCATTACTTCTTACATGGAGAAATAGACTCTGAAACTAGGAGAGAAAAATTAAACGAT +ATGAGAAGTGGTAAGCTTAAAGTAATGATAGCTACATCACTTATTGATGAGGGTGTAGATATATCAGGTA +TTAATGCACTAATATTAGGTGCAGGAGGTAAGTCATTAAGACAAACATTGCAACGTATTGGTCGTGCTTT +ACGTAAGAAAAAAGACGATAATACAACACAAATATTTGATTTTAATGATATGACAAATAGATTTTTATAT +ACTCATGCTAATGAGCGTAGGAAAATTTATGAAGAGGAAGATTTTGAAATAAAAGACTTAGGAAAATAG +>MW460250_1_50 # 49270 # 50883 # 1 # ID=1_50;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.284 +ATGGCAACAAAAACACAAAGAAAGCTATACCAATATCTAGAGGAAAATGCTACAGAAAATAAATTTCATA +TTTCTACTAAGAAAGAGCTAGCAGATTCTCTAGGTGTTTCCATCTCTGCTTTATCCAATAACCTTAAAAA +GTTAGAAGAAGAAAATAAAGTCGTTACTGTTTCTAAAAGAGGAAAAAACGGCGGGGTAATAATAACTTTA +GTTAGAGAGTATGACACAGAAGAATTGAAAGAATTCAATAATTCTACAGATAATATTATTACTTCCGATT +TACAGTATGCTAAGGCATTAAGAGAAAAGCACTTCCCTTCTTATAGATATGAGAGAAAAGAACAACGTAG +ACGTACTAAGATAGAAATGGCACAATACAATGCCATTAAGGATGAGAAGAGAAGAATTATAGCAGATATG +AATTTCTATTCAGAAGGTCTTCCTTATCCTTCTAAAGATATTTTTAATATGTCCTATGACCCGGAAGGGT +TTTATAAAGCGTACATCTTATGTAAGTTATACGACCAATATGCTATTTCTCATATGGATGCTAAACATAC +AAGTCATCTTAAAGCAATGAGTAAGGCAACAACTAAAGATGAATACGACTACCATCAACATATGTCTGAA +TACTATAGAAATAAAATGATTCAAAATTTACCTAGAAATAGCGTTAGTGATAATTTCTTTGGTAGTAAAA +TGTTTAATACTTTTTATAATTTTTATTTAAAAATAAAAGATAAAAATATTAATGTATTTAAGTATATGCA +AAATGTATTTAAAAATGTAACATTTTATTACGAGAACGGTATGCAACCTAATCCAATACCTTCTCCTAAC +TTCTTTAGCTCAGATAAGTATTTTAAAAACTATAATAATTATATTAAAGGAATAAAAAAAGGTGTTAACA +GTACTAATAGACACCTAGGTGATACAGACAGCATCATTAATTCATCAGACTATGTGAAAAACCCTGCTGT +ATTACATCTACACCAACTATATACTACAGGATTAAATTCTACTTTACATGATATTGATACTATGTTTGAA +CAAGCCTTAGACCTTGAAAATGCCTCCTATGGATTATTTGGAGATATGAAACATATTATTTTACTACAGT +ATAATTCTATGATTGAAGAAGAAATTAAGAATTTACCTAGAGAAGAAAAGGATATTATTAATAAATATGT +AAAACAATGCATAATTAATGATTATTCACCAACAAGTATTTCACCTTCTGCAAGGTTATCAATGTTTACT +ATGCAGAAAGAGCATATAGTTTACAATAAGCAGTTAAATAAAGGAATCAAGAGAGAGGATTTATTACCAT +TAAGTCTAGGAGGTATAGTGAATAAAGATTTATTGAGTGGTATGGATATACAAAACTTAGAACAGAATGG 
+TAATGAATACCTATATATGAGACAACATACTTCAACTTATTATATATTAAGAATGTTTGGTGACTATTTA +GGGTATGAGGTAAACTTAAGAGAAGTAAAATATATTGTAGAGAAATATAATTTAATTGATAAAATACCAT +TGACAAAAGAGGGTATGTTGGATTATAATAAACTTATACATTTAGTAGAGGAAGAGGTTAATAACTATGA +GTAA +>MW460250_1_51 # 50876 # 52318 # 1 # ID=1_51;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.312 +ATGAGTAAGAAGATAAAGGAGCTTATCCTTCATAAATCAATGAAGGATATACATTTTGCAAGAGAAGTAT +TAGATAACTTACCTAAGAATCTATTTTCAGCAGAGTCTGAGGACATGGGTTACTTATTTACAGCTATAAA +GAGAACAGCACATATTTCCGATAAGATGTCAAATGAAGCATTAGCAATTAAAGTAGAACAGCTTATGGGT +AATAATAAGGAAGATGAAGAGAAAGTAACCAAGACATTAACTTACTTAGAAGATTTATATAAAGTAGACG +TTAATGAAAAAGATGAATCTGTTAATTATGAAATAGAGAAGTATATTAAAACAGAAATGTCAAAAGAAGT +TTTAGTTAAATTTATTGCAGAAAATAAACAAGAAGACTCTGATAATCTACATGAACTTGTAGACAAACTA +AAGCAAATAGAAGTAAGTGACATCTCAGGAGGTAATGGGGAGTTTATTGACTTCTTCGAAGATACAGAAA +AGAAACAAGAACTATTGAGTAATTTAGCTACAAATAAATTCTCTACTGGATTTACTTCTATTGACAACCA +TATTGAAGGTGGTATAGCAAGAGGAGAGGTTGGATTAATCATAGCTCCTACCGGTAGAGGTAAATCATTA +ATGGCTTCAAACTTAGCTAAGAATTATGTTAAAAGTGGATTAAGTGTTTTATATATTGCCTTAGAGGAAA +AAATGGATAGAATGGTTTTGCGTGCTGAGCAACAAATGGCAGGAGCAGAAAAGAGTCAAATTGTAAATCA +GGATATGTCTTTAAATAATAAAGTTTATGATGCAATACAAAATCATTATCAGAAGAATAGAAAGTTATTA +GGTGACTTTTATATTTCTAAACATATGCCAGGTGAAGTTACACCAAACCAATTAGAACAAATTATTGTCA +ATACAACAATTAAGAAGGATAAAAATATTGATGTTGTTATTATTGACTATCCTCACTTAATGAGAAATCC +TTATGCTAAATATCATTCAGAATCAGATGCAGGAGGGAAATTGTTTGAAGATATTCGTAGATTATCACAG +CAATATGGATTTGTTTGTTGGACGTTAGCTCAAACTAACCGTGGTGCTTATGGTTCAGATGTTATTACAA +GTGAGCATGTAGAAGGTTCTCGTAAGATTGTCAATGCTGTTGAGGTGTCTTTAGCAGTAAACCAAAAAGA +TGAAGAATTCAAGAGCGGTTTCTTAAGATTGTATTTAGATAAAATTCGTAATAGCTCTAACACAGGAGAA +CGATTTGTTAATCTTAAAGTAGAACCAACTAAGATGATTGTAAGAGATGAAACACCTGAAGAAAAACAAG +AGCATATACAATTGCTATCAGATAATGGAAAAGAAGACACAAGTAAATTTCAAAATAAAGATAATAAAAT +AGAAGCTATAAATAACACATTCGGAGGATTACCGGGAGTTTAA +>MW460250_1_52 # 52397 # 53434 # 1 # ID=1_52;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.318 
+ATGAAATTTGTATTCTTTACAGATAGCCACTTTCACTTATTTACTAACTATGCTAAACCTGATGAGCAGT +ATGTGAATGATAGATTTAGAGAACAGATACAAGCTTTACAGAAAATGTTTGATATTGCAAGAGAAGAGGA +TGCAACAGTTATATTTGGTGGGGATTTATTCCACAAACGTAACGCAGTAGATACTAGAGTATATAATAAG +GTATTTGAAACATTCCAACTTAATAGAGATATAGAAGTACTAATGTTAAGAGGTAATCATGATTCAGTTA +CAAATAGTTTATATACAGATTCTAGTATAGAACCTTTCGGTTACTTACCTAATGTAGAGGTTTGTAAAAA +CCTTGATACTTTAGGGTTTTTAGGAGAAGAACAGGATATTAATATTGTTATGGCTCCTTATGGAGACGAG +ACTGAAGAAATTAAAGAGTTTATTAAAAATAAATATGTAGAAGATAGAGTAAATATCTTAGTAGGTCATT +TAGGTGTAGAAGGCTCTTTGACTGGAAAAGGGTCTCATAGATTAGAAGGGGCATTTGGATACCAGGATTT +ATTACCTGATAAATATGATTTCATTTTACTAGGTCATTATCACCGTAGACAATATTTCCAAAATCCGAAT +CATTTTTATGGTGGTTCATTAATGCAACAATCATTTTCTGATGAGCAAGAAGCTAATGGTGTTCATTTAA +TAGATACAGAAAAAATGACTACAGAATTCATCCCAATCCATACACGTAGATTTATTACTATTCAAGGAGA +AGATATTCCTGAGAACTTTGAACAGCTAATCGAGGAAGATAATTTTATTAGGGTTATCGGTACAGCAAAT +CATGCTAAGGTTTTAGAAATGGATGACAGTATGAAAGATAAGAATGTTGAAGTTCAAATTAAAAAAGAGT +ATACTGTAGAGAAACGTATTGATAGTGATGTGTCTGATGACCCTTTAACAATTGCTAGTACCTATGCTAA +ACAATACTCACCTGAATCAGAACAAGAAATACTTGAGTGTTTGAAGGAGGTTTTATAA +>MW460250_1_53 # 53434 # 53811 # 1 # ID=1_53;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.280 +ATGAAAAAATATAGAGAATATCTAAATAAGACAGATGCAGAAAATTTAGCAGAGGATTGGGAGAAAGTAA +CCGAAGATTTATGGAAAGTGTTTAAAGATATGAAACCTAAAATTAATACATTAGATATCAGTAATGTAGT +AAGTAAAGACTTAGATAAAAGTAAACCTATTTTACAATTCCAAGATTCAGATGGAGTAATAGAGAATATT +TGTAATGTTGAAGGTTTAGAAGATGGTCTAAGTAAAATGAAAAAGATTTTTGATGATAGTAATTTTGAAA +AGCATTATTACAATAGAGTAGTAGACCATGATGAGTATTACTGGATTGATTATGGCTCTCATCATTGTTT +CTTTAGAGTTACGAAAGGGGATAAGTAA +>MW460250_1_54 # 53811 # 55730 # 1 # ID=1_54;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.297 +ATGGTTGTATTTAAACAAGTAGAAGTTAATAATTTTTTAGCAATTAAAGAAGCTACGCTAGAGTTAGACA +ATAGAGGATTAATTCTAATTGAAGGTGAGAATAAATCTAATGAGTCATTTCATTCAAACGGCTCAGGAAA +ATCAACTTTAATATCTGCCATTACTTACGCTTTATATGGTAAAACTGAAAAAGGACTAAAAGCAGATGAT +GTAGTAAATAATATTGAGAAGAAAAATACATCTGTTAAACTTAAGTTTGATATTGGGGAAGATAGTTATT 
+TAATTGAACGTTATCGTAAAGATAAAGAGAATAAGAATAAAGTAAAATTATTCGTTAATGAAAAAGAGAT +TACAGGTTCAACAAATGACGTTACCGATAAACAAATACAAGATTTATTTGGTATTGAGTTTAATACTTAC +GTTAATGCCATCATGTATGGTCAAGGAGATATCCCTATGTTCTCTCAAGCAACAGATAAAGGTAAGAAAG +AAATTCTTGAATCTATTACTAAGACAGACGTATATAAACAAGCACAAGATGTAGCAAAAGAGAAAGTTAA +AGAAGTGGAAGAACAACAAAATAACATAAGACAGGAAATCTATAAACTAGGTTATCAGTTATCGACAAAA +GATGAGTACTTTCAAAGAGAAATAGAGCAGTACAATCAATATAAAGAACAATTGGTTCAGATAGAAAACA +GTAATAAGGAAAAAGATAGATTAAGAGAACAAGAGGAGAAGCAAATAGAAGCTCAAATAGAGCAACTAGC +TTCACAGATACCAACAATACCTGAAGATGAATTTAAGCACTCAGAGGAGTATAATAAAGCCTCTCAAAGC +CTAGATTTACTTTCTAATAAATTAACGGAGTTAAATCAAGTTTACTCAGAGTATAATACCAAAGAACAAG +TACTAAAATCTGAAATAGCTACATTAAGCAATAGTCTAAATCAGTTAGATACAAATGACCATTGTCCTGT +TTGTGGCTCCCCTATAGATAATTCTCATAAATTAAAAGAACAGGAAAATATCAATAATCAGATTGAGAAT +AAGAAACAAGAGATTACTAGTGTATTAGAAATGAAAGATACGTATAAAGAAGCTATTGATAAAGTAAAAG +ATAAATCACAAGAAATTAAAGATAAAATGTCACAGGAAGACCAACAAGAACGAGAGCACAATAATAAGAT +TAACAGCATAATTCAAGAGGCTTCTAGGATTAAATCAGACATTAGTTCATTAGAGAATAATAAAACGTAT +TTAAAAGTTAAATATCAACATCAATCTGTTCAAGGATTAGAGAGAGAAGAACCAAGTAAAGAAAAACATG +AGGAAGATAAGAAAGAATTACAAGAATCTATTGACAAACATGAAGAGAATATAGTACAATTAGAAACTAA +GAAAGGTAAATATCAGCAAGCTGTAGATGCTTTTAGTAATAAAGGTATACGTTCAGTAGTGTTAGACTTT +ATTACACCATTCTTAAATGAAAAAGCAAATGAGTACCTTCAAACTTTATCAGGTTCAGATATTGAAATAG +AGTTCCAAACTCAAGTGAAGAATGCTAAAGGAGAACTAAAAGATAAGTTTGATGTTATTGTTAAGAATAG +CAAGGGCGGAGGTTCGTACAAATCCAATTCAGCAGGAGAACAAAAACGTATTGATTTAGCAATTAGTTTT +GCAATTCAGGATTTAATTATGAGTAAAGATGAGATATCTACGAATATTGCACTTTACGATGAGTGTTTTG +ATGGATTAGATACTATCGGTTGTGAAAACGTGATTAAATTATTAAAAGATAGACTTAATACAGTAGGAAC +AATATTTGTAATTACTCATAATACCGAGCTTAAACCACTGTTTGAACAAACAATTAAAATCGTAAAAGAA +AATGGAGTATCAAAACTGGAGCAAAAATAA +>MW460250_1_55 # 55730 # 56326 # 1 # ID=1_55;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.281 +ATGAAATTAAAGATTTTAGATAAAGATAATGCAACACTTAATGTGTTTCATCGTAATAAGGAGCACAAAA +CAATAGATAATGTACCAACTGCTAACTTAGTTGATTGGTACCCTCTAAGTAATGCTTATGAGTACAAGTT 
+AAGTAGAAACGGGGAATACTTAGAATTAAAAAGATTACGTTCTACTTTACCTTCATCTTATGGTTTAGAT +GATAATAACCAAGATATTATTAGAGATAATAACCATAGATGTAAAATAGGTTATTGGTACAACCCTGCAG +TACGCAAAGATAATTTAAAGATTATAGAGAAAGCTAAACAATATGGATTACCTATTATAACAGAAGAATA +TGATGCTAATACTGTAGAGCAAGGATTTAGAGATATTGGAGTTATATTCCAAAGTCTTAAAACTATTGTT +GTTACTAGATACCTAGAAGGTAAAACAGAAGAAGAATTAAGAATATTTAACATGAAATCAGAAGAGTCAC +AACTGAATGAAGCACTTAAAGAGAGTGATTTTTCTGTAGATTTAACTTATAGTGACTTAGGACAAATTTA +TAATATGTTGTTATTAATGAAAAAAATTAGTAAATAG +>MW460250_1_56 # 56341 # 57408 # 1 # ID=1_56;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.290 +ATGAGGTTTGAAGACTTTTTAACCCAAGAATTAGGAGAACCAAAAGAAAATACTATAGGTGAGCTAAGAT +ACTGTTGTCCGTTTTGTGGAGAAAAAAGTTATAAGTTCTATGTTAAGCAAGCCCTAGACTCTAGTAATGG +TCAGTATCATTGTAAAAAATGTGATGAATCAGGTAACCCTATTACATTTATGAAGACTTATTATAACATT +ACAGGTAAACAAGCTTTTGATTTATTAGAGTCTAAGAATATAGATATAGAGAGAGCCCCTTTACTTACGA +CCAATAATAAGGATTTGACAGAATCAGAGAAACTTATATTAATGCTTAGAGGTGTGCACCAAGATAAAGG +AAATACTAGTATTAAACCTCCTAGATTACCTGAAGGGTATAAATTATTAAAAGATAACTTAAATAATAAA +GAGATTATACCCTTTTTAAAATACTTAAAAGGCAGAGGTATAACTTTAGAACAAATTATTAATAATAATA +TAGGTTATGTTATTAATGGGAGCTTTTATAAAGTTGACGGGGAATCCAAAGTATCATTAAGGAATAGTAT +TATATTTTTTACTTATGATAATGATGGAAATTACCAGTATTGGAATACACGAAGTATAGAAAAGAACCCT +TATATTAAATCTATTAATGCTCCTGCTAAACAAGATGAAGTAGGGAGAAAAGATGTCATATTTAATTTGA +ATATAGCAAGAAAGAAAAAGTTCTTAGTTATAACTGAGGGTGTATTTGATGCTTTAACCTTCCATGAGTA +TGGCGTAGCAACATTAGGTAAACAAGTAACTGAGAATCAAATAAAAAAAATAATTGATTATGTTAGTATA +GATACATCAATATATATTATGTTAGACACTGATGCATTAGATAATAATATAGACTTAGCTTATAAGTTAA +AAACACATTTTAATAAAGTTTACTTTGTACCTCATGGTGATGAAGATGCAAATGATATGGGAACAAGGAA +AGCTTTTGAGTTATTAAAACAGAACCGGGTGCTAGTAACACCTGAAAGTATACAGAGTTACAAAATACAA +CAAAAACTTAAACTTTAG +>MW460250_1_57 # 57475 # 57813 # 1 # ID=1_57;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.274 +ATGTCAAATAATAAAAAAGATATTTTAGAATTTGTAGATGAATACATTACAGCTTTAAGAGTTGGTAATG +AGCAACGACAACATCAATTAGAAGAAATGGGTAAAGAAGAAACAGCAACATTAACAGATGTAGCTAAAGC 
+TATTACTAACCTTATGTTAGGTGTTAATGAGCAGATGACAGACTTAGAATATAATAACGAGTTAAACTTA +AATATTTTAATTGATGCTTTATATAAAGCAGAGCTTATTAATGAAGATGTATTAGACTACATTCAAGAAT +CAATTGATAAATCACAAGAAGAACCTAAAAATGAAGAAGAAAAAGGAGAACAAGAATAA +>MW460250_1_58 # 57813 # 58265 # 1 # ID=1_58;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.331 +ATGGAAAAAAATATTAGCACACACACAAAAGGTATTAGTCAAGCAGACATGGAGAAATGGATTGAAGCTG +TAGTACAAGGAACTGTTGATGGTAAACAAGTTGATGAGAAAACAGCTAAACAATTAGATAGAATTGGTTC +ACGAAGTGTTTCTTTAGAAGAAGCAACTCGTATTGCTAAAGTTCTTAATGCTGTAACAGCTCAAGAGGTT +ACAGGAGACTTTAATGATGCATTTAATGCAATTGACTTAATGATGATTATCATGGAAGATGAGTTAGGAG +TAACTCAAGAAAAAGTAGGTAAAGCTAAAGATAAACTAAATGAAAAACGAGAAGCTTACCTAAAAGAGAA +ACAAGAAGAATTACGTCAAAAACAACAAGAAGAGGCACAGAAAAAAACTGAATCTGACAGCAATGAAAAA +GTAATTCAGTTGAAGAAAAATGACGAACAGTAA +>MW460250_1_59 # 58252 # 58860 # 1 # ID=1_59;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.328 +ATGACGAACAGTAAGAAAAAAGGGGATACATTCGAACGTAAAATAGCTAAAGAATTAACTGCTTGGTGGG +GATACCAATTCAATAGGTCTCCTCAATCAGGTGGTGCTTCATGGGGTAAAGATAATAATGCTGTCGGAGA +TATAGTAGTACCTCAGGAAGCTAATTTTCCTTTAGTAGTAGAATGTAAACATAGAGAAGAATGGACTATA +GATAACGTTCTTTTAAACAACAGAGAGCCACACACATGGTGGGAGCAAGTCATTAATGATAGTAGTAAGG +TGAATAAGACACCTTGCTTAATATTTACTAGAAATAGAGCTCAGAGTTATGTTGCTTTACCTTATGATGA +GAAAGTATATGAAGATTTAAGAAATAATGAATACCCTGTCATGAGAACAGACTTTATTATTGATAATATT +AGAAAAGATAAATTTTTTTATGATGTCCTTATAACTACCATGAATGGGTTGACCTCATTTACACCTTCTT +ATATTATATCTTGCTACGACAAAAAAGATATAAAACCATACAAGAAGGTCGAGTCTAATTTATCTGAGGT +AAGTAAGCATGAAGATGAATTGATTAATGACCTTCTTAGTGATATATAA +>MW460250_1_60 # 58877 # 59269 # 1 # ID=1_60;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.293 +ATGACAAGCAAAGAAAGACCATTAATCGTATATTTTTCAGGTACAGGACAAACAGAAAGATTAGTAAACA +AAATTAATATTAATAATTCATTTGAAACATTTAGGGTTAAGAGTGGAAAAGAAAAAGTAAATAAACCTTT +TATACTAATAACACCTACTTATAAGAAAGGTGCAATACCTAAACAAATAGAAAGATTCCTAGAAATTAAT +GGGAGCCCTAAAGAAGTTATTGGTACAGGAAATAAACAATGGGGCTCTAATTTCTGTGGAGCAAGTAAAA 
+AGATTTCAGAGATGTTTAAGATTCCTTTAATTGCTAAAGTAGAGCAATCAGGACACTTTAACGAGATACA +ACCAATATTAGAACACTTTAGTAATAAATATAAAGTAGCGTAA +>MW460250_1_61 # 59284 # 61398 # 1 # ID=1_61;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.312 +ATGGCAACATATGGAAAATGGATTGAGTTAAATAATGAAATAACTCAATTAGATGACAATGGAAAAAATA +AACTCTATAAAGACCAAGAAGCTTTAGATGAGTATTTAAAATATATTGAAGACAATACAAGAAAGTTTAA +TAGTGAAGTAGAAAGAATTAGAGTATTGACAAAAGAAGGAACATATGATAAAATATTTGACAACGTTCCT +GACACTATTATTGATGAAATGACTAAGTTAGCTTACAGTTTTAATTTTAAATTCCCTAGTTTCATGGCAG +GGCAAAAGTTTTATGAATCTTACGCATCAAAACAGTATGATGAAAACAAAAAACCTATTTTTGTTGAAGA +CTATGAACAACATAATGTTCGAGTAGCTTTATATTTATTTCAAAATGACTATGTAAAGGCTAGAGAATTA +CTAGTACAACTTATGGAGCAAACATTCCAACCATCTACACCTACGTATAACAACTCAGGACAAGCTAATA +GAGGTGAACTAAGCTCATGTTATCTATTTGTAGTAGATGATTCAATTGAGTCTTTAAACTTTGTTGAAGA +TAGTGTAGCTAATGCTAGTTCTAATGGTGGTGGAGTTGCAATTGATTTAACTAGAATTAGACCTAAAGGA +GCTCCAGTACGTAATAGACCTAATTCAAGTAAAGGTGTTATTGCTTTTGCTAAAGCTATTGAACATAAAG +TTAGTATTTATGACCAGGGTGGTGTAAGACAGGGTAGTGGTGCTGTTTATCTAAATATATTCCATAATGA +TATCCTGGATTTATTAAGTTCTAAGAAAATCAATGCTAGTGAGTCTGTTAGACTCGATAAATTATCTATT +GGTGTTACAATCCCTAACAAATTTATGGAGTTAGTTAAAGAAGGTAAACCTTTCTATACTTTTGATACTT +ACGACATTAATAAAGTGTACGGTAAGTATTTAGATGAGCTAAACATTGATGAATGGTATGATAAGTTACT +AAATAATGATAGTATCGGTAAAGTAAAACATGATGCTAGAGAAGTTATGACAGATATTGCTAAAACACAA +TTAGAATCAGGGTACCCTTATGTATTCTATATTGATAATGCTAATGATAATCACCCATTGAAAAACCTAG +GTAAAGTTAAAATGAGTAACTTATGTACAGAAATTTCACAATTACAAGAGGTATCAGAAATTTATCCGTA +CTCTTACAGTAATCAGAATGTTATTAATAGAGATGTTGTCTGTACATTAGGTTCTCTTAACTTGGTTAAT +GTGGTTGAAAAAGGTTTATTGAATGAATCTGTAGATATTGGTACAAGAGCATTAACAAAAGTTACTGATA +TTATGGATTTACCTTACTTACCTAGTGTTCAAAAAGCAAATGATGATATTAGAGCTATCGGTTTAGGTTC +AATGAATTTACATGGACTTTTAGCTAAGAATATGATTAGTTATGGTTCTAGGGAAGCATTAGACCTAGTA +AACAGTTTATATAGTGCTATTAACTTCCAGTCTATTAAGACATCTATGTTAATGGCTAAAGAAACAGGAA +AACCATTTAAAGGCTTTGAGAAGTCCGATTATGCTACAGGTGAATACTTTGTAAGATACATTAGAGAATC +CAATCAACCTAAGACAGATAAAGCTAAGAAAGTCTTAAATAAGGTTTATATTCCAACACAAGATGATTGG 
+GATGAATTAGCTAAAGCAGTAAAAGTACATGGCTTGTATAATGGTTATAGAAAAGCAGAAGCACCTACTC +AATCTATATCTTATGTACAGAATGCTACAAGTTCTATTATGCCAGTTCCTAGTGCTATAGAGAATAGACA +ATATGGAGATATGGAGACATATTACCCAATGCCTTACCTAAGTCCTATAACTCAGTTCTTCTATGAAGGA +GAAACAGCTTATAAGATTGACAATAAACGTATTATTAATACAAGCGCAGTTGTTCAGAAACATACAGACC +AAGCAGTGTCTACAATACTTTATGTAGAATCAGAAATCCCTACTAATAAACTAGTATCATTATACTATTA +TGCTTGGGAACAAGGATTAAAATCATTATACTATACACGTTCACGTAAACTTTCTGTTATTGAATGTGAA +ACATGTTCGGTTTAG +>MW460250_1_62 # 61412 # 62461 # 1 # ID=1_62;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.320 +ATGGATATTACACAAAAAGTAAAACAACATAATAAAAATGCTGTATTAAAAGCAACAAACTGGAATATTG +AAGATGACGGGATGTCTGATATTTATTGGGAGCAAGGAATCTCCCAATTTTGGACTCCTGAAGAGTTTGA +TGTATCAAGAGATTTAAGTTCTTGGAATAGTTTAACTGAAAGTGAAAAGAACACTTATAAGAAAGTCCTT +GCAGGGCTCACAGGGCTCGATACAAAGCAAGGAGGAGAAGGTATGAACTTAGTATCCTACCACGAACCAA +GACCTAAATACCAAGCTGTATTTGCGTTTATGGGTGGTATGGAAGAGATACATGCTAAATCTTATAGTCA +TATTTTTACAACATTACTAAGTAATAAAGAAACAAGTTATTTATTAGATACTTGGGTAGAAGAAAACGAC +TTTTTAAAAGTAAAAGCTCAGTTTATCGGATATTACTACGACCAACTATTAAAACCTAATCCTACTATAT +TTGATAGATACATGGCTAAAGTAGCTAGTGCCTTTTTAGAAAGTGCATTATTCTACTCAGGATTTTATTA +TCCTTTACTTCTTGCAGGAAGAGGTCAGATGACACAATCAGGAGCTATTATTTATAAAATTACTCAAGAT +GAAGCTTACCATGGTTCGGCAGTAGGATTAACAGCTCAATATGATTATAATCTTCTAACAGAAGAAGAGA +AAAAACAAGCAGATAAAGAAACTTATGAATTATTAGATATTCTTTACACTAATGAAGTAGCGTATACACA +TAGTCTATATGACCCACTAGAATTAAGTGAAGACGTAATTAACTATGTTCAGTATAATTTTAATAGAGCT +CTTCAAAACCTTGGAAGAGAGGACTATTTTAATCCTGAACCTTATAACCCTATTGTAGAAAATCAAACTA +ATGTAGACAGATTACGAAATGTTGATTTCTTTAGTGGTAAAGCAGACTATGAAAAATCTACAAATATCAA +AGATATTAAAGATGAAGATTTCTCATTCTTAGATAGTAAAGAATACAGTACTGCCAAGGAATTCCTATAA +>MW460250_1_63 # 62479 # 62808 # 1 # ID=1_63;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.306 +ATGGATAGAAAAGAAGCAATGGATTTACTAAGTAAAGCAGAAATATTATTTAAAAAACATGATGAGTTTT +CATGTGTAAGTGATATCAATGACCCTATGAAGTTATTCAGTAACTCTAAGGATGCTAAAGCTGATGATAC +GTCTAATTCTTTTCAGCTAGAGTTTATGCATGATATGACCATGTATACTTTATCTTATGGCTCAGGACAG 
+CTAAAACTTATTGATTTAGCAGAAGGTTATGAAGCACAAAAAGCTACAATAGTTAACTCATTTCCCGAAA +TTATTAAAACATTAGAAAAGGATGATTCAGAAGATGGAAAAAATGAATAG +>MW460250_1_64 # 62792 # 63112 # 1 # ID=1_64;partial=00;start_type=ATG;rbs_motif=AGGA/GGAG/GAGG;rbs_spacer=11-12bp;gc_cont=0.293 +ATGGAAAAAATGAATAGTTTAGTAGATTTAAATACAGCAATTAGACAAAAGAAAGATGTTATTGTCATGA +TTACACAAGATAATTGTGGTAAGTGTGAGATTTTAAAAAGTGTAATCCCTATGTTTCAAGAGTCAGGTGA +CATTAAAAAACCTATCTTAACATTAAATCTAGATGCTGAAGATGTAGATAGAGAAAAAGCTGTTAAGTTA +TTCGATATCATGAGTACACCAGTATTAATTGGGTATAAAGATGGTCAGTTAGTTAAAAAGTATGAAGACC +AAGTTACACCTATGCAATTACAAGAATTAGAGTCACTTTAA +>MW460250_1_65 # 63319 # 63915 # 1 # ID=1_65;partial=00;start_type=ATG;rbs_motif=3Base/5BMM;rbs_spacer=13-15bp;gc_cont=0.283 +ATGGATGAATTAATATCTAAGTCTAGAAGATATATCATGAGGGATGAAAAGCATTACATGCTATTTAATG +AGAAGTACAATAATGATAGGCTTATAGAAAAAGTATGTAAACACGGTGGTAAAGTTACATACTATACTGA +TTCAGTATTACCTTACTATGTTTTAAAAGACTTATCTAGTCACCCTGACTCAGAAGTTGTTTATCGTATG +CGCAACGGTTTTACTGCAAAAGAAGTAGATAATATAGCTTTATCATTCATGGGTACAAAAGTTATTATTG +ATATTTCTGTAGTATTTCCTTATGTAAACCCTTATGATATTATTAGAAGTTTACATGATATTAAAACAAA +TGTAGATGAAGTTCATTTATCATTTCCACGAATATTAGGGGTAGATGAAAAACAAGAAAAGTTTTATTTC +TTTGATGGTGAAGCTTATGATTTAAAACCCGAATATAAAGTCGATTTTGCAGATAAAATTAGAGTATCTT +TATCAGTATGGAAAATGTATATCTATATCTTAACAAGTAGTCGTGATTTTGAGGATGTAGACAATGTAAT +TACGAAATTAAAACAACAACGAAAGATTAAGATATAA +>MW460250_1_66 # 63925 # 64230 # 1 # ID=1_66;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.307 +ATGAGTACAGCAAATAGAAGAGATATAGCAAGAAAGATATCAGAGAATACAGGTTACTATATCCAAGATG +TAGAGGAAATACTAAGTGCAGAGACAGATGCTATTTCTGACTTGCTAGAAGAAGGGTATACTAAAGTAAA +GAATCATAAATTTATGCAAATAGAAGTTATTGAAAGAAAAGGTAAAAAAGCGTGGGATGGTCTGAATAAA +GAATACTTCCATTTACCTAATAGAAAAGCTATAAAATTCAAACCACTAAAAGAACTAGAAGAGGTTATTG +ATAGACTTAATGAAGAAGAGAAATAA +>MW460250_1_67 # 64306 # 65178 # 1 # ID=1_67;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.310 +ATGAAAGTATTAATCTTATTTGACCACATTAGAGAAGAGCATTTTTCTGTAAGTAAAGATGGGAGTGTGA 
+AATCTAATGTACTAAATACACCTAACGGAAAAACACTTAAGAAATTACTTGAGAAGTGTTCTAACTTAAA +GAGAGATAAAACAAACAGAGATTATGATATTGATTTTCTCTACAATGCAGTACCTACACCTATTAGAAAT +GACTACGGTAAAATCATTAAATACCAAGATGTTAAACAAGCAGAAGTAAAGCCATACTATGAGAGAATGA +ATAATATTATTATTGATAATTCTTATGATATGGTAATTCCTGTAGGTAAACTAGGTGTTAAATACCTATT +AAATGTTACAGCTATTGGTAAAGTAAGAGGTGTACCAAGTAAAGTAACTATTGAAAATGGAACATCTTCT +CATGATGTGTGGGTATTACCTACTTATAGCATTGAATATACTAATGTAAATAAAAATAGTGAACGTCATG +TAGTATCAGATTTACAAACAGTTGGTAAGTTTGTAGAGCAAGGAGAAGAGGCATTTAAACCTAAGGAAGT +ATCTTACGAGTTGGTAGATAACATTGAAAGAGTAAGAGAAATATTCAATAAGGAAGTAAAGAATGATAAT +TATGATGGGGTAGATATTACCGCATGGGACTTAGAGACTAACTCATTAAAACCTGATAAAGAAGGAAGTA +AACCTTTAGTACTATCTCTATCATGGAGAAATGGTCAAGGTGTAACTATACCCTTATACAAATCAGACTT +TAACTGGGAAAACGGTCAAGATGATATTGATGAAGTCTTAGAATTGCTTAAGAATTGGTTAGCTAGTAAA +GAAGATATTAAAGTAGCACATAACGGTAAATGA +>MW460250_1_68 # 65344 # 65856 # 1 # ID=1_68;partial=00;start_type=GTG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.285 +GTGTATCGACTTAATAGAGGTGGTACAGTGAAAAAAGATTATATGACATCAGTTAAAAATAACAAAAAAG +TATGTAGAAGATGCAACGAAGAATTAGATTTATCTAACTTTAAAACATATAAGAAGAATGATAAAACTTA +TTATCAAAGTATGTGTATACCTTGTCGGAAGGAATATAATAAGTTAGATAAAACTAAAAATACTATTAAA +AAATGTTATGAGAAAAACGGAGATAAATATAGAAGACAAAGTAATGAGTATAATACTTCTGACAGAGGTA +GAGAGCTTAATAAAAATAGGTCTAGGAAATACAGAGAAAACAATTCTTTAAAATCGAAAGCTAGAAGCTC +TGTAAGAACCGCATTAAGAAATGGTTCTCTCATAAGACCTGATAAGTGTTCAGAGTGTAATAAAGATTGC +ATACCTGAAGCTCACCATCCTGATTATACTAAACCTTTAGAAATAAAATGGTTATGTAAATCCTGTCATG +AAGATACTCATCATAAAAAATAA +>MW460250_1_69 # 65992 # 67335 # 1 # ID=1_69;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.303 +ATGAGTACTGAAAACTTTAAAGATTTTGAGAGTATTCAGGATACTAAAGTAGGTTGGTACCTAGCTGTTA +CCCAAGAAGTTAAAGAATCTTTAAGATTATCTGATTTAGCTTATGAGGTTACAGATGTCGGAGGCTATGA +TAAACCATTAGAAGACTTTAAATTATGGTTTGTTACTAAGTTATTAAGATTCTTCTCAGATAAAATTAAA +GAGATACAGAAAGAAAATAAAAAGATTGCTAAGAAAGAGTATGATGTTAAAGCTCCTGAATATAAAGAAT +GGTTAGAGAATAAATTAAATGAAACAGTAGTAGAACTAGATGATACTGAGAAAAAATTTAGAGTTAGTGA 
+ATTAGAGAAAAAGTATATTCAACTAGGTCTTTCACCTGAAATTGTAAATATGAATTTAGTTATGGATAAT +GATGAATTCATAAATATTGCAGAACAATCACCTGAGTACATGGGGTTATCTGACTACGCTAAGTCTTACA +CGTTAAATACTGCAATTAATTTAATTAATGAGTATAGAGATGTAAAAGATGTAGTTAATGATATTGACGG +AGGTAACTTTAATTATGATTGGTTCCCTATTGAGTTAATGCATCCATACGCATCAGGAGATACTGATGTA +TGTAGAAGAATTTATTGTGATGTAATTAAGAAACTTAAAGAACAAGATAGACCTAAGTCAATGCATTTAT +TAGAAGTTAATTACCCAAGACTTACTAAGTCTTTAGCTAGAATTGAATCAAATGGTTTATATTGTGACTT +AGATTATATGAAAGAAAATGATGAGTCATACGAGTCTGAGATGGCTAAGAACCATGCTACAATGAGAGAG +CACTGGGCTGTTAAAGAATTTGAAGAATACCAATACAATCTTTACCAAATGGCGTTAGAAGAACATGAGA +AAAAGCCAAAAGATAGAGATAAAGATATCCATCAGTACAGAGATAAATTTAAAGATGGTAAATGGATGTT +TTCCCCAAGTTCCGGAGACCATAAAGGTAGAGTAATTTATGATATTCTAGGAATTCAATTACCTTATGAT +AAAGAATATGTCAAGGAAAAACCATTTAATGCTAATGTTAAAGAAGCAGACCTTACTTGGCAGGACTATA +AAACAGACAAGAAAGCTATTGGTTATGCGTTAGATAATTTAGAATTAAAAGATGATGTTAAAGAGCTTCT +TGAATTACTTAAATATCATGCTAGTATACAGACAAAACGTAATTCATTTACTAAGAAATTACTTAATATG +ATTAATAAACAAAAACGAACATTACATGGTTCTTTTTCTGAGACAGGCACAGAGACATCAAGACTAAGTA +GTAGTAACCCTTAA +>MW460250_1_70 # 67603 # 68310 # 1 # ID=1_70;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.275 +ATGAAAGAGATTTGGAAGAAAGTAGTAGGATTTGAAAACTACGAGGTAAGTAATAAAGGAAAAGTAAGGA +ATATAAAAACTAACTATATTTTAAAGCCGTGGATAATAAATTCCGGATATGAGCAAGTATCTATAGGTAT +TGCTAATGTATTAGTACATAGATTAGTGGCTATGACATTTATACCTACCGACAGCTATAGTATAGTTAAC +CATATTGATAATAATAAATTAAATAACTGTGTTGAAAATTTAGAATGGGTAAGTTACAAAGGTAATAGTG +CTCACGCTAATAAGCAAGGAAGATTGAATACTTATAGTGCAAGAGAAAAACTTAGTTCTGTATCTAAGAA +AGCCATTTATCAAAAAGATATGGAAGGTAACATCATTAAGTTATGGGATTCACCAAGTGAAGCTGAAAAA +GAATCTAATGGGTACTTTAAAAGTACTAAGATTAGTTCCGTTGCTCACGGTAAACGTAAGCATCATAGAA +GTTATACTTGGGAATACGTATATAAGGATTCAAAGAGAAGTTTAAATAAGTCTATTAATATGTATGATTT +AAATAATAATTTATTATATGAAGATTTGACAATGAATAAAATTATGGGTATACTAGAAATGAATAATCAT +AAAACATTAAGAGATAAACTAAGAAATACAGATGACTTTGTTGAATACAGAGGATATAAATTTAAAAATA +ATAATTAA +>MW460250_1_71 # 68544 # 69404 # 1 # ID=1_71;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.304 
+ATGCGTATTATCGGATTATTTACTAAAGACCCTGATATGCTACAATCATTCTTAAATGGGGAAGATATCC +ATAAGGCTACTGCAAGTATTGTTTACAATAAACCAGTAGAAGAAGTAACTAAAGAAGAACGACAAGCAAC +TAAAGCAGTTAACTTCGGATTAGCCTTTGGTGAATCACCCTTCTCATTTGCAGGTAAAAATAATATGGAA +GTAAGTGAAGCAGAAGAAATATTTGAAAAGTATTTCCAAACAAAACCAAGTGTAAAAACTTCTATTGACA +ATGTACATGAGTTTGTGCAACAATATGGTTATGTTGATACAATGCACGGACATAGAAGATTTATCCGTTC +AGCCCAATCAACAGATAAAAAGATAAAAAATGAAGGTCTAAGACAGTCATTTAACACTATCATCCAAGGT +TCAGGTAGTTTCTTAACAAACATGTCTTTAACTTACTTAGATGATTTTATTCAATCTCGTAATTTAAAAT +CAAAAGTTATTGCCACAGTACATGATAGTATCTTAATTGATTGTCCTCCTGAAGAAGCTAAAATTATGGC +TAAAGTGACAATTCATATTATGGAAAACTTACCATTTGATTTCTTAAAAGCAGAAATTGATGGAAAAGAA +GTACAATATCCTATTGAAGCCGATATGGAAATTGGGTTAAACTATAATGATATGGTTGAATATGATGAGG +AAGAAATAGATACATTTAATTCTTACCAAGGTTATATTAAGTATATGATGAATTTACAGACCTTAGAAGA +TTATAAAGAGTCAGGTAAACTAACAGATGAACAATTTGAAAAGGCTACTAATGTTGTTAAAAGTGAAAAA +CATATTTATCAAGAAATTTAA +>MW460250_1_72 # 69473 # 69715 # 1 # ID=1_72;partial=00;start_type=GTG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.337 +GTGAATACAGGGGAGATTAGATTTAATCGTTCTATGGATGAATGGATTATAACAAGCATGTACCAGGATG +AGCTAGGTGGGATGAATATTGTTGTTACATTCTATAATAGAGAAGAAAATAAACATGGTTCTACAGTTTT +ACCAACAGAGTCATCTACTGGAGAAGTAACAGAGGAATTGGCAAGTCTTGAAGAAGAATATCCTTTAGCT +TTACCTTTAAGTAGTATCTCAGTTAATATTTAA +>MW460250_1_73 # 69732 # 70214 # 1 # ID=1_73;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.300 +ATGGAAATACACATTGATTCCCTAGATTTTACAAACTTTACTATTAAAGATAGAAATGGGAACTCACAAG +AGTTTGATATTACAGATGAGTTAAGAATTACAGAGTATACAATACAAGAGGATTTTATGCAACAATCAGC +TAAATATGCTTTTTGGGCTTCTATATTAGAGAAGGTAAGAGCATATTCTGAAATGGAACAAAGAAACCTA +GAAACAATTGGTAGTAAGCTAAACCTTACAATTAGACAAGAGTACGAACAACAAGGTAAAAAGCCTACTA +AAGATATGATTGAATCTAGTGTTTATATTCACGATTCTTATCAACAACAACTTAAAGTTGTTGAGGCTTG +GAATTATAAAGTTAAACAACTTCAATATGTTGTAAAAGCTTTTGAGACAAGAAGAGATATGATGATTCAA +TTAGGTGCAGAATTACGACAAACAAATAAAAATGGTGGAATTACTAATCCATTTTCACATTAA +>MW460250_1_74 # 70301 # 71572 # 1 # 
ID=1_74;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.355 +ATGGATTTCAATCAATTTATTAACAATGAGGCAAGCAAATTAGAAAGCAATAACAGTTCTTTTAACAATA +ATGTAGAGAGCTACAAACCTAAAAACCCTGTACTACGTTTAGGTAATATTAAAGATGCAAACGGAAATAA +GGTTGTTAAAGAAAATGCTTTTGTACGAGTATTACCTCCTGCACAAGGAACAAATGTTTTCTTTAAAGAA +TTTAGAACAACAGGTATTAACTATTCTAAGAAAGATGGTTCTCAGGGATTCACAGGGTTAACATTACCTG +CAGAAGAGGGTTCATCTGTCCTTGACCCATACATTCAGGATTGGATAACAAATGGTGTTCAGTTCAGTAG +ATTCCCTAATAAACCAGGAGTACGCTATTACATTCATGTTATTGAATACTTTAATAACAATGGTCAAATT +CAACCAAAAACGGATGCTCAAGGAAATGTAATGATTCAACCTATGGAATTATCTAATACAGGATATAAAG +AATTATTAGCTAACTTAAAAGACACTATGTTAAAACCATCACCTAATGCACCTCATAGCTTTATCTCAGC +AACTGAAGCATTCCTAGTTAATATTGTTAAAGCTAAGAAAGGTGAAATGTCATGGAAAGTAAGTGTTTAT +CCTAATGCCCCTTTAGGTGCGTTACCTCAAGGTTGGGAACAACAATTATCTGACTTAGACCAATTAGCAA +AACCAACAGAAGAACAAAATCCTAATTTTGTTAACTTCTTAATCAATAACGTTAATAACACAGAGTTAAG +TCATGATAACTTTAAATTTAACCGTGAAACAAATGTCTTAGGTGAAGAACCTTCAGAGCCTAAACAAGCA +CCCACACAACAAGATGTAGATAGTCAAATGCCAAGTAATATGGGAGGACAACCTAATCAGCCTCAGCAAG +GTCAAGTAGGTCAGTATGCACAACAAGGTCAAAGTAATGGTCAAGGACAGCAGTTACAAGGTACACAACA +ACCTATCAATAACACTCAATTTGGTCAAGGAACTCCTTCAGGACAACAACCAAGTAACACAGGTTCTGTT +GATTGGGATAACTTAGCGCAACAACAATCACAACCTGATTCAAACCCATTCAATGATTTTGATGTTAGCA +GTGTTGATGATTCACAGGTACCTTTTGAGACACAACCTCAAAATACACAACAAGCACCTGAACCACAACA +AACTACACAAGAGCCTCCAAAACAAAAACAAACGCAAAGTATTGACGATGTATTAGGTGGTCTAGACTTA +GATAACCTATAA +>MW460250_1_75 # 71632 # 71856 # 1 # ID=1_75;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.333 +ATGGCAAGAGCAAAAAAAGGTAAAGAAGTCGATTTAACAGATTTAAATACAATTGATTTAGGTAAAGAAT +TAGGATTAACATTGCTATCAGATACAAACAGAGCAGATATTAAAAACGTTATACCTACAATGGTTCCTCA +GTATGACTATATTTTAGGTGGAGGTATTCCATTAGGTCGGTTAACAGAAGTTTACGGTTTAACTGGCAGT +GGTTGCCTTAAATAG +>MW460250_1_76 # 72201 # 73169 # 1 # ID=1_76;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.268 +ATGGTAAAACGAGTTTGGACTAATGAAGAGAAACAAGATATAGTAAAATCATTTCAAGAAGGTAAAACGT 
+TTAAAGAATTACAAGATAAGTATAATGCACATTATTTCACTATCAAGAAAATTTTAGATGAGTTTAATAT +TGACACAAATAAAAAACGAAGATGGACTAATAAGCAAAAACAAGATATACTAAGAATGTATACTAAAGAG +TCTATGACTATAGCAGAGATAAAAAAGGTATATACAACACACGCAAGAGAAATAGGTAGAATACTAAAAG +ACTTTGGAGTAGATACTTCTTACTATCAAACTAGAAGCGTTAATAGAAATATAAATAGAAATTTCTTTGA +AGTAATTGATACTGAAGAGAAAGCTTATATACTAGGTTTATTAATGGCTGATGGGTGTGTTAGATATAGA +CGAGAAGGGCAATGTTACTTAACACTAGAATTGATTGATAAAGAAATAGTAAAAAGAGTTCAAAAAGAAT +TGAATTCAGACAGTAAAATATATGAATCACATAGAAAAAGAGATTATATAAAAAACGAAAAACAAACTTA +TACTTTTAGTGTTACCGATGAAAAACTATGTAATGACTTAGCTAAATATGGTATAGTACCTATGAAGTCT +AAAAAAACAGAATGTTTAACACAAGATATACCTTATGATTTAAGAAAGCATTATTTAAGAGGATTATTTG +ATGGTGATGGTTCAATCGGGTATTATAATAATAGATGGTTTATAACTTTAATTAATAATCATCCTGAATT +TTTAAAGGATGTAGGTACTTGGATTAATGATTTATTAGGTTTAAAATGTCCTAAAGTATCTAAAACTAGT +ACCTCTTATAGAATAGGTTATACAGGTAAAAAAGCTAAAGAATTAATGAAATTACTATATCAAGATAATA +ATATTCATATAGACCGAAAGCAAAAACTTGCAGACCAGGCAATTCAAGATATAGTCTAA +>MW460250_1_77 # 73317 # 74264 # 1 # ID=1_77;partial=00;start_type=ATG;rbs_motif=AGxAG;rbs_spacer=11-12bp;gc_cont=0.344 +ATGGAGCAACTTGGTGTAGATGTTTCAAAACTATTCTCTATTCAATCAGGAGAAGGTAGACTTAAAAATA +CAGTAGAATTATCTGTAGAGCAAGTAGGTAAAGAATTAGAGTACTGGATTGACACTTTCAATGAAAAGAT +TCCGGGAGTACCTATTGTATTTATTTGGGACTCATTAGGGGCTACAAGAACTCAGAAAGAGATTGATGGC +GGTATTGATGAGAAGCAAATGGGTCTTAAGGCATCAGCTACCCAAAAAGTAATTAATGCAGTAACACCTA +AACTAAATGATACAAACACAGGGTTAATTGTTATTAACCAAGCCCGTGATGATATGAATGCAGGTATGTA +TGGTGACCCTATTAAATCTACAGGTGGTAGAGCTTTTGAACATAGTGCTAGTTTACGTATTAAGGTTCAT +AAAGCATCTCAGTTAAAACAGAAAAGTGAGTTAACTGGTAAAGATGAATACCATGGTCACATTATGCGTA +TTGAAACTAAGAAATCTAAACTATCACGACCAGGGCAAAAAGCTGAAGCAGACTTACTATCTGATTATAT +GGTAGGTAAAGAAGATGACCCTATCTTATTAAATGGTATCGACTTAGAACATACTGTATATAAAGAAGCA +GTTGAAAGAGGTTTAATTACCAAAGGAGCATGGAGAAACTATGTTACATTGAATGGTGAAGAAATTAAAC +TTAGAGATGCTGAATGGGTTCCTGTACTTAAAGATAATAAAGAGTTATATCTAGAATTGTTTAGTAGAGT +TTATGGAGAACACTTCCCTAATGGTTACTCACCATTACTTAATAACAAAGTAATCGTAACTCAATTAGAA +GAGTATCAAGCTCTTGAAAACTACTATAAAGAATGGGCTACAGATAATAAACAAGAGGAACAAGAGGAAG 
+AACTAAAAGGAGAATCTCAAGAAAAGGATTCTGAATAA +>MW460250_1_78 # 74268 # 74621 # 1 # ID=1_78;partial=00;start_type=ATG;rbs_motif=4Base/6BMM;rbs_spacer=13-15bp;gc_cont=0.314 +ATGGATAATTTAATAGATAAAAACATGAATCAGGTAAAAGAATCTTTGGGGAATGCAAATTCCTCAGATG +TTCTTCCTTTACCTTATAAAGATATAGCAAAGAAATTTGAAGAAGTAAAAGAAAAAGGTGAATCAATTAT +CATTGAAGAAGGTGGATTCCCTTATACAGATTCTACAGTGATGTATATAGAACATGTAACAGATAGATGG +GCAGGAGGATATTCCTTAATTAGACATGAAGGTGAAGAAGTTAAAGTACCTAAGACTATCCATTTCTCTG +ATATATATGTTAAAGATAAATCACACAAAGTAAGAATAATCTTCGAGGGGGCTAATCCTTATGAAGAAAG +CTAA +>MW460250_1_79 # 74608 # 75270 # 1 # ID=1_79;partial=00;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=5-10bp;gc_cont=0.275 +ATGAAGAAAGCTAATAATGGTAATAGATATGTAATAGATATAGATGGTATACCTGTTGATTTTGAAAGGG +ATTTAGATAGTTTACTTAATAGGTATAAAAACCTTAGATGGTCGTTATATCATAGGTACGCAGGGATTTT +ATCTAATGATTTTGAAAGACAAGAACTAAGAGAATATATTGATGAGCAATTTATTAAATTAGTTAAAGAA +TATAATATTAGAAGTAAAGTGGATTTTCCTGGATATATTAAAGCTAAACTAACTTTAAGAGTTCAAAATA +GTTATGTTAAGAAGAATGAAAAATATAAACGTACTGAAATTATCGGTAAAAAAGATTATACAGTAGAGTC +TTTAACAGAAGATTTAAATGAAGACTTCGAGGATAATCAAATTATGAGTTATGTATTTGATGATATAGAA +TTTACAGAGGTTCAAAGTGAGTTACTTAAAGAATTACTTATTAACCCTGAAAGAGAAGATGATGCCTTTA +TCGTTTCTCAAGTAGCGGAAAAGTTTGATATGAAAAGAAAAGAAGTAGCAAGTGAGTTGACAGAACTCAG +AGACTATGTTAGATTTAAAATAAATGCATACCATGAGTACTATGCTAAGAAAGAATTAAATAACCATAGA +GTTAATACTGAAAATCATATTTGGGAAAACTAG +>MW460250_1_80 # 75398 # 76021 # 1 # ID=1_80;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.361 +ATGGCTAAAAAGAATGTTAATGATGTATTACAACAAGAATCTGTTACAGTAGCAGATAAGTATTTACAAG +TTAAAGTTAACCGTGACGGTTATACTCGTACACATGAAGGACAATATGCGTACAAAGTAGTTTCAGAGGG +AGAAGAATTATTCTTATACCCTGTACAAACAGATGGTAAAGGTACATTAAATGTAATGAAGAAATCACCT +ATTGCTTACACTGATGGAGACAATATCCATTTCGTAGTAAACACAGTAGTAGACCCTTATAATCACTCAT +TTATCCGTACTGAAGATATTAAAGGATTAGATAAAGGTAAACAACTTATTCAAGCTTTCTTAGCTTTCGT +TGAAGACCGTTTCAAATTTGGTGTTTATAACGTATTTGTTGCAAACAACAAAGAGGATGTATTATCTATT +GTAGACCCTACAGATAATGATGCAGATGAAGTTAAAGATAGTTTAGAGCACGCACATGAAGATGTAATTG 
+CGGATTTCCCTGCTAGCCCTGCTCGTAAGGACGTTAAAGGCGTAGATTCAGGAGAAGGTCAAGGAGACAC +TTCAGAACCATCAGCACCTAAGAACGTTCAAGTTACTCCTAAGGAAGACGTATCAGCAGAATAA +>MW460250_1_81 # 76044 # 76556 # 1 # ID=1_81;partial=00;start_type=TTG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.365 +TTGGCTAAGTTAAATTTATACAAAGGTAATGAGTTACTAAACAGCGTAGAAAAAACAGAAGGAAAATCAA +CAATCACGATTGAGAATTTAGATGCTAATACGGATTACCCTAAAGGTACTTTTAAAGTATCATTCTCAAA +TGATTCAGGAGAGTCAGAGAAGGTCGATGTTCCTCAGTTTAAGACAAAAGCAATTAAAGTTATTTCAGTT +ACCCTTGACGTTGATAGTTTAGACCTTACAGTTGGAGATACTCACCAACTATCAACAACTATCACGCCTA +GTGAAGCATCTAACAAAAATGTGTCATTTGAATCAGACAAATCAGGTGTTGCTAGCGTAACATCAGAAGG +CTTAATTGAAGCAGTTAGTGCAGGAACAGCTAATGTTACTGTAACTACTGAAGATGGTAGTCACACTGAT +ATTGTTGTTGTAACAGTTAAGGAACCTATTCCTGAAGCACCTGCAGACGTAACAGTTGAACCTGGTGAAA +ATAGCGCAGATATTACTGTATAG +>MW460250_1_82 # 76571 # 76798 # 1 # ID=1_82;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.355 +ATGGAAAAGACATTAAAAGTTTATAGTAATGGTGAAGTTGTGGGCTCTCAAGTAGCTAATAACGATGGAG +CTACTACAGTATCTATTACAGGCTTAGAAGCCGGAAAAACTTATGCTAAAGGAGATTTTAAAGTAGCATT +TGCTAATGATTCAGGTGAATCAGAAAAAGTAGATGTTCCTGAATTTACAACTAAAACTCCTACTGAAGAA +CCTTCAGGAGACGCATAA +>MW460250_1_83 # 76894 # 77154 # 1 # ID=1_83;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.295 +ATGGATATTCCAACAATATTATTTAGAAATCCATATGATTATACGAAAGTAAAAAAATTAATGGAAAACA +AAGAGCAGTATATTGTAGTAAAGTTTGATTCTGTTTCTGTTCATAATTTAAATGTTCAAGGTATGATGAA +TGTCATCCAAGATTACCTACACATCTATGGTTACAGAGTTAAAGAGTACGGACAAGAAAATTCTTCTAAA +GATGATGAAAGAGACGTTAAAGGCTACTTATATGAAAGAGTAGGTGAGTAG +>MW460250_1_84 # 77158 # 77913 # 1 # ID=1_84;partial=00;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=5-10bp;gc_cont=0.279 +ATGGGAATTATAGTAAACTCCAACCATATTCAATCAGACACTTTATATGAGTATGATAGCTTTTTTGATA +TTGAGAAAGTAGATACATTTGAAGAAGGATTGCTTTCAATACAGGATGAGCCAACTGTTTTAGCAGGATT +CATCTATGATGATATCACATTTAATAAGGTCATTAATTCTAATTCAGATATTGATGATTATATTAAGAAT +AATGATATTTATTATGTCTCTGATATAGGATTACTTCCTGATACTTTTATCACTGTTGATTCTGATAGAA 
+AATATTATTCATTATTACAACAGATAACTGAGTTAAGTAAAGACCCTTTTCCTAAATGGGTAGAGGATGA +TGCAAAAGGTTTAACTAAGTATTATAACTTTCAAGATTTTGAAGATGTATTTGATTTAAATAGTTTTTAC +AAAAAAGAAGTTGACATGGTAAGAGAAAAGTGCTATAATAATGGTAATGTATATTTATTATATGAGGTTC +TGCCTGATTATAAATTACCTCTAGCTTATAGTTTACTTTCAAACAAGGAGCATGGTATTGTTATTATCGG +TTCACAGACACGTTCTAATAATGATATACTGACTTTTTATGTTAAAGGTATGGATGCTAAGGCAATAGCT +AGTATGTTCAATGTAGAACATGATTATGATTCTAATATTTTCCATACATTTGTAAACAGTCACATTAATA +TTTTAGGAAATCAAATAACTAAGTTTATAAGAGAGAAAGGAAGCAGTTATGAGTAA +>MW460250_1_85 # 77906 # 79156 # 1 # ID=1_85;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.289 +ATGAGTAACTATAAAACAATAGAAGAAGTACAAGCAGTTATTATTGGGGTATTATTTAAAGATGAAGGTA +AAATTGTAACATCTAAGTTTAATAAAATTACTAAAGAGTTTGGTTTAGATAGAATCGGTAAAGATGACCT +TAAAGAAATTGTAGAGGATATTAGACAAGACGCTTATCTAAATGAACTTAAAAACAAAGCAATTAAAGGT +AAAGTAACGTTAGGTGATTTAAAAGATGTTGCAGATAACCAAGTATTCGAAGGTAATAACTACCATGAAG +AAGTATCTACTTATGTAGTAGCTAAAGAAAAAGAATTGTCTCACTTAAGAGAACAGCGTAAGCACAATAG +GCATACTGCATACCCTCAAATTATGTTTGATGAACTTAAAGAACATATGGTTAAGGAATTACAAGGGGAA +ACATTAGTAGAACATCACGGAAGTAAAGCTAATATTAATGATACAGAGCTAATTGTGTTACTATCAGATT +TCCATATTGGAAGTATTGTATCTGATATGACTAATGGTAAATATGATTTTGAAGTTCTTAAATCAAGATT +AAATCATTTTATTAATACAACAGTTAAAGAAATTGAAGATAGGGAAATTTCTAATGTAACTGTTTACTTT +GTTGGGGACTTAGTAGAACATATTAATATGAGAGATGTTAACCAAGCATTTGAAACAGAGTTTACTTTAG +CAGAACAAATCTCTAAAGGTACTCGATTACTTATTGATATCCTAAATGTACTATCTAATGTAGTTTCAGG +AGAACTAAGATTTGGTATTATTGGTGGTAACCATGACCGTATGCAAGGTAACAAGAATCAGAAGATTTAT +AATGATAACATTGCTTATGTAGTGTTAGATTCTTTATTGTTATTCCAAGAACAAGGGCTATTAAATGGTG +TAGATATTATTGATAATCGTGAAGATATTTATACTATTAGAGATACCTTTGGCGGTAAATCTATTATCAT +TAACCACGGAGATGGGTTAAAAGGTAAAGGTAATCATATCAATAAATTTATCTTAGATAGTCATATTGAC +TTATTAATTACAGGTCATGTACATCATTTCTCAGTAAAACAAGAAGATTTTAATAGAATGCACATCGTAG +CTTCATCTCCGATGGGATATAATAACTATGCTAAAGAGTTACATTTATCAAAAACTAAACCTTCACAGCA +GTTATTATTTGTAAATAAGGAAAATAAAGATATTGATATTAAAACAGTATTTTTAGATTAA +>MW460250_1_86 # 79170 # 79538 # 1 # 
ID=1_86;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.312 +ATGGATACAATTTTTATTATAGGTGTAGCGTTTATAACTTTTGCAACATTTAACATAGTCTTTAGATTAT +TTGATTTATGGACTACAGAGAAAAAAATGGTAAGTCAAGGACAACCTCCACTAAGTAACTTTGAGTACTA +TCATGTGATAGTACCTTACTTAGTAGGTGTTATTGTTATTATACTGAGTATTATTTTTAGGGATTCCTTG +TATTCCGCACAATCAGGGTTCGGTGTTATTATTACAAGCTTTATTTACATGCTAGTTTATGTTATAATTG +GTCTTGTAGGGTCATTTGTACTTACAATATTCCAAGCTAGAAAAGCTAGACAGTATCAAACACAGGAGGA +TAATAATGAAGTTCAATGA +>MW460250_1_87 # 79525 # 79836 # 1 # ID=1_87;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.272 +ATGAAGTTCAATGATATTTATGAGCAATTAATTAAAAATGATACAGTACAAAACATTCATGAGTCTCAAG +ATGACAAAGGAAATATTTATACAATACAGTTTGATAAAGGTAATGATAAGTATTTATTTAATGTTATTAA +TGATGGATTCTTGAAAGAAATGACAAATGGTATGGTAGACCATCCTGAAGGTCAGCCATATTCAGTAAGT +TTAATCAATAAAGAAACACCTAGTATGTCAGTGAAACAATATTTAACAGATGTAGAAGATATTGTACCTA +CTATTAGAAAAATGGAAAAGGATTTCTTATAG +>MW460250_1_88 # 79900 # 80436 # 1 # ID=1_88;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.333 +ATGGATTTTAATTTTAGTGCTTTTGATAATAGCTCATTAGCAATGAGAATTAGTGAGGGTGTATACTATT +TCAATGATACGCCTTATTACTTTATTGAGCATGTAGAAGAAGAAATGTCTGAGTATGTTATTGTATATGA +CATACATGACAGAGAGGAAAAAGAAAATCCTCAGAAGAAATATAGAATAGAACCTTACCAACGTACAATA +CCGGGAGGAACACCTCTTAGTAATTTAATTAAGAGTATGATGCCTCAACGTAAGTATCCTAAGAAGGTTA +CAGAAGACCCTATATTTGTAGCTAATGTTATTCCTTTAGGAACAGATACAGTAACAGGTAAAACCGGTAA +AGGATTTTTTGAAAGAGATAAGGATAGAACTATCTATTCTCAAAAGGAACCAACTAAAGTCGTTCATGGT +CAATACACAGGTGTTTTTATAGGTCTAACAAGTGTTAAGTGGAATAGAACATATACCCCCTTAGAAAGTG +TTGTTGAGTACTACAAAAGGGTTAAAGGAGATAGGTTAAATGTCTAA +>MW460250_1_89 # 80429 # 81196 # 1 # ID=1_89;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.303 +ATGTCTAATGATGTAGTTAAGTTCTATGAAAAAGATATTAAAGACCTTATCAGAACTAAAAAACACATGT +TCAAAGACGATGAAATAACTAGTGATATAAACGATATACGAATCTTCAATGAGAAAGTCATTTGTCAAGG +TAAATGTAGAACAGATTGTTTAGTGTTAGACCGTAATGGTACAGTAATGGGTATAGAGATAAAAACAGAA +CGAGACTCTACACAAAGATTAAATAACCAATTAAAATATTATAGTCTAGTATGTAAGTATGTATATGTAA 
+TGTGCCATGACAAACATGTACCTAAAGTAGAACAAATACTTAAAAGGTACAAACATAATCATGTAGGTAT +AATGAGTTACATTAGTTTTAAAGGCAAACCTGTTGTAGGTAAATACAAAGATGCTACACCATCACCACAT +AGAAGCCCTTATCATACAATGAATATATTATGGAAGACAAACTTAATGACAATACTTAGATTGATTAGAG +ACCCTCATACGTATAGAACAGGGTATAGCTATAATGTTAGTGGTAGATATAGTGGAGGGGAAGGTAATTT +CTCCCAAACAACTCAAAGTAAAAGAATGAAAAAACCTGCTATTATTAACCAAATAATTCATTATGTAGGG +GTAGATAATACTTATAAACTCTTTACAAGAGGTGTTATCTATGGTTATAATAATAGGTGGGAAGTTATAG +AAGAAGATTTCTTTAATACTATGAAGAATGGGGTAAGAGTAATTAATGAGCAAAGACAAACCAAATAG +>MW460250_1_90 # 81174 # 81620 # 1 # ID=1_90;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.295 +ATGAGCAAAGACAAACCAAATAGACGTAAAGAGATACAGCATCAACCTGTTAACTTTGCCCCTACGAATA +CTTTAACAGGAGCTAATAATAGTTTCTTTGCTAAAAAGCCTTCAGAGCCTAAAGATGCAACATCTGTTAT +TGAATATCGTATACTATTTATTAAAAGATTTGATAACGTAACAAGTACAGATGTGAAATTACAGAAAAAG +TATGCACTAAATCTTATTAGTGAAGCACTTGATGTTAAAGAAACTTACTTGTCTCTTAAGCAAAAAGGAA +AAAAAACAGAATCTATTTTGCATACAGATAGAGTTTATTATGTTCATAGAGGTAAAAAACTTATTGGAAA +GTGTAGTATCAGAGAACAAAGAACATTTAAGGGTAAACATTTGATATTTATATTCAAAACAAGACATAGA +GTTAAAGCAGAAAGGAAAGATAAATAA +>MW460250_1_91 # 81620 # 82483 # 1 # ID=1_91;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.302 +ATGTTAAAAGGATTTTCAGAACATGTAGACAAACCTACAACTATTAAGACCTTATACAAGACCTTAACAA +GTGGTAAAGTAGAATTACTAGGTGTATCTTACGATAGTGATTACTTCCCTTCAGGTGTTACAGTACAATC +TTACATTGAGGATATAGGTAATGAAGATGAGGGTCTACAGTTTGTTAATAAGGTAAATGTAGTAGAATCA +ATGAAACAGGCTGTAGTAGGTATGAATAATCAATTAGGTTCTTCAGGTCTTGGCTATGTGAGAACTGAAC +AACTTAAAAAAGAGTTGGAAGAGACTGGACTAATGACAGATTTACTTGCTAGAGGTACTAACTTAACCTC +TACTAAGAAAGTAGATATTGTAAGTACTTTTATTGAGCCTGAGGTAACATACCAAAATATTACTATAGCT +AAAGATATTAAACTACGTTTGTATAAAGTAGAAGAAGAATCACCATTAAATGGTTACACTCATATTGTAT +ACTTACTTACTACAGAAAAACTATATGATGGTCAAACACTCTTCGGTATGCTCTCTAAAAAAGATAAGTT +ATCTAAAGGAGATACTGATAAATTATTAGCATTCTTCAGAAACAATAGTTTAATAAGTAAAAGTGTATTT +TGTGTTAAGTTATTAAGTAAAGACTACTACTTTAATTTATATAATACACATGAGACAGGGATATTCTTTT +TAGAAGACACAGATGTTATTACTATTGCTTGTGGTCAGTCATATGTTAAAGTTAACACTAAAGATATTAA 
+GTCTAGTTATGTTAAAATTGAAGATAAGACTCATAAATTAACTGAGCTAGTAATTAACCTAAAGGGTGAC +GACACATTAACTATTTTATTCTAG +>MW460250_1_92 # 82855 # 83586 # 1 # ID=1_92;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.276 +ATGGCTAGAAAAAAGAATTTAAGAAATAAAAACAGTGATATAAAAGTTGTTCCTGATAAAGAAAAAGAAA +GTATATTATCTAAGTTATACCATAATAAGTTACTACGTTCAAAGGTAGATAATGCATTAGATGAAGATAT +GAGTTATGATGATATTATAGAATTATGTAAAGAATATGATTTAGAATTGTCTAAATCAGCTATTACAAGA +TATAAAAGTAAAAGAAAAGAAGCTATTGAAAATGGTTGGGATTTAGGAGAATTAATTGATAAACGTAAAA +AAACAAGTGTAAAAGATATTAAGGAAAAAGAAACTCCTATATTAGAAGAGGAGCAACTTTCTCCATTCGA +ACAATCAAAACATCACACACAAACAATTTATGATGATATTCAAGTACTAGATATGATTATTTCTAAAGGT +GCAAAAGGATTAGAGTTTGTGGAAACTTTAGACCCTGCTTTAATGATACGTGCAATGGAAACTAAAGATA +AGATTACCGGAAATCAATTAAAAGGTATGTCATTTATTGGACTTAGAGAATTACAATTAAAACAAACAGC +TCAAGATACAGCTATGAGTGAAGTATTATTAGAATTTATACCTGAAGAGAAACATGAAGAGGTATTACAA +CGATTAGAAGAACTACAAAATGAATTCTACAAAAATCTAGATTTAGATGAGGAAAGTAGAAAATTAAAAG +AAGCTCTTGATAGAGTAGGCTATACAATTTAG +>MW460250_1_93 # 83604 # 84062 # 1 # ID=1_93;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.296 +ATGGCAGATGAGATTAGTTTAAATCCAATACAAGATGCTAAGCCAATTGACGATATAGTAGATATCATGA +CATACTTAAAAAACGGGAAAGTACTGAGAGTTAAACAAGACAACCAAGGAGATATCCTTGTTAGAATGAG +TCCAGGGAAACACAAATTTACTGAAGTATCTAGAGACTTAGATAAAGAATCATTCTACTATAAAAGGCAT +TGGGTTCTCTATAATGTATCTGTTAACTCTCTTATAACATTTGATGTTTATCTAGATGAAGAATATTCAG +AAACAACTAAGGTTAAGTATCCTAAAGATACTATTGTAGAATATACAAGAGAAGACCAAGAAAAAGATGT +TGCTATGATTAAAGAAATACTTACAGATAATAATGGTAATTATTTCTATGCACTTACAGGGGAAACAATA +CTCTTTGATGAAAATAAATTAAATAAAGTTAAAGATTAG +>MW460250_1_94 # 84127 # 84570 # 1 # ID=1_94;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.277 +ATGTTTATTTCATTAAATCAAGAAGAGAAAGAATTATTAACTAAAGAGGAAAGTAAATACACACCATTAG +AAACATCAAGAGAGTTTAACACACCTAAAGAAGAATTCATTGTAACAAGCTATAATGAAGGTAAACCTTT +AGATTACATTGCAAAAGAAGCTAAGGTAAGTATGGGATTAATTTACACAGTTCTAAACTACTATAAAGTA +GGTAAGCGTAATAAGAAATCACCTGTAGAAGAAAGAATTGCACATATCTTAAAAGATAAAAACTTAGTCA 
+AAGAGATTATTAAGGATTACCAATATATGAATTTACAGGACATTTATAGTAAATATAATCTTCATAAGAA +TGGTTTATATTACATCTTAGATTTATACCATGTGGAGAGAAAATCTGAACTTAAGGACAAAGCATTAGAA +GAGGATAATATTGTCGTTGAGTAA +>MW460250_1_95 # 84587 # 85291 # 1 # ID=1_95;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.306 +ATGAGAAATAAAAAATCATTTCAAGAGCAGTTAAATGACATGCGTAATAAAGAGAAATGGGTATCTGAAG +AGGAGTTCACTGAAGAAGTGGCTCCTCCTGAAGAACCTGAAGTAGAAGAAGAAAAACTATATACTTTAAA +TGAGTTAAAAGAGAGCTTACTAGATGCTCAAGGATTAAAAGATGTTGTAGCTGATTTTCCTGCATCTAAA +GATTTATATGAACCTAATAAGTTATATATCTGTACAATACCTAAAGGATATCAGTCTACCGAAGTACAAC +CAGGACAATATATTGGTATTAGTACTGGATTATTATCAGAGTCAGAAGACTTCAGCCATTTAAGAGGTCA +AATGCCTAGAAACTTATATGAAACTTCTCATGTTTTAAAACCTTTGATACGTATTAATAATACAAATATT +GAATACCAACAACATGAGTTACTTGAAGACATTAAGGATGACAAAAAGATATATGATGTAGAGTTAGAAG +ACTTAAGATTAGCAACAGGAGAAGAAGTTTCTCATTTAGAAATTGTTGATAATAAGTTTTTTGAAAGTCG +TATTAATGAAGTTCTTGACCGATACACTGAACTAACGGATTCCAATGATTTACTTAAGTACTATAGTAAA +TTACGAGAATTAGTAGGTAGTGACAAAATGATTTATTGTTCACTCCTCGATAAATGTGTTAAAATTATAG +ATTAA +>MW460250_1_96 # 85353 # 85751 # 1 # ID=1_96;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.321 +ATGTCAAGAAAAGCAAGTATATTCTATATACTAGTGGTTATTGTTTTGGCTTTCTCTATCTCATCTTATT +ATATATCTTCTTTCATGTATCACGACAAAGCAAAAAATGAAGTCTCTACTGAGTTATCGAACACAGGAAA +GATTAAAGAAGAAAAGAACGTAGAATTTGTCGGTGACTACACATTGAAAAAAGTGGAAGATAATAAAGCT +TATTTTATGGAAACATTACCTACTTACCTACCAGGTAGAACAGGAGATAACAGCATAGATATGAGGTACT +ACAAAACAAGTAGATTTAAGGAAGGGGTAAATTTCAAGCTTATTAGGGTATATACTGAAGATGGAGAAGA +TAATCCAATTCATAAGTATAGGTTTGAAGCAGTACCAACCAAAAAGTAA +>MW460250_1_97 # 85898 # 86140 # 1 # ID=1_97;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.333 +ATGGAAATGGCAGATTTAGAAAGATTTGATGCATTTGTAAGACTAATTTCAGATGATGAGCTTTCGGAGG +AAAGAATACTGGAGTTAAGCGTAGACTTACTAAACCCGATACTAGAAGGAGGTACAGCTTACAAGGCTAA +AAAACGTATTAAGAGTAAATTTGGTAAGTTAGAAGCAAAAAATTTTAAACGAAACTATAAATTCTTACTT +AAGTCGATAGCTCAAATAGACCAAAGGAGATAG +>MW460250_1_98 # 86145 # 86309 # 1 # 
ID=1_98;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.279 +ATGACAGAAAGGGAAAAATTAATTAAAGATATTGAAGAGGCTAATAGAGACATACAGTTACAGTTAAAAG +AAGTAGATAATTATAAGGACAGCATACGTTCTAAAGGAACAAGAAATTATATTTCTACAAAGGTATTAGA +TTCTATTATGGTTGGTTTCATATGA +>MW460250_1_99 # 86511 # 86687 # 1 # ID=1_99;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.294 +ATGGTTATACCTAGTATTAAAGCACAAAACAAATTCAAGAATGAGTTAGAGTATTATAAACAAGGTCACA +TTAGTGAAAGTAAAATGTTAGAATTAGCTTTTGATTACATCCAAGAATTAGAACAAAATAACGAATACGT +TACTAACTTGCTAGAAGAGGAGAGATATGGTGAGTAA +>MW460250_1_100 # 86677 # 87210 # 1 # ID=1_100;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.287 +ATGGTGAGTAAATTTATCGGAGTGTACTTATTTAATTTACTAATAGCTATTATTTTAACTTTAACCTTAA +TAGGTACTATTACTGACTCAATTGAGAGTACTTTAGCCCAAATAATCGTAGGGATGTTCATAATCATTAC +TATATATGGAATCCTATCAGCGTTAATACCTATTCTAGTTCATAAAGCTGTATCACCGGGATGGAGCTAT +ACTGAATGGAATGAATCCTATTACATCAGATTACCTGGAGAAGAGAACTACAAGTACTATAGTAAATGGT +ATTTAGATTTATTAGGAGTTAAAGAATTTTACTATAAGAGAGACAATGGAGAAGAAGTAAAAGAAAAAAA +TATATCATGGGCTTTTCAAGCTGAAGTAAAAAGACCTGAAGATGTTAACCACTGGAAAAACCAATTGCTT +ACTAATAGACCTTTAACAATTTTAGAATATAAAAAATTAAAGAAATTAGATAAGGAAAGTGAAATTAGGA +AACAAGAAGATTTAGAAGAATATAAACAATACAATAGTAATTAA +>MW460250_1_101 # 87225 # 87473 # 1 # ID=1_101;partial=00;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=5-10bp;gc_cont=0.273 +ATGATAAGCTCATTTGATAGTATACTACTTGTCATATACATTATTATAGCTTTTGCAGTAGCTATGGCAA +TTATCTACTTAGTATTTAAAGGTATGACTATTCTACTAGATAAGCTAATGATGTTATTATTAAGTAAAAC +TACATTAGATGTAGAAGCTTGCTCTATGATAATGGCAGTCATCAGTACAATTGTGTTTGGAATTATTGTA +CTTTTAATATGGCTAGCAGTAAATAATATTTTACTATAA +>MW460250_1_102 # 87485 # 87661 # 1 # ID=1_102;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.288 +ATGGATTTTAATGACTTTATAAACAGTGAATCGGATAGGGTAGGTAAGCCTAAACAAAAGAAGAAGGTAG +AGAATAAGCTACCTTCTTCTACTCCTATTGAAGATAAGGAAAAGAAATTAAAAGAGATAAGAAAGAAATC +ATTATATATTGATTTAAGGAGAAAAAGAAATGACTAA +>MW460250_1_103 # 87654 # 87950 # 1 # 
ID=1_103;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.303 +ATGACTAAAGAAACAAATGTACTTTACAAAGATAAGTATAGAGATTATACTATAGTTGTAAGATTAGCAG +GGAATATTATTGTTACTGAAGTAGATAAGAAACATAAAACAGCATTTACACCTATTATATTTGACAATGG +TGTAGAAGGCGTAGAGCTTGTAATGCGTATAGGTTCTGTAGAGCTTAACATGACAGATTTACGTGAGTTC +ACAAAAGAAGTATCTACGGCTCAGAAAGCTTTAGAATATTTTAATAAAAAACTTTACATTAAAGGCTTGA +CAGATGAAGCATTTTAA +>MW460250_1_104 # 87998 # 88180 # 1 # ID=1_104;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.317 +ATGTTATTAGGAATTTTATGGTTTATATGGGGATTTGTATCATACTTTGTATTGATGTTTGGAATTGAGT +TTTGGAAAGATAGATGGATGCCAGGTGTTATCGGAGCAGGAACCTTACTACTATTCTTATTTTGGATTAT +GAAATCTATCCATAATGCTATGACAGTAGTATACTTGTATTAG +>MW460250_1_105 # 88193 # 88561 # 1 # ID=1_105;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.279 +ATGGATATACTAATTATTCATTATAAAGAAACAAATAAACGGGTTTTAAAAGAAACAATACAAACAATAC +AAAATCATTTAAATGATGAACATGGTTTGGTTAAGATGACAGCAACAAAACTTAGCAGAGAGAATATAGA +GAAAAGATTTAATAACTATAATATAGTCATTGCAGAAGATGACCCTGATAATTCTTATCATTACGGTGAA +GCTGTAGAAGACGCAGATTTTATTATAGACATACCAATTTCATATTTAGATATACATGCAGGAATAGAAT +GGGATGTTGATAATCCTGTAGATATGCTAGATAGGAATCCTGATTTTATAGAAGCTGTAAATAAACTAAA +TGAAGACTTAATGTTATAA +>MW460250_1_106 # 88574 # 88921 # 1 # ID=1_106;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.313 +ATGCTAAATGAAAAACTAAAAAACCTGGAAGATACAAAAGTATACATGATTAATAGTATTGCAAGTTTAC +TAAGCGCAAGTACAGGAAAATCAAGTAAAGTATTTTTTGATGAAGGGACTATTAAAATTGTAAGTGGTGA +AACAAAAGCAGTAGAAGTCATTGATAACTTAGTTCACCCTCACTCAGGACGTTTACCTATTAAAACAACA +GAACGTATTGCGCTAGGTAGATTAACAGATTCTTTACAGTTTGTTATTTCAGAAATAGAAGTAGTTAAAG +ACCAAATTATAGATGAAGAAAATGAAGCTTACATTGATTTTGTGATGGAAGACTGGAACTGGGATTAA +>MW460250_1_107 # 88921 # 89199 # 1 # ID=1_107;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.308 +ATGCCTATGGACTTATTAACTATTGCTTCTGTTGCTTTTATAGCTGTAGTCATTGTTGATTTGATTAATG +ATGATATGAGCTATATGCTTACTGGTACTGCAATCTTAATAAATATTTGGGCAGGATTTTATGGATGGTT 
+TTTCTTACTGCAAGCAGGAATGTTACTTTTCTTACTATTAGCTAGGAAAGTTAAAGATGATAAGGAGTCA +ATACTATATTCCAGTGCTTCATTAATATGTGCACTAGGAATGATAATAAATCTTCTTTCATTTTCTTAA +>MW460250_1_108 # 89269 # 89574 # 1 # ID=1_108;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.252 +ATGAGTAAAGAAACAATTAGAAGACAATTTTCAAATGCAATTGAGATTATGGCAACAACTAAAGAATGGT +GGAACTTCCCTAAAAGTTTTGATACGAATAAAGAATTTAAAATTAAAACTTTTAAAAATGATACACTTGT +ATTTGAAGTCAGAGAAGGCAGTAGAAATTTAGGAAGCTTTGTAGTTTTTACAAACATTGATTTTGATTAT +GATAAACTAGAAGGAACTTCAACACAATATATGATTAATTACTTTGCTAAGAAATTAACTAAAGATATGT +TTAACTATCATAAATTACAATTATAG +>MW460250_1_109 # 89589 # 89939 # 1 # ID=1_109;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.313 +ATGAGAGAAGAGTTAAAACCTTTTAATAGGAAACAAGTTAATGTTAAGGGTTACTTAGATGATGTTAAGT +ATTCAAAGCGTAGAAGACATAAAGGTAATCAACATGGGTGTGTTAAAATCACAGTTACTGATGTAAAGAT +TAATGGTATACCTATTGACCACGTTAACATTGAAGTTGGTATCTCTTTCTATGAAAAACTAAAGGAGCTT +CAAGGAAAGAGAATTCAATTTGTAGGTACTGTTTATAAGTATGTTAAACATGCTAGGGGGCGCAAAGGTA +GAATTAAAGGATTTTATAAAGAGGATTATAGCGTAACTTTAGATAAGAAGTTACAAAAGGAGGAAAAATA +A +>MW460250_1_110 # 89939 # 90541 # 1 # ID=1_110;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.300 +ATGATTAAAAGAAGAAAACATTTAGACCACTCATTACAGCCTGAGAAAGGATGGAGAACAGTACCTTTTA +ATGGGTATTATGAAGCGCATCCTACGGGTTTAATTAGAAATAAAGTAACGAAAAAGTTAATTAAAGGTAC +ACAGACAAGAAAGAACCATCCTAAGTGGACTGCTCATGAGATTGTATACCTAATTAACCCTAAGAAAACA +AGTTATTCTAGGGGAGTAGTTATTGCACATACATTCCCTGAAATGATTAGTCAATCACGAGGAGACCTTA +AGAACGGTCATGTGTGTTTTAAAGATGGTGACCGAAGTAATTGTCATGTAGACAATATGTTTATTGGTAA +AGGTAATGTTAACAAAAATATCTATAAATTAAATGATTCTTATTTAACTAGAAAAGATATTGAAGAGGAT +GTTAATAATTTAGTTAATGAAAGATTATTCTCTCAATTAGAATTATTGATTAAGAAAAATGAACCGGAAA +GAATTACACCTAGTAATCACTTTATTAAAAGAGATAATAATGTGTTCAGTATCACAGATTTATCTAAAAA +CTCACTAGTAGAGTTTGAGTTAGAAATCAAGAATATTAAGTAA +>MW460250_1_111 # 90555 # 90734 # 1 # ID=1_111;partial=00;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=5-10bp;gc_cont=0.261 
+ATGAATGAGTGGTATGCTTTATGTTATTATAACAAAATAGGTAAAAAGAAAATACCTAGACAAATTAAAG +CTCACAGGGATGTATCTGTATTAGAGGATTTAAAAGATAGATTAGAAGAACAAAATCCTAAAGAAGAATA +CAAGATTAAAACAACAAAAGAATTTGATAAGGAAAGATAA +>MW460250_1_112 # 90961 # 91362 # 1 # ID=1_112;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.286 +ATGAAGTTAGAAGATAAAGTGTTAGAGAGAATTGATTCTCTTGGAAATAAAGCAGGTAACTTAAGTAATC +AAGTAATGGAGTCATTAGTAAAGTATCAAATTACGTACGGTATTATAGATATTGTTGTAAGTATTTTAGT +TATTGCACTAACAATATTTTTAGGTAAGGTTTACCTTAAAGAATATAAGAAGGTTAAAATGGATTTAAAA +GAAAGCTTATTGTATGATGATTACGATGACTTAAGTGGTATCGGATGGTGTTACACAATTCTATTAATAC +TATTAACGTTATTCTCTCTTTACGCAATAGTTGCAGGTATCCCAACTGATATTATGAGATTAATTAATCC +GGAAGTTTATGCAGTAAAAGATTTAATTGAGCAAGTTAAAGGAGGAAATTAA +>MW460250_1_113 # 91364 # 91624 # 1 # ID=1_113;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.341 +ATGAAACAAAGAGACTTTGAATTTGAAGAGGATTTTGTATTAACTTATGAGTGTGAGGATTGTAAGCATT +TCGAAGACTGGGGTCATGATGAAGAGCCTGAAGAATGTAGTGAATGTGGAAGTAGTGATTTAATCTCAAT +AATACAAGTCATGAAGATACTGAGTGTGATATGTGTCGAGGGTATATTGATATGTGGCAAGATGGATATA +GATATATGGGAGATAATAAAGAGTATATTGAAAAAGAGGAATCAGGTTTGA +>MW460250_1_114 # 91676 # 91963 # 1 # ID=1_114;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.323 +ATGAATAAAGCAGTAGAACAAGCAAGTAATGCATTAGGTCAAGGATTTTCAGCTATGGTATGGCATCAAG +TATTAGTAGGGTTAGGGTTTATTTTATTAGGATTGGTATTATCTTTACTGGTTTGGGTATTAGTAAAAAA +ATTCCATGTACCTTTTAATCACCCGACAGCTTTTGTAGTGTACTCAATTATGTTAGTGAGTATTGTTGCT +AGTTTTATTTGGGGCGGTTTACATGTAATTAACCCTGAGTATTATGCTATTTTAGAACTTAAAGGTTTTA +TAAAGTAG +>MW460250_1_115 # 91974 # 92090 # 1 # ID=1_115;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.316 +ATGACTAAAGAAGAGTTAGAGCAAAAAGTAAAAGAACTTGAAGCAGAGAATAAAGAGCTTAAAAAACAAA +TAGAACGTTTTGAAGACGAAGGAGGAAAAACAAAAGATGAACAGTAG +>MW460250_1_116 # 92080 # 92343 # 1 # ID=1_116;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=11-12bp;gc_cont=0.280 +ATGAACAGTAGAGAAAAGAAAATTTTAACACTAACAGTAAATAATTTTTTAATGTTAGCTTTAGATATTG 
+TAGCACTTGTTAGGTACAAGAAAGGTAAAATTAAGCAAGAGAATTATAACACAGGTCAAATTTCAAGAAC +TATAGTTACAACAGCCAACTCATTAGGTATTCTTTACCTAGAAGAGCAAGAACGTAAAGAAAAAAAATCT +GTTAAAATAGGTACTCTTGAAAGTGGTACTCTAAGAGGGTTTAAAAATAAATAA +>MW460250_1_117 # 92420 # 92599 # 1 # ID=1_117;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.283 +ATGAAGCATTTTATTTTAATTTTAGGAATTGTAATACTAGTTATTGCATTAGGTATTGTTTTACCGGCAT +GGATTTTACAGTTAGTACTATCTGCATTCGGAGTTAAAGTAAGTATTTGGGTATGTATCGGAATATTTAT +TTTAATCAGTGCAATAGGAAGTATGTTTAGCAGAAATTAA +>MW460250_1_118 # 92614 # 92877 # 1 # ID=1_118;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.330 +ATGGCAAAATATGAATCAAATATTAATGGAGAGAATTATATTGCAACACCGTCACAAGCTTTAAGAGAGG +CACTAGCAAAATTAATAACTGAAGAAAAGAGCTTTGCGGAGTACCAAACTAAAGGTGAGGAGCAGTATGA +ATCACAGTTACAACTAAGACACTTTGATACAATGATATCTCAGTATGAGGAAGCTATTAGAGTACTAGAA +GATAAATATAGACCTCAGATTTTTATTCCGAAAGATAATAAGGAGGAAAATTAA +>MW460250_1_119 # 92880 # 93197 # 1 # ID=1_119;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.283 +ATGAAAGCAGAATCAATAGCAAGATTTTTTAATGACAAAGTACTACAAATAGAGGGTTATAAAGTAAGAT +TCTTACAGGCTAGTTCATCGTATATTTTAGATATAGATACTATAGATGAATCAGTATTGTTTTTAGAAGC +TCAAGTATCTACACTTTCAGGTAAACATTTATTAGATACAGCTATTACAATTGAGAGACCTGAAACATTA +AGTGCTAAAGAGCTATATACAGAAATTAGTAATAAACTACAAGCTATTGTAGGAGACCAAACTAAAACAA +CTATAGAACTATCAAGATATTTTAAGGAGGAAAAATAA +>MW460250_1_120 # 93198 # 93878 # 1 # ID=1_120;partial=00;start_type=GTG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.267 +GTGTCTAATAAAACCATTACAAATTATTTATTAAATTTAGAAGGAATAAAAGGAGAAACGTATAGTATTA +TTGCTCATATCAATAAACAAACTGGTTGGGGTGATAAAGGGGATTATTTTGAAATAAGCATAAGTTATAA +AGCTGATAAAGACCCTAGAACAACGAGATATATTACAACTGAAATTTTTGTTGATTATGGTAGTAATAAT +CCAAAAGAAATTTTATTACAATTAAGAGATAAGATTTTTTCTATTGTAGAAGAACAGGTAGAGACTGACA +ATGATTTTATTGAATCTATTAAAGAAATTAATTCAACTAAAGAATTAGAAAAACTAAAGCCTTATATCAA +TAATGAATATTATTCAATGTTTAAATCTTCTATTGAAAAGGAAATACCTGTAGCTTTATCTTCTGAAGTA +CTCAATAGATGTACAGGTAAAACAAGCACATTAGCTTATTTAGCACTAGAAAAGGATTTACCCTTAGTAG 
+TATCAAATGAACCTATGAGAAAAATGCTTAAAAATAAATTCCCTCATCTTAGAGTAGCTTCTGCTGAAGA +TTATTCAAATTATGATATTAAAGGTGAAATTGTTCTAATAGATGAAGTAGATATTGACCAGTTATATAGT +GCTGATAAAGTATCTGTTGATGCACTTTTAGTGGGTATCATTAAAAATTAA +>MW460250_1_121 # 93967 # 94125 # 1 # ID=1_121;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.277 +ATGATACCCGTAATAGTTATACTTATTGGACTCATATTATTTTTATCTAGCGGTTATAAGTTGGTATTGG +GTAAGTATTATGATGATGTAGATTTAAAAATACTATTTACCATATTTGGTGTTGGGATTGCATTACTACT +TGGAGGATTTATATTATAA +>MW460250_1_122 # 94160 # 94360 # 1 # ID=1_122;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.318 +ATGAATTATAGAGATTTTATTACAGATTGTATTAGCGGTGGTTACAACGTACACATCAGTGTTACAGAAA +AACGAGTACACATTATTTCTGAGATGACATCAGCATCTTACCCTAAAAAGGAAATTAACTTAGATGAACT +ACAAGCTTATGTGTACTATATGAATAATTTTGGAAGTCAAATTACAACGGAGGGGTTATAA +>MW460250_1_123 # 94361 # 94651 # 1 # ID=1_123;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.278 +ATGGAATTGGTTATTAATATTGTAGCAGTATTGGTTGGTATGTATGCTATTTATTTCTATGTTACAAAGT +TTAGTACTGGCTTATCAGGTATTTTAATTGTTTTAGGGATGGCTATTGGTCTTTACTTCTACTTAGACTA +TTTAAATGTCAGAGAAAATGTTATTCGATTAGTTTCAGTAATGTTCGGAGCTTTCTTATTTAGTATTGAA +ATGATTTATAATAAAATTATGTTCGAAATTAAAAAAAGCAATGTTCAGAAGACTGTTAGAGTGTATGATA +AAGAGCAGTAA +>MW460250_1_124 # 94743 # 95051 # 1 # ID=1_124;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.272 +ATGTATCCTGAAATAGATGTGGAAGAATTAGCGTATAAGCTAAAAAGTACAAGAGAGTATTTAGAGAGCA +TTACAACAAAAGAAGTAGAAATTTATGAAATCTATCATCTTAAAACAGGTAAGTTAGTTTTTAAAGGTGA +ATACATTGAGGTAAAAGAATTACTGAGGAAAATGTATAAAGAAAATTTAACACTTGTAGATGTAGATACA +ATGTTAAGCATTGGTAAAGGATTTATTGATGTAATTAAGAATATATCGGCAGAAAATGTATTCCAAATAA +CATATAAAAAGGAGCTATCAACAAAATGA +>MW460250_1_125 # 95048 # 95956 # 1 # ID=1_125;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.285 +ATGATTAAAATATTTTCAGAAGTAGATAAAGAATACAAACCTATTATTACTGAAAAGTTTCCTAATGGTG +AGATTAATTTTAAATATGATGATTTAAAGTATTTAGTAGAAGAGGACTTAAGATTTGATGTTTTCTTTAA 
+ATGGGAAAATGACGCAGACTTAATGCATTTGTATATGTTTACTAAGTATTTAGAGCAACTAGGTATTAAA +GATAAAGCTGAATTTTTAGAGATTGCATATCTACCTTATAGCAGAATGGATAGAGTAGAAGAAGGACATA +ATAATATGTTCAGTCTTAAATACATTACAGAATTTATTAATAACCTTAATTATAAATCGGTATGGGTAGC +AGAACCTCATAGCCCTGTAACAGAAGAATTACTTACTAATTCTTTTGCTATTGATGTTACACTTAAATTA +TTAAATCAGTATATTGAAATGTCCGAAGAGCCTGTAACAATAGTACTACCTGATAAAGGGGCATACGATA +GATATCTATTTGATGTAGAACGTATCTTAATGGAATCTAATATTGAATCATATTCAATTGTATATGGTGA +GAAGAAACGAGATTTTGAAACAGGTAAGATTAAAGGTATTAAAATAATTAAAGATAAAAATACTTTATAT +GATAATTGTATTATACTAGATGACTTAACAAGTTACGGTGGGACATTTGTCGGTTGTAAAAAAGCCCTTG +ACAAACTTAAGGTAAGTAGTGTATCATTAATATTGACTCATGCAGAACGAGCTTTTGCAGAAGGAGCATT +ACTTAGCTCAGGATTTAAAGATATTATTGTAACAGACTCTATGTTCCCTAAAAATAATTGGGAAAAAGCT +ATTGCTAAACATAGAGCTAGAATCAACGGAACTGAATTACAAATAAAAGATATCGAAAGATATTTATAA +>MW460250_1_126 # 95974 # 97443 # 1 # ID=1_126;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.323 +ATGCTAAATCCAACTTTAATGTGTGACTTCTATAAACTAAGTCACAGAGAACAATACCCTGAAGGTACAG +AAATTGTATATAGTACACTAGTACCTAGAAGTAATAAATATTATGAACACAGTGATAATATTGTAGTATT +TGGTATTCAATCACTTGTTAAAAAATATTTTATTGATATGTTTAATAAAGAGTTCTTTAACAGACCTAAA +GAGGAAGTTATTAATGAATACAAACGTACAGTTAAATTTACACTAGGACAAGAAAATCCTGATGCTAAAC +ACTTAGAACAATTACATGACTTAGGTTATTTACCTATTGATGTAAGAGCTTTAAAAGAAGGTACTGTTGT +TCATCCTAACACACCTGTTATGACAATTGAAAATACTCACTCAGATTTCTTTTGGTTAACTAATTACCTA +GAAACTATTATTAGTACTCAAACATGGCAAGCAATGACTAGTGCTACACTAGCATATGATATGCGTAAAA +TGCTAGATAAATATGCAATGGAAACAGTAGGTAATATTGAAGCAGTAGATTTCCAGGGTCATGACTTTAG +TATGCGTGGTATGAGTTCTTTAGAAACAGCTCAATTAAGTTCAGCAGGTCATGCAATTAGTTTTAAAGGT +AGTGATACAGTACCTGTAGTGGATTTCTTAGAATCATATTACAATGCAGACGTAGAGAAGGAAATGGTTG +TTGCTTCTATCCCTGCTACTGAGCACTCAGTAATGTGTGCAAATGGTAATTATGAAACCATGGATGAGTA +TGAAACATATAAACGTATGTTAACAGAAATATATCCAACAGGCATTTTCTCTATTGTGTCTGATACTTGG +GACTTTTGGGGTAATATGACTAAAACTTTACCTAGATTAAAGGATATTATTATGGAACGTAATGGTAAAG +TAGTAATCAGACCTGATAGTGGAGACCCTGTTAAAATTATTTGCGGAGACCCTGATGCAGACACTGAATA +TGAACGTAAAGGTGCAGTAGAAGTGCTTTGGGATACATTTGGAGGTACTGAAACTGAAAAAGGGTACAAA 
+GTATTAGATGAACATGTAGGATTAATTTATGGAGACTCTATTAACTATGAACGTGCTCAACAAATTTGTG +AAGGATTAAAAGAAAAAGGTTTTGCAAGTATTAATGTTGTATTAGGTGTAGGTAGTTTCTCTTACCAATT +TAATACTCGTGATACCCACGGGTTTGCAATCAAAGCAACGTATGCTAAGATTAAAAATGAAGAAAAACTT +ATCTATAAAAATCCTAAAACAGATAGTGGTAAACGTTCACATAAAGGTCGAGTAGCTGTATATAAAGACG +GTTCATGGGAAGATAACTTAACCTTACATCAATGGCTAAACAAACAAAATGTTAATCAATTAGAAAGAGT +ATTTGAAGATGGTAAACTTTATAGAGACCAGTCGTTAAGTGAAATTAGAGAAATAATTAAAAATAATTAA +>MW460250_1_127 # 97522 # 97767 # 1 # ID=1_127;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.305 +ATGATTTATAAAATATCAAAACATAATTACTATAGTAGATTTGAGCATTCCACTTATCCTCCTGATGAGG +GGTTTGCGTATGTAGATTATGTAGATGTGATTCTTATTGGTGTAGATAATCCTAGGAAAAGAAAGATTAT +TACCTTAAAAGTAAATGAGTTCAACCCGGATGACTACAGAGTAGGTCATAAGTACAATATTATAAAAATA +CTATGGTTTGAAAAATGGGAATGGTTAAAGCCATAA +>MW460250_1_128 # 97787 # 98179 # 1 # ID=1_128;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.272 +ATGATTATAGATAAATTAAATGGAGTTAAATTAGAAATAGGTGGGCATGTCGTATCATTTAGTGTAAGAA +AGTTTAATACAATTAATGGTGAGAGACAATTAATAGACTACCATCATATTAAAAGAAATAGACAACAGTA +CTTTAGAACTACTGAAGAATTTTATAATGAATATAAAGAAATTAAGCCTGACAAAAATGAAATAGATGAA +ATGTTTGAATCTCTAGGTTATGTAGATACTGAGTTAGATGATGTAGTAAGAAACCAGGAAAAGGTTACTG +AAATATTAGGAGTTAGTGAACAATATTTAAATCAGTTATCTTATAAAGCTATAGAGGAGTATGTAGATAA +AGTAGTTACACTTGAAATTAAAGAGTTGAAAGGAGAGAAATAG +>MW460250_1_129 # 98181 # 98402 # 1 # ID=1_129;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.342 +ATGAATAATAACTGGGAAAAAGAAGGAGTTAACTATTGGGAAAAAGAAGGAGTTAACTATTGGGAAAACG +AAGACTGTCCTAGGGAATACTTAGAGAAAGCATTCATTGACCTGGTAGAATATGTTGAAGGAGTTACAGT +ACCACCTAAAGATGTTAAGCAGTTAAGAGAAGATAAACTTAGAGAAGATATTGGGTTTTATGAGTACGTA +GCTGATAAATAA +>MW460250_1_130 # 98468 # 98779 # 1 # ID=1_130;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.317 +ATGAAAAAGTTAATAGTATTACTTACAATTACTATTTCTCTATTACTAGGGGGTTGCTCTCCTGATAACC +ATGAAGGTAAAGTAGTAGGAGTAGGTGAATACAGAGAACCAACTACTTATATAAAATCAGGTAGCGTTAC 
+TGTACCAGTCATTGGTGAAATGAAATACTATGTAGATTTAGAGACAGATAAAGGAGAAGACCGTGTATAT +CTTAATAAAGAGGTCTATCATAAGTTTGATAAAGGTGATGATTTCTCTAATGTAGGTGAGAAAGTGTATA +AGAATGATGAATTAATATATAAAGGAGACTAA +>MW460250_1_131 # 98782 # 99291 # 1 # ID=1_131;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.269 +ATGTATTTAAATGATTATGTAGGTAAATTTATAAAGGAAGATAACTATTATGGATATCAATCTACAGACT +TAGTATCTAATTATGTTCAACGATTAACTCTAGGTAGGTACAAAACTAAGTTAAATGCTAATAAAATGAA +ATACGAAAGATTACCTAGTTCTTGGAAAATAATTAAAGCCAAAGATTTGTTAAGAACAGATGATTATAGA +GAAGGAGATATATTTGTATCAGAAAGAATCTCCGTATTCGGTTTTAATGGTATTATTGTATATAACCATG +ATTTTAACAATGTAACTGTTATTACTCAAAATAGAGATGGTAAAGCTACTAATCCTGTAGAGGAGCATTT +ATATCCAAAGAAAGATATTGATTATATTATTAGACCTATCGAGAGGGACTACAGGGAATACTTTAAAAAA +TCAGATTCAAAAGAAAAAGTTACTCTTTCGAAGCAAGAATATAAAAAATTATTAGAGGCTTATAATAAAA +TGAAGGAAGTGTTTAAGTAA +>MW460250_1_132 # 99293 # 99622 # 1 # ID=1_132;partial=00;start_type=ATG;rbs_motif=4Base/6BMM;rbs_spacer=13-15bp;gc_cont=0.294 +ATGAATAGTACAAAATTAGTAGAGTACTTTACAAATAAACAAGGTAAATCTCTAATATTACCTGATGAAA +ATAAAGTTGAGTTATATAGAGTTGATGTAACACCTTATACTATGAGACTTAATTTCACTTACAATACAGA +AGTTGTAGCTATAGATATTGATAAGTTACACTCAGATTCTATAGAAATGCATATACCACAAGGTCTTTAT +ATAACAACTGTTGTTAAAATTACTAGTACGCAGAGTATTAGTTCAGTTCTTCATAAGGTATTAGAGGAAT +GGGTAAGACAAGTACAAAATGATGGTATATTCGGATTCGTATGGGAGTAA +>MW460250_1_133 # 99628 # 99822 # 1 # ID=1_133;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.226 +ATGATAAGTATAGAACATGATTATACAATAAGAACTGTAGATAATAGAAAATATACTTATTATAGTAAAT +ACGAATCACTAGTTACTTTGTATGAAAATATTATGAGTAAAGATTGTATTGAAGTAACTAAATATGGGAA +AGATAAAAAAGTTATTATTGATACTAGACATATTGTATCTATTGAACGATGGTAA +>MW460250_1_134 # 99846 # 100160 # 1 # ID=1_134;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=11-12bp;gc_cont=0.273 +ATGATAAATGCAGGGCATGCTAAGTACCTATCAGAAATTTATGAAGATGATGTACATTATGAAACTATAG +ATAGTATTGTAGAAGATATACTAGATAATATTAATGATGGTATTATTGAAGAAGCTATGAAAGGTAATAC +AAGTTATCAATATGTTCTTAGAGACTTAAGAGTAGATAATGAAGTAGAATATAGAGTTATAGAAGAACTT 
+ACTAACCAAGGATATAGTGTAAACCACATTAGTAATGATATAGAGTACCCTTCTATATCTACAAATAATT +TAGCAGGGTTAGATTACTTAAATATTAAATGGTAA +>MW460250_1_135 # 100175 # 100342 # 1 # ID=1_135;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.238 +ATGATAAATAAATATAAAAAGTTATGGGATGAAATAACTCAACAAATTGTTAATGTAGAAATTATTAACT +TTAAAAATGAAACAGTAACAATAGAATCTACAGATGATTCAGGATTATCAGAGATAAGAGGTTTTGAAGA +AGTAGAGTTTATAGATTACTATGGATAA +>MW460250_1_136 # 100379 # 100480 # 1 # ID=1_136;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.284 +ATGGACTTGTTTGCAAAAATAATTATTATGTCTATAGGAGTTGTTCCCTTGTTAACTATTATTGTTGCAC +AGCTAATTACAGATTACCATGATAATCATTAA +>MW460250_1_137 # 101353 # 101652 # 1 # ID=1_137;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.290 +ATGATTGATATATACTTAGGAGAAGGTTATAATAAAGAATACTTGTCTAAAGCACTCAGATTAATCAATG +ACCATGCTCCTAGGGAGTTAAGTTATGATTTTAATAATGTAGAAGCGGATGTTAATATTCACACAATGTT +ATATGTTAAACCTGAAGATAGATTTATATATAAGGATATATCCTATTACTTCCCGGGTGATTTAATTATT +TGTATAGTTGATGATGATGCTATTGTATACCACCAAGGTGAGCAGATTTCAGGTATTAGTATTTTAAGAA +TACTAGAAGAGATATTTTAA +>MW460250_1_138 # 101668 # 101853 # 1 # ID=1_138;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.247 +ATGATAGGAATAACAATATTAATTACGATAATGAGTATATCAACTATCTCTATGTATATTTATTTTTTAG +TAGACTTGATTCAGTCAATCAGATATAATAGTTTTGATAAGGTAATTAACGTCATAACATTTGTACTTAT +GACAGTTATAATAGCATCAGGTATTTTAGCTATACTTGGAATATAG +>MW460250_1_139 # 101960 # 102250 # 1 # ID=1_139;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.309 +ATGATTCATATATTTGTAAAAGAGGATTATAATAAAGAAACATTAAGGAGTTTACTTGAGTATATTAATG +ATACTGTAGGTAGGGAATTAACTTATGGTATTAATACAGACTATGATAAGGATGTCGTGATTGAAACCGA +TGACCCTATAGATGAGGAGGATACAATTGAGTTATCAGGTACAAACATGTTCAAGGATGACTTATGTATT +CTTATAGAAGAGCTATACTGTAAGGCATTTGTTAATGGTGAACCTGTTATTATACGTAAGTATGTAGAGG +AGATGTTATAA +>MW460250_1_140 # 102250 # 102537 # 1 # ID=1_140;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.281 
+ATGATTATAATATTTTTAACTGAAAAATATGATGCCAAGGCTTTAAAGAAAGTATTAGAACATATTGATA +ATTGTAGTAGTAGAGGTCTTAGCTATTTAATGGGAAAAGGAGAAGCGGATGTATGTATAGAGAAAAATGT +ATTTAGAGAAAGAGATGATGTAAGGATTAACTCAAACATTATTGATGAAGGTAAACTTTGTATACTAATA +AATAGACATGGTTTAGAATGTAGCTACTATAGAGGTATATCATGTAATATTGGTTCCTTCGTAAAGGAGA +GATTATAA +>MW460250_1_141 # 102537 # 102830 # 1 # ID=1_141;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.327 +ATGATAGAGATATACCTTAGTGAAAATTATGATAAGAATTTACTAAAAGCAGAATTAAAATGGATTAAAG +AGACCGCTTCAAGAGAACTAACTTATGATGTTAATAGAAGTCCAGGATTGGATGTTTATGTTAATCCCTA +TAGGTGTACTAAAGACGAAGTTGAAGAATGGAGTACACTTCCTCCATTTGAAGATGATATACTTGTATTT +ATAGCGGAGACGTGGATACATGAATATCTTAAGGGTGAATCAATAGGTGTAGATAGTATGGAAGAGTATG +TAAAGGAGATGTAA +>MW460250_1_142 # 102834 # 103091 # 1 # ID=1_142;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.283 +ATGTTTAAGGTATATTATACAGTCTACCATAGAGGTAGTATGAAAACTATTAAGGATAAGCTAGATAGAA +GTAGTTTAATATACTTCTTGTATGATACTTGGTATAAAGATATTAGTAACGTATTCCCTAATCACTATAA +TAAAGAGTTTGGGAGTAAGAGTGATGATATAGATATAGATAAACTTATTGAAGCGGTTAATGAGGAAGGT +ATATTACTTATCAATAGAGGTAATTATGTTACAATAAGAGAATGGTAG +>MW460250_1_143 # 103169 # 103408 # 1 # ID=1_143;partial=00;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=5-10bp;gc_cont=0.312 +ATGGATACTTTAACATACACTATTATTCATAAAGAATCTGATAGGGTAATAGCTAGCGGTTTAAATGAGA +CAGAAACTATGAACTTAGTTCAAAGGATGATAAATACTAATCTAGTTACTGATATATCATTAGATGATTA +TAAACGCAGACCACATGGAAAGATAGATGTAGTCAATTTACTAGTAGATATTAGAAGACAAGGCGTATTT +GATTTCAATCACATTTGGCACGTAGGATAG +>MW460250_1_144 # 103419 # 103766 # 1 # ID=1_144;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.305 +ATGATAGTTATATATACAGATGTTTCTAAGGATTATTTAAAAGACGAGTTCTTACCTTGGCTTAATGAAA +GGGATAGATACTTAGAATACTATAAAGATGAATTACCTGAGGATATAGATTCCTCTTATATTGTATCAGT +TGTATACTGTAAGGATATGGAAGGTCTATTAGAAAGAAAAGACATTGTTCTTGATAATAGTTATAATGAA +CCTGTAGCTTTATTAGGTGTTCCTGAGTTTTTTGGTAATTATAGTAATTATTTCTATTATAGAGGAGAAA +GTATTAGTAAACATGACCTAGGAGAAATTGTTAGGTTAAAAGCTTGGCAACGTATGGGTGGGGATTGA +>MW460250_1_145 # 
103975 # 104313 # -1 # ID=1_145;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.198 +ATGAAAATAAACTATATTCCAATGTGGGACAATGAAGATGTATTACAATATGCAAAATCACAATTATTAG +TAAATGAATTAGAAACAAAAGAAATTATATTCAAAAATTATCAAATATCAGATGATTTAGATGGCGGAAC +AGATAAAAAATATTATGAAATATATGAAAGTAAATTTTATGTAGATGAGGAAACAACAAAAGAAGAATTT +AATAATTTAATTATAGAAAATGAAAAACTAATAAAAGAGTATAAAACACAAAACGGATTAATTAAAAATT +TAATTAAATCACAACATGAAGTAAATGAATTTGAATACAATGTTATAAATATTCTATAA +>MW460250_1_146 # 104624 # 104932 # 1 # ID=1_146;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.366 +ATGGATGTAAAAGAAATTGCAAATACTATAATGGAGTTGTGGCAAATGGACGGCTACAGATGTGCAGAAC +CTCCATTATATGAAAGCACACTAAACCACACACGCACACACACGGCGTTAATTGTTTCTATTAATGGAAA +CTATGACACAGTGCAGATGTTCCGCAAAACGCCTATAATGAGCATGAGAGGGCAAAGCCAACCGGCTAGC +ATGTTAGTTAATGTGATTGACGATGTAATTATAATCGTATATGAAAATGTAGTGTACGGAGTTCAAAACA +AAGAAATAAAATTTATTGAAGAAATTTAA +>MW460250_1_147 # 105138 # 105422 # 1 # ID=1_147;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.337 +ATGACAAACAAAAATTACTTATACGAAGAAACTCACACAGTACAAGGGCAAGACATTACGGCTTTCAGAA +TTCCAAATGACGCAAACGGCAACCCACGTTATGTAGTGCATTTCATGGATTTAGATATTAAACTAGCAGA +CTATGACAACATCAATAAACTATACGGCTTTAAAAAATATACTGCTAAATGGTTTGGCGGTGGTGTAGTA +TTCCAAAGTTATAATATAGCGGATACTCTAGAATATGCTTACACACAAGTTAAAACAAATAGAATTAGTC +AATAA +>MW460250_1_148 # 105497 # 105688 # 1 # ID=1_148;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.177 +ATGAAATTTAAAATAGAAAAAAATAATAGTGATATAAAAACTTTATGGAATTTAGCAAAAAATGGATATA +TGAGCTATCAAACTGTACACAATATATTTAAAAATGAATCAGATGAATTTATTATATTTAACAGTAAACA +AACTTATAATAAATTTATGAAATTAAGATATAATAGAAGTGCAATACAATAA +>MW460250_1_149 # 106005 # 106493 # -1 # ID=1_149;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.276 +ATGGTTGAAAATAAAATTAATGAAATTTGGAAACCTATAAAAAAAGAATACTTTAATAAATATAATTTTT +ATGTTTCTAATTTAGGTAGGGTAAAAATAAATGATAGGTTGAGTAAAGTACATCAAGACCGTGACGGATA +TTTAACCGTAAGGGTCAATAATAAAAAACACATGGTGCATAGACTTGTATATGAATATTTTGGCAATGAT 
+TTTATTAAATCAAACCATGTACACCATATAGACGGTAACAAACAAAATAATTGTATTGATAACCTAGAAT +GTATTAGCCCGTCAGAACACAATAAAAGGCACCATAAAGACAATACTTTCAATAGGTATAATAGGGGTTA +TGCATTAACAGAAGATGAAAGAAAAGCCATAGCAAGCAAATACAAGCCTAGAAAGTACACTCAACCTATG +TTAGCAAAGGAATATAATATAAGTGAAATAACCGTTAGAAGAATTATAAAAAAATATAAAAAAGATTAA +>MW460250_1_150 # 106661 # 106819 # 1 # ID=1_150;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.245 +ATGATAAAATTTAAATGGAAAAACAAAACAATCAAATCAACACAAAAAACAGATAACATTCTCTTACTTA +TTATAGGGGGTTTAGTTGCAACAGTCACACCTAAACTTGTAAACTGGTTTTTACTACTACAAGATAATAT +AAATATTTTTTTAAGATAA +>MW460250_1_151 # 106889 # 107020 # 1 # ID=1_151;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.242 +ATGAAAAAAATCACAACAACTTTAAACTTAATCGGCATGAAAAATAATGAAAGGTTTACAGAAGAGTTAA +AAAACTACCGTCAAGATGTTACTTTCTTGAAAGCAAATAAAATTGTAAAATATTCAAAATAA +>MW460250_1_152 # 107188 # 107424 # 1 # ID=1_152;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.350 +ATGAAATTAATCAATAGAGATAATGAAATCGTAATTAGCATAGCAACACTTGAGAGTGTAAAACAAGCCC +TAATTTGGGAGTACATCGACCACTTAGATAATAACATCCTAGACAAAGAAATACATGACCAGGAAGCGGT +TGTTATTACTTCAGACACTTTGCAATCACTCAAATTTGCGGACACTATGGAAGAACTAGAAGAATATGTA +AACGACATCGGTTGGAAATTAGTTTAG +>MW460250_1_153 # 107504 # 107974 # 1 # ID=1_153;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.380 +ATGACAAATACAATACAAGCATTTTTACAAGGACAAGAAGCAAGCACAGTTAAGGACGTAGCAACTCATG +GAGTACAAAGCGGAGCAATTGGCAAATTAATCTACACATCAGACGTAGTAAACTTCTTTGATAGTTACGA +GCAGGACATTGAAGCGGTCATCACTGAATACATTGAAGAGGTTACAGGACAACAATATTATGACTTATTG +AACTATGAGCTTATGAGAGACCTCGAGAATTATGCAAATGTAGAATTTGAAGACGAAGACGAATATAATA +ACATTCAATTTGACCTAGCAGAAAACATTGCTTCTGATGAGGTTGAAGGATTTGAAGACATGGACGAAGC +AGACCGGGCGGAAGCAATCTATGAGGCTATGGATGATGTTGAATTAGAACTACAAGAAACTGACAAGGTT +CAATATGTTAATCTAGCGGTTGAGATTGTAGCTCAAAGAATGGCACTATAG +>MW460250_1_154 # 108004 # 108129 # 1 # ID=1_154;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.310 
+ATGAACAATAATACTACTTCATATAGTAATAGCCCATATGGTAGCTTAGAAGAGCTTAGAGAAGCTTATG +ACCTATCGTCATTATCTACTGGTGAGATTAAAGAACTAATACAAACATTTGTTTAA +>MW460250_1_155 # 108214 # 108393 # 1 # ID=1_155;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.256 +ATGAGAAACTTATTAGAGCAGGAACAATTAGAAAAAGATGTAAAAGACATTATTTGGGTATTAGATAGAA +TGATTGCTAAAGGAGAACAATACACTGAAGCTTACGATATTTTAGTTAACAAATTAGAAAGACAAGAAAA +AAGAATCGTAGAAATAAAAAAACAAAATGGAATATTTTAA +>MW460250_1_156 # 108727 # 108963 # -1 # ID=1_156;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.287 +ATGAAACTATACCAAGTAGAACATGATAATTGTGAACCTTATGAAGATAACTTCCATTTTAGAGAAGATA +AGATATATACAGATAAGGAGAACTTAATTAAACGTATTAAAGAAGAAGGTTATAAAGAGGAAACAAACCA +TAGAGGTGAGCAAGAGTTCATTAAAGGAGACCCAAGAGATTTCTATGGAATGGATATGATTACTATACAT +GAATTAGAGTTTGTTAATAATACCTAA +>MW460250_1_157 # 108965 # 109450 # -1 # ID=1_157;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.300 +ATGAACAAAGAACAAGCCAAACTTAAACTCGAAACAAGTATTATTAATTATGAGAACCAAATAAAATTCT +TAGACCCTGCAAGTATGTATACTAGAGGGCTTATAGATGCAAAAGGTTACTCAAAATTGGCGTTAAAAGA +ATTAGAAGATACAGGTAAACACTCTTATGAAGATACTACATGGAAAGATAGTTATGCAAAGGTATTTACA +GATGAAGAAATACTAGAATTCTTACTATCCAAACCAAGAGTCACATTTAAAGGTAATCAAGAAAAACTAG +ATGAAATTAAAAAAGAAAGAGAAAAAATACAAAAAGAAGCTACTAAAGACTTACCTAAGGGCAGTCCACT +AGGTGACTTATCAAAAGAAAATTATGAGAAATTTTGGGGAGCATTACAATGGTCTAGAGAAGAAAGAGAA +AAGTTAACACAAGAATCAAGAGCTTATTATGAAAACTATCTAAAAAAAATCAAGGAGAATAAGTAA +>MW460250_1_158 # 109463 # 109870 # -1 # ID=1_158;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.282 +ATGGGACTAGACTTTGAAGTAATCGGTGTGACATTAAGTAATAGAAAAGTGGAACAGAAAGGCTTACAGC +ACTTTATTAATAACGCTAGGTATAGACATATATTAGAAAAGAATTATTATAAAGGATTTAACTTTGAAGA +TGACTTTAGAAAACCAGGATATTTTATGGACTTACTTTTAAGAGATGCAGAGACATATTATGATGAGTTT +GAAGAATGGTGTGAAGGTGTATTCGTACTAACTAAAGACAAGTTAGTTAATCTAATGAAAAATGAGTTTA +ATGAAAAAACGTTTAAAGGTACACATGATGCAGAATATTATTATAGATTAATGTCTCATATATACAATGT +AGAACAATATGAAGGTAAATTCTATGACTTTTACTTAATCATGAGTGTAAATGTATAA +>MW460250_1_159 # 
109870 # 110301 # -1 # ID=1_159;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=11-12bp;gc_cont=0.315 +ATGGAAAACTATAAAAACTTTATTATAGAGGAAATGAATAAAGCCCATATCTTAGTAACTAAAGCAGAAC +AAATTAAAAGGAATAGAAAATTAGCAGAAACAGAACTAGAAGAAGTATATAAAAAAGCAGAAGCCTTCGA +TGAAATTGTAAATGAGTTACTTTATCAATTACAAAATCTAGAGAGTTGGGATACTCTAGACCAAAAAGAC +TGTCAGACATTAAAACAAATACTAGAGGAAAACATAAAGGAGGAAAAACAGTTGAAAAGATACAAAGTAA +AACGTACTATTACTACAGAAGAGGTAAGATATATAGATGCAGAAACAGAGGAGGATGCATGGTATAGTGT +AGAATATGAAGATGAAGGTGCAGATACAGCACACTATAATGCAGAATATGGTACGTGGTCTTATGAAGAG +GAGGAAAAATAA +>MW460250_1_160 # 110304 # 110495 # -1 # ID=1_160;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.328 +ATGATAGAAATTAGTATCTCATGGACTTATTTAATATCATTCTTGCTACTATGGTCAGCCGGTATCTTAT +ACATTAACTACCTTGTTTATAGAATAAGGTTAACAAATAAGGAACGTAAAGAAATGAGTAAGGAGCACCA +CCGCAATAGAGAAGAGATAAAACAACGGATAGAAAATAGGAGGGATAAATAA +>MW460250_1_161 # 110492 # 110977 # -1 # ID=1_161;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.288 +ATGAATAAAACATTTTTTAAGTTCCTAGGTAAAAACACATTAGAGTATTCAAAACAGGGGCTAGGTTTTC +TTGTAGCCCTCCCTATAATGTTAATTATATTTTCTGTATTCCTAGCATTCATTATAGGTATTCCTGCAGT +TATTATTTACGCTCTACATGCATTAAATGTAGACAATGATTTTATTATACAGTTAGTACCTGTTATGTGG +TTCATAATACTTTATGGTATTGTAAGAACAGGTGAGCACAAAAAACCATTTGTTAAACTAAAATTAAAGG +ATTACTTATTATCTATATTATATCTAACTACTATTACAGCTATTAGTGTTTTAGAAAACTATTTGCTCTT +TCAATCATTACCTTTTACAGGGGATGTAAGAGCAGTTATAACACTATTATCATTTATAGTATTTGTTGCT +GTAAATAGAGGTATTTGTAAAATAGCCATTAAGAGCTATAAAGAATATAAGGAGGACTCACAATGA +>MW460250_1_162 # 110970 # 111401 # -1 # ID=1_162;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.282 +ATGAATATCAAATATATTGATTTAGTATTAGAAAATTGCGATGTTGTAAGATTAGAGCCTAAAGACGTAA +GCAGGTTCCATATATCAGGTATTACAGAAGGTATAGATTACTATGGTACATATAAAGGGACTTCAAGTAT +AAGTCGAACACGTCACTGTACTTATTTCGGTATTCTTATTGATAAGCCTATGGAAATACCTCAAGTTGGT +TTTGCTTATCCTGATAATACGAACGCTTATGAAATGATTACAGCATATTCAGATATTACAGCTATAGATA +TTATTTATGAAAATGACGCAAATGAATATATTTATGTAGACTTTAATGAATACAATGATAACTATAATAT 
+CAATCAAAAGAATGATTATTACAATAATATGTTAGAAATTACAATTACAGAAAGTAATTCCAAGGAGGAA +GAAGATGAATAA +>MW460250_1_163 # 111415 # 111957 # -1 # ID=1_163;partial=00;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=5-10bp;gc_cont=0.243 +ATGGATAAGATAAATCTTAATAAAAAACATGAGGGCTCTACTGTAGTTAACATATCAAATAATATTACAT +TAAAAATACAATGTACAGACCTAAGAAAAGAATGTGATGATTCAGAAGCACCTACTACCTATACCCATTT +TAAAGCTTATATCGTATATAATATATTCATTGTAGTTAATGATAGAAAACAAAAGAAAAAAGTTAAATAT +GATTGCTATAATGACCATGTAGGCAGAGGTAATGTTAAAGACCTATTGAAAGTAAAAGATGTTATCTTCC +AGTTATCCACTCAATTAAATACTAATGAAATTATTAAAATATCAGGTGCAGATGAAAGAAGATATAAAAT +ATATAAATATTTTATAGAAAAAGATATAAGATTTGAAGACAATATGTATTATAGTAAAAGTAATATATGG +ATTATAAATAATTTTAGCTTATTACAAAAGTTCCAATGGAACACTGTAGTAACTAAAGACGGGGACTATA +ATAAAAAAGAACTTAAAAAGGTTGATAAAGAATGGAAAGAATTATTAATATAA +>MW460250_1_164 # 111969 # 112457 # -1 # ID=1_164;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.315 +ATGAGAGAAACAAGGGAATATATTATGTTTTGGGGTAAAGAGGATATTTATTCTAATTTCTACCCTATAA +AGTTTAAACACCAAGGAAGAACATTTAATAACTCAGAGCAAGCCTTTATGTGGCGCAAAGCAAGATACTT +TAATGACTTTCAAATAGCAGGTGAAATACTAAATGCTAAGAACCCAAACCATGCTAAAAGTTTAGGTCGT +AAAGTTCGTAATTTTAATGAAGAACAGTGGAATAAAGTAAGATATAATATTATGGTAGAAGTAGTTAAAG +ATAAATTTATGACTACACACCTAAAGCAAAGAATATTAGACACAGATGTACGTAAAGATTTTGTAGAAGC +TTCACCTTATGATAAAATATGGGGAGTAGGTCTTAAAGCAAATGACCCTAAAATATTAGAGCAAAGTAAC +TGGAAAGGTCAGAATCTATTAGGCAAAGTAATGGAAGATGTTCGAGTACATTGTATCTACAATAAGTAG +>MW460250_1_165 # 112470 # 112868 # -1 # ID=1_165;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.251 +ATGAAGAAAAAATATTTTAAAGGTCTTAAATTGAATGACTTTGAAAAAGAAGTTTTTGGGATAAAAAAGA +ATAAAAAATATAAGAAAATGAAGAAAAAACTAGGTAGAAATGAACCTAAGTATTGGAATTATGACATGTC +CTTTTTTATTCAGCTATACGCTGACCTAAATGCATTTATAGAGAGTAGTAATCATGTGGATATGGAATAC +CATACTTTTGTAGATGTAGACGGCAAAGAAAGAACACAAATAGATATGATAAAACATATTTTAAGCTTAA +TAAAATATTATCATAAAGAAATGGATGATTTTGATATGGATAAATATGATGAGCTTGAACAGGTACAAAG +TAAAATATTAGATAATTTTAAAATTGTATTACCATCACTATGGAATTAA +>MW460250_1_166 # 112865 # 113572 # -1 # 
ID=1_166;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.297 +ATGGCTATATACGTTGTTCCCGATATTTACGGAGAATACCAAAAATTATTAACAATTATGGATAAAATAA +ATAATGAAAGAAAACCTGAAGAAACAATAGTATTTTTAGGAGATTATGTAGATAGAGGTAAAAGGTCAAA +AGATGTTGTTAACTATATATTTGATTTAATGTCTAATGATGATAATGTAGTAACTCTGTTAGGGAACCAT +GATGATGAGTTTTATAATATTATGGAAAACGTAGACCGATTAAGTATCTATGATATTGAATGGCTCTCAA +GATATTGTATAGAAACACTTAACTCTTACGGTGTGAGTACAGTAACTTTAAAATATAGTAGTGTAGAGGA +AAATCTAAGAAATAATTATGATTTTATTAAAAGTGAACTAAAGAAACTTAAAGAATCAGACGACTATAGA +AAATTTAAAATACTTATGGTTAATTGTAGAAAGTACTATAAAGAAGACAAGTATATATTCTCTCATTCAG +GTGGGGTTAGTTGGAAACCTGTAGAAGAACAAACAATTGACCAATTAATATGGTCAAGAGACTTTCAACC +TAGAAAAGATGGTTTTACCTATGTATGTGGTCACACCCCTACAGACAGTGGAGAAGTAGAAATTAATGGA +GACATGTTAATGTGTGATGTTGGCGCAGTATTTAGAAACATTGATTTTCCATTTATTAAATTGGAGGTTA +AGAAATGA +>MW460250_1_167 # 113672 # 114226 # -1 # ID=1_167;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.283 +ATGATGGTAAACGTACTACCTAGTGTTTATGACGCAGAAAAAGGTGAATGGGTAACACTATTAGCAAAAC +CTATAGCAGAAGAAGTACTTAAAATTATGAAAGCAGATTATTTAGAGCATAAAGGAAATATTGGATTTTT +TATATCTAAGTATAAAGACGGAGATTCTAGTATAGAACAACCTAATGTTGTAGTATTCTATAACGAAAAA +GATTATGATACAATGGAGTTAACAGAATCAGAACTTACTAATGCATTAAATGAATATATTGACTATACCT +TAGATGGCAAGTATAAACCATTTTCTCTTAATAACTTTATTAATTATTTGGAAGATTATGGATACAGACT +ACCTGTTAACTTTGAAGTTGATGTTACTATTATTTTATCTGACGGTCAAAAATTTACTTACCCTAGAACA +AGCTCAATAACTAATAATGCTTCTATAGTAGATGCACTGAAAAGTGAAGACCAGTATATAGAGGTTAAAT +ATATTTATAATGACCATGCAATAGATGATAAAAAATTAGCACATGGTAATGATACTTTAAAATAA +>MW460250_1_168 # 114242 # 114559 # -1 # ID=1_168;partial=00;start_type=GTG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.283 +GTGGAGAGAACATTAAACTTATATGATTCCAAAGGTAAATTATTAAAATCTTCAGAAAAAATTACAGGTG +CATCCGCTAAAATTATAATTGAAAAACTGACACCGAATACAGTGTATAGCCAAGGGTCTTTTAAAATATC +ATGGACTATTAATGGTAAAGAGTCTATATTAACAGATGTTCCCGAATTTACAACAAAAAGTAATGAAGAC +AAACAAGAAATTGTTTTTAATACTTTAAACATTGATTCTAACTCTTTCGTAGTTAGTGAAACAGAACCAA +GTGATAAGAGTAAATTGTGGTTTAAACCAATAAATTAA +>MW460250_1_169 # 115545 # 116093 # 
-1 # ID=1_169;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.237 +ATGAAAAAAATTTATATATTAGAAGAAGAAATAGAGGAAATGGACTATGACTTGTGGGAAGAAGATACAG +TGTATACAACAAGTTATGAAATTTTAGGTTATACTGATTCCCTAGAGGATGCAGAATATATTAGAGATAA +CTATGGAACAAGTAATCCTATATTCATAAATGAATATCCTTATATAACAAAAGAGAAGTTAATAGAAGAA +CAACGTTACTTTAGATACAATAGTTATATTGAACTTAAAAGAGTTAATGGTTACTTTGAAATATCTGAAA +TAAATGACTTACAGGTTACTGAAGACTTTAGTATTAATAAAGATGATAAAAATTTTGATTCACCTTTTTC +TATAAATATGTTTTCACATAATAGAAATAGTATAGGTATAGAATTCATTATGTTTTCAGAATATGATGAT +AAAGAAGATATTATAGAAAAAGAAAAAAATTCTTTTTTAATGAAATTAAAATATCTCCTTAAACATAGTA +AAGAAGCAGATATACGTAGCACATCAAAAATTATAGATTCAATTGATAAATTAACTTGA +>MW460250_1_170 # 116097 # 116315 # -1 # ID=1_170;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.260 +ATGAAAAATATTATTAATTTTTTAGTAGACTATAACATAAATTTTAGTTACTCAGAAGATAGTTTAAATG +TTATGAATAACTCATACTTAGTAGATAAACATGGTACACAAGATTATGAAATTGTAGGTAACTATGAACA +TATTACAGGGGTATTTTCTTATCAAACAGAAGAAGAGGTTATAGCTAAGCTTAAAAATCTTATCGGGGTT +TGGGAGTAA +>MW460250_1_171 # 116316 # 116510 # -1 # ID=1_171;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.272 +ATGAGAGATAAACGAATACACTCAGAATTATTATATGATATCATAGGTAAACACATACAAGAAGAGGAAA +ATATTACACCGTATATAGAAGCTATATATGTAGATATGATGAATATTATTGTAGTAGAATATACTTTTTA +TAATGAAAATGGAACAAGAATGCTAGGACAATATCCGATAGGAGAGGTTATGTAA +>MW460250_1_172 # 116500 # 117237 # -1 # ID=1_172;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.255 +ATGAACTTAGAAAAAAGTTTCTTATTATCAACAATAGAATTTGGAAGTACTTATCAAGGAACATCAGATG +AGCATTCAGATAAAGATTACATGAGTTTAGTTGTTCAACCTTTATCAGATACTATATTTAGGAATAATGA +AAAAGCAAGTAAGCATACAGAAGTATCAAGATACTATGCAGTAGAAAGATTTATCTCTTTAGTTCTAAAA +AGTGGGTTTGATAATGTTCTTAATTTATGTGCTCAATTAGAGCAAGCTAAAAATACTAGATTCAATAAAA +CTGTTTTAGATTTATTTTATGATGATTTTATATTTTTAACTTATGTTAGAGCTAATTTTAAGCCTATAGC +ATATTCTGTTATCGGTAATATTAATAATATACTAAAAAAAGGAGAATTAACTGGTAAAGACCTTGTTAAG +TTTTATACATTCTATAATCATTTAGAATACTATAATGATTTATTAGATGATTTAGATAACTTAAATGTTA 
+GCTATAAAGACTTTGCAAAAGTTAAATATATGCCAAAAGAAGTATTAGACAATAAGAGAAGCAATGTAAG +TATTGAAAAGAAAAAAGATTTGGTTAATAAAGTAGAGCCCCTAATTCAAGAAGTAAAAGATAAACTTAAG +TCTAATGAATCTAATATCAAACACTATAAAGATGCTATGGAATTAGTAGAAAAATCTTTGAAAGACAAAA +CTGTAGAATTCCTTACGGAGGTCTATAATGAGAGATAA +>MW460250_1_173 # 117300 # 117404 # -1 # ID=1_173;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.238 +ATGAAATATATTTTAGGATTAATAACACTAGGGATTATACTATTTAAAGTCTATGAACACTTTAAGTATA +AACAAGATGAAGTTGATACAGAAGAAGATATATAA +>MW460250_1_174 # 117416 # 117655 # -1 # ID=1_174;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.279 +ATGGACTTCTACCAATTTCTAAATCACGAAAATGTAAGAGTTAATTCTATAACACCAAGTCAAAAGAATT +TTATTAGAGAAAACTTAGAATTAACTAACCTAGAGGATACTGATATTGATTTTATTAGTTCTAAGCAAGC +AAAGGAAGAAATAGAGAAAATTATAAGAATTAAAAATGAAGAAGAATATGATATAGCTATGGATGCTTTA +GCAGGGTGGGTAACTAAACATGGTTATTAA +>MW460250_1_175 # 117657 # 118046 # -1 # ID=1_175;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.318 +ATGTTTAAAAAAGCACCTCAATACATTATGGAAAAAGTAGAAAAAGAAAACAATATTCTAGGAGAAGATT +TAAGTCTTGATATTTATTATAAAGGAGTTAAACTAACTGTTAAGAGACACCCTGAAACTGGTCATCTAAA +TGGATATATAACTTTACCTTCAGATATCAATGAAAAAGAATATGACTCCTTAGAAAGACGTGCCCATAGA +GGTATCACTTACGATGATTATGACTATGAGGGTAAGAGAGTATTAGGGTTTGACTGTGCACATGCCTGGG +ATATGACACCATATGCTATTATAGGTTCGTTAGATGACCAATATAGAGATTTAGAGTATGTACTAAGTAT +TTTAAAAGATATGGCAGAATACGTTAAAAAGGATGAGTAA +>MW460250_1_176 # 118145 # 118318 # -1 # ID=1_176;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.328 +ATGGAAAAAGTAAATCATGAGTTTCTAGCAGAATTGGCAAAGAGTAATAGTCCTGTACTAAATTCAAAAC +CACTTCAAGATGGAGACTATAATATTGAATTTGACTATGATGGTTTTCACTTTGAGTTCTCACAGAAAAA +TGGTTATTGGCGTTGGTCTTATAACGCTAAATAA +>MW460250_1_177 # 118359 # 118841 # -1 # ID=1_177;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.290 +ATGGCAAATGAAAAAGAGATTATAAGAATGGTTAATTATCTTATTGATAATATGTCTATGTGGCATATAA +ACTATGCTAGGGCTGTGTTAATACCAAGTGAGGTAGAAAAGATAATTAAAGAGCATGAGAAGTTTGATGA 
+CCTCCTTAAGAAGAGAGGAGAATGGTTAGTAAAAGGTTCAGATACAGATAACATTGATGATTTAGAAACT +TATAATCAAATAATGAATAATCAAAAGGATGAAATGATGATACAAGAAATTGATATCTATACCCAAGGTA +AAACAATAACAATAGATAATGAACACTATTCTTCAGATGATTTAGGTGAAGTCTTAAATAAGCTAGAACA +ATCAGAAGATATTAAGATAAAATCTAATTATAAGTCATTATATGTAGGATATACTAACGTAGTGGGTTAT +GAGGTTACTTACGCTAGTTCTTACGAGGAAACTTTTAAAAATGACTTAGAAAAAGACCTATGA +>MW460250_1_178 # 118891 # 119433 # -1 # ID=1_178;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.276 +ATGGATAGAATTATTGGTAAACATAACTTAACTCAAGATTTGAGATTAGGTGATAAAGTAGAAGTTTATG +ATGCTCATAAATTTAAAGAAAATGAAGATGGAACTATTGAATTAGGAGATAAAATAACTGAAGGAATTGT +TGTAGATTATAAAGGTGACTTTACAGGTAATACAAGTGGTTTAGTTACCCTTGACTCTTCTGAAAAAGAA +TTAATCATTGGTGAATATAATTTTAAACTTATTGAAGAAGGTAATTTACAAGCAGTTTATGATTCTGTAT +CTAAAAACAAAGTAGAAAGTCTTTCTGAAGATTATGACATGTATAGAAAGTTACTTGGAGTTAAATCAGG +AGAACTAGCAGGTATAGAAGATGAACTAGAGTACTTAGTCAGACAATATAATAGTAAAGTAGATAATTAT +AATGGACTACTAACATTATCTAAAGAAAAAGCTAGAGAATTATCTCTATTAACAGGAGATAAAAAAATGA +TTCCTCATATGAAAAATAGAAGATTAGAATTAGGTACAGAAGCAGACTTTTAA +>MW460250_1_179 # 119433 # 119966 # -1 # ID=1_179;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.270 +ATGGTATACGATAGTATTATTTCTCGAACAATGGCAGTATCAATTTTAAATAAATGGATTGCAGAATTAA +TTACAGATGTTGATTTAGATAAATGTAAGTTTACAGAAGAAGAATATGGGAAAGTAGTTACAAATTCAAT +CAATAAAATACAAGATGTATTAATTGAAAAGAACTATGAGGTTACAGATGGTGAATTGTATGATATTGTT +TGTACAGAATTAATTAACCCAATTAAAAATAATACAGAAGAAGAAAAACATAATGAAAAGAACGATTTAT +TAGAACATTTAGAAGATTTGGCTTTTAGACATGATATTGATTTAGGATATGTTAGTGATGGGTCATATAA +CTTAACTGTAACCCATTGGTTAATGCAAGATGAGTTTACAGATGTTAACATCAAAGTTAATAATGATGAA +GACTTCTATACAGTTACAATTCCGGAAAGTAAATATTTTTGGTTACCTATTACAAAAGAGAACTTAGAAA +TGTTCTTAACACAAGACCCTATTAATAAAGGAGAGGTTAAGTAA +>MW460250_1_180 # 119969 # 120133 # -1 # ID=1_180;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.267 +ATGAAAAACCTGATTAAACTTTTATCAATGGTTGTAGTAACTATCTTGACTTTTTCACTAACTTATGTTA +TACTTAAAAAAGAAACAAATAATAAAAGAAATGGTGTAGCGCCTTTTGATTTTTCATTAGAAGACCACAT 
+TCACCTAAATAAGGAGATTAAATAA +>MW460250_1_181 # 120136 # 120411 # -1 # ID=1_181;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.264 +ATGGCTAATAATATTTGGGCTGTTGTATTGAGCATTGTTATTTTACTCATCATTTTATTAATACTTTGGT +TTCTTTTTAGAAAAAAGGTAAATGGTAGTAGTAAGAATGTAGAAATTCAAAAAGCAGAAGAAGATAACGA +TAATAAAGAACAAGAGGTAGAAGAAGCTCAGTATAGAGAACTTAATGAAGAAGAGAAAGAAAAAAATGAA +AACTCTAGTAAAGATTATAAGTACGATAAAGAGAAAGTAAAAAATAAACTTAAGGAGTTAGAGTAA +>MW460250_1_182 # 120411 # 121256 # -1 # ID=1_182;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.307 +ATGGGTAGACGATTAATAGATAACTCAGAATTAAATGTAATTAAATATGATGGTCTACCTGATTTCTTTT +CTGCTTTAAAAAAGAATAGAGTTTCAGGTAGAGATAATTCATCAGATACAGGTAGCTATGATTTTACAGG +GACTCATAGTTTTCAAGAAGCCTATAACTTAATGGTTAAGGGTGATAGAGAGTCATATGATATGGTAGTT +AAACTTAAAAAAATGACAGATGCATTATTTAGAATGGATAAGTCAGTAAAAAGAAAACCTGTCGTAGCTC +CGGAAGGGTATCAACCTCACGTACCTAATGCTATAAAAGGGTTACCTAATTCTATGATGTCTCAGCAAAG +AGTTAAAGCAGAGAAGAAAGTTATTGATGTATTTTATAATTCTAGTATTAGTTGGATGGAAGACCCTGAA +AATCTTGCTTACAGAGGGGCTATAATGTTAAGTGCTATTCAAACATTAGAAACAAAAGGATATAGTATAA +ACCTTTACTTAGGTAAGTTATCAAATTCAGGATATGAAGATAAGTTAACAGGGTTTGTTGTTAATATAAA +ACATTCTTATCAAAGATTAAATGTTTTTAAATCTTCCTTTTACTTAGTAAATCCTTCATTCTTACGTAGA +ATATCTTTTAGAGTACTAGAAGTTGAACCTGATATGGTTGACCTAACTAATCATGGTTATGGTAGTGTGG +TAAGTAAAAGTAGTTATGGTAATAAATTAACAGAGCATATACTTGATAATGCTGTAATTTTTGATTCTAG +CGTAGGAATTGATATAAATAACGATTCATCTGAAAACTTAAGAGCTGTAAAAAAACTATTCGGAGGTAGG +TTGTAA +>MW460250_1_183 # 121268 # 122386 # -1 # ID=1_183;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.323 +ATGGCAAAGCAAGATACTATTGAAAGATTAGAGAGATTGGTAGAACAACAAATGGAAACAACAAAAGACT +TGGCAGACAAACTAGGAGAGAAAAACTCTAATCCGTATGAACAAGCAATTGTAGATGCAATTGTTGAGAA +AGCAGGAACTGAGAGTAGAGAAATTATTATTACTGACGTTAAAAAACAAATTGAAGAATATGTAGAAGAA +CAACTTAGTAATTTACCAGTTAAAATTGAATTACAACAAGAAGGAAAAACAATTAAAGATATCTCAGGAA +TCTTCCATTATAGATACCAAGATATACTAAAGTTAGTTAACCAAAATATTCCAGTATTTTTAAAAGGTGG +AGCAGGTTCAGGTAAGAACCATGTATTAGAACAAGTAGCAGAAGCTCTAGATTTAGATTTTTATTTCAGT 
+AATGCAATTACTCAAGAATTTAAATTAACAGGATTTATTGATGCAAATGGTAAGTTTCATGAAACCCAAT +TCTATAAAGCATTTACAAAAGGTGGGTTATTCTTCTTAGATGAAATGGATGCATCTATTCCTGAAGTACT +ATTAATTCTTAATTCAGCTATTGCAAATAAATACTTTGACTTCCCTATTGGACGTGTAACAGCTCATGAA +GATTTCAGAGTTGTGTCAGCAGGTAATACTATGGGAACAGGAGCAGACCATATTTATGTAGGTAGACAAC +AATTAGACGGAGCTACATTAGACCGCTTTGCTCAAGTTGAATTTGACTATGATACTAAGGTGGAACATCA +GTTATCAAGCAATGAAGACCTAGTTAACTTTGTACAACAATTAAGACATGAGAATGATGAAAAAGGATTA +CCTTATGTATTCTCAATGCGTGCAATTATTAATGGTAGTAAATTAGATGGAGTAATGGAAGATGAGTTTG +TTGTAGAAAGTATTATCTTTAAATCTGTACCGAAAGATGAGATTAATCAATTTATTAGCTCTTTACCTGA +GGGTAACAGATACACAGAAGCAACAAGAAAGCTTTTAGGTATGCAACAAGAGCCTAAGCAGGAACCTAGA +AAATCTGATAGTACATCAAAAGATTCAATGGACTTTGATACAATTATGGATAAATTAGGATTAGAATAG +>MW460250_1_184 # 122540 # 122866 # -1 # ID=1_184;partial=00;start_type=GTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.254 +GTGTCTAAAAGAACAGACAATTTTATATATTTCTGTAAATATTACTTTTCAGAATATTTACCTTCACTAG +GTGTAGAAGTACTTAATCATAATGAAACTTCTCATGGAACAATGGAAGGTGTTAGGAAATATTATATAGC +AAACATACTTTATGAAGGTCAAGAACTTACGGTAACTATTGATTTAGAGGAATTCAATAATGCAACTTCT +ATGCATAACATGTTAGAAATAATGAATAATCATACATATAATTGTATGTTTATGTATGATATGGATACAC +ATGAAACTAAAGATATTGATGATTTTTTTAAATTAATGTATTTTTAA +>MW460250_1_185 # 122859 # 123275 # -1 # ID=1_185;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.273 +ATGAACGCAAAAGAATTTATGAAAACACAAGCTCAAGTAGAAGATTATTTAGATAAATTAAAAGTAACAA +TTATAGAAGATGCACTATCAGTATCTAAAGAATGGTCTAATGATTCAAATGATTTAGGTTACGCTTTATC +TAGTCTTGGTGAAAGTATAGGTCTTTTAGAAGATTATTATAATATACAAGTAGATGCACATTTACCTGAA +CACTATAAAGGTAGTAAGGATGTTATTTCTTTTCTAGAAGAACATTTTTCTTATGATGGCTTTGTTGATT +CTATGATATTTAATATTGTAAAATATACTACAAGGTTAGGAAGAAAGGATGCAGTAGATAAAGAAGTACA +AAAAATTAAAACATATTATGTACGATTAGAAAGAAATATAAAGTACGGAGATAGTACTCGTGTCTAA +>MW460250_1_186 # 123409 # 123711 # -1 # ID=1_186;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.300 +ATGGAAAAAGTAGAACTTATTAAACAATGGGCAAAAGATAGAAATTTACAAACAGGTAAACCTGAAGGTC 
+AAATGTTAAAGTTATTAGAAGAAGCAGGAGAATTAGCTTCAGGTATTGCTAAAAGTAATGACCATGTAAC +AAGAGATAGTGTTGGGGATATTTTTGTAGTATTAACAGTACTATGTTTACAGTTAGATATAGATATTGAA +GAGTGTATTGATATGGCTTATGATGAGATAAAAGACAGAAAAGGTAAGCTTATTAATGGAGTGTTTGTCA +AAGAAGAAGACCTTAAAAAATAA +>MW460250_1_187 # 123711 # 123899 # -1 # ID=1_187;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.254 +ATGGAAAAATTCCAAGAAGATTATGTTAATATAGATATCAGAGTTAAAGCTTATGTTCGTGTAGGTTATA +GGTATGAAGAAGATATTACTAATAATCTACATGAATTAGTTGAAGATAATTTAAATGTAACAAGTGACTC +TGATAACCTAATTATCAAAGATACAGAAATTAAAGGAGATATAGAATAA +>MW460250_1_188 # 123943 # 124104 # -1 # ID=1_188;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.290 +ATGGTAAAACCAGTTATAACTTTAGAACCTGAGGATGTAAAAGTATTATTAGATTACCTTAGTTTCTTGG +AAGATGATATGAGAAACTATGAAGGTATGAGAGAATTATATGAAGAATTACACAAAAAGTATCAACTTGC +TAAAGGAAACTACTCAGATTAA +>MW460250_1_189 # 124104 # 126152 # -1 # ID=1_189;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.257 +ATGGCAATAACTTATAAACAAAAAGGATTAACAGAACAAGAAATTATTAATTTACCTAAGGTTAATAAAG +GATGTATCTATATAGGAGAAGAAGATGTATTTCTTAAGAAAAAGAAAAATAATATAATTAACTTAGGTTC +TAAAGAATTATTTAGAGATATTCATAATATATTTAGTTTTGATACAGCCACAGAAATACATTTATTTTTA +GCCCTATGTGGTAATAAAGAAGTAACAAACTTTGAAGGTAACCCTTATGAAACAGTTGAAAAATTAGTTG +AAGGTGTAGTTGATGATAATAAAGGAAGAAGTTACAAAGAGTATATTGAATCTAATAGAGAAGAAAGAAA +AGATTTTCCTGTATATGGTTATAAGAGTAGAAGACGTATACAATCAAAAGGTTATGTTGAAGAAAAAATT +AAAGAACTAGAAGGTAATGACCATTTATGGAGGAATGAATCTAGACAATTAGAAGAATATAAAAAAGTAG +TAGATAGTTTAAATAATGATATTATGGATGTACTAGACCAAGGTAAATATGGTCTTATAAAGTCATCTAT +TATTGTTATGAATGAAGATATAGAAAAAGGTTCTAGTGAGTACTATAGTGCTATGACAGATGAATTATAT +AGCCGTGTTTGGTACATGCATCCAAGTACTGAAAACTATTCATCTTTTGGTCTTAAAGTTAAACATATTA +GAGATAAACATAATATGGGTAACAAATGGGTTTTAGAAAATAAAAGTTCATTTGACGTTAAAACAGGAGA +AGTTAAGGTTTTCTTAACAGATAGTCTTGTTAATAAAGAAATAACTTTAAACCTATATAAAGATGATATT +AGTAAAAGTGAATATAAGAATGAATTAACTTTATCTGTTTTATTAAATGTTATTTTAAAAAACTATGCAC +AACCTAACTTAACTAGAGGAATTATAATAAAGATAATAGAACAAACATTAGAGCACCATAATTTTGATTT 
+TTCTAGTTGGTGTCCTGATAATACAGATGTTTATGGTCATATAAATTATAGAGGAGATAAATATAGGATT +TTTATAGGAGAAAATTCAACTTCTAATTACTTAATAACTTTAACAGATATTGTTAAAAATATTGATAAAA +TAAATAACTTAGAAGAATTTGGGTTATTTGAAAGAAATGCATTACTATTCCATATACCTAAAAAACCTAA +ATGGAAAGTTCATGAAGCCTTCAATCTTACAAAACAGACTTATAAAAAGTTACTAACTTTAAATAAATTT +GAGCAAGGTAATTACTTAAGATTTGCTAATATTCTCTATAAGCACTATAATCATTTACACAATGAAGTTA +ATTTACACCAATTGTTTGACGATACCTTTTTAATGGTTAGAGATTCAAGAGATGTTACAGATGCTTTAAA +AGTTAAACCTATTGTGAATCAAATATTATCTATATCTTTTGCTAATTACAAAAAGATGACGCACTATTTA +GATGTAGATGCTCAAGACAGACAACGTATAACAGGATATGCACTAGATAACTATTACTTAGATTACTTAC +ATGATTTATCAATATTAATAAGAGAAGGTTATAGAACATTAGAGAGTGTTAGTTTAACACCATTTTCACT +AAAACTAGAACATGATATAGTTACAGATGAGAAACAATCTATACAACAACAATTAGATGATGCAGAACTT +AAAGCTAAATATGATAATAAATTAGAAAAAATAATTGATAAAACTTATAAATTAAAAGATGGTAGAAAAG +TAAAATTCCTTCCTGCAGATACTGTAAGTAAACTTAAAGATGAAGGTAAAATGTTATCTCATTGTGTTGG +TGGGTATGCTAACAGAATTCTAAAAAATAGTTGTTTAATATTATTAGCAAGATTAGAAGAAGATTTAGAT +AATTCATGGTTTACAGTGGAAATACGTATTACAGATAATGGTTATGTATTAGGTCAACAACAATCAATTG +ATGCCTATAAATTACCTAATGAATTAAAAGAAGCATTAGAAAAAGATATCAAGAAAATAAATAAAGAAGA +ATTTAAGGAGGTTGCCTAA +>MW460250_1_190 # 126230 # 126493 # -1 # ID=1_190;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.258 +ATGAGCATAGAAAAGAAAGAAGAAGTTATAGCACATAATGAGGTAGTATTTAGGAGTTTAACTCAAGGTC +TATATGTAAAAGAAGTAGATATCTATTCAGATGTTGTAAGCTATACTAAAGATGTTGATGAAGCTCTTGC +TATGCCAAATACTATCAATTTTAAAAATTCAAGAAAGTATAAAAAACTTATTATGAATTTAGATTTAGAA +CCATTAAATAAAATTCAAAAAGTTATATACGAAACTCATTTAGAAGGACTTTAA +>MW460250_1_191 # 126510 # 126683 # -1 # ID=1_191;partial=00;start_type=TTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.270 +TTGAATGACCTTATCAAAGAGGGAAATAAATATTATCACAAAGTAAGAGCAGGAGAGACATTATGGACTA +TAAGTAAGAACTATGATGTGGAAATTAAGAAATTACAAGAATTAAACAATATTAAATCAGTTTCTTTAAC +TAATTTAGAATACGTACTTGTTTGTGTAGAGTAG +>MW460250_1_192 # 126690 # 127268 # -1 # ID=1_192;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.273 
+ATGGATAATTTATCACATTACTTAAGTATATTATATGCTATATTAATTACAGTAGGATATATACCAGGTT +TAGTAGCATTAGTTAAAGCAGAAAGTGTCAAGGGAGTTAGTAACTATTTTTGGTATTTAATTGTAGCTAC +AGTAGGTATAAGTTTTTACAACTTATTATTAACTGATGCTTCAGTATTTCAAATAGTATCTGTAGGTCTT +AATTTAACCTTAGGTATTGTTTGCTTATTAGTAGCTTCATATAGAAAAAAGGACTATTTCTCTATACCTT +TTATTATTGTGTTCTCACTGTTACTATTTTTATTAAGTGACTTTACAGCCTTAACACAAACTGTAGCAAC +TATTACTATTATCCTTGCGTATGTAACTCAGATAACAACCTTTTATAAAACTAAAAGTGCAGAAGGAACA +AATAGATTTCTATTTCTTATTATTGGATTAGGATTAGCTTCATTAATTGTAAGTATGGTTTTAACACATA +CCTATGTTCATATTATAGCTACTGAATTTGTAAACTTTGTTCTTATACTTATATGTTATCTACAAGCTAA +TTATTACTCAAGAGGGTAG +>MW460250_1_193 # 127261 # 127887 # -1 # ID=1_193;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.308 +ATGGGTAATAAAATTAAAGACAAAGTAATTTATATGGGTGGACATATCCTAAATGAAGCTATGGTAGATT +ACAGAGATAAACAACATAAAGAAGTAGATGGCATTGTAGGAGTAACTCCTTATAGCCCTCACAAAGATAA +GTCAATAAATGATAAAGCTAATGCAGAACAAACTAAGCTAGCAGAACGTATTTTAACTAATGACTTTAAG +GCTATGCAAGAATCAGATATTTTTGTATTTGATATCCTTAATGAAGGATTAGGAACAATTGCAGAACTCG +GTATTTTATTAGGTATGAAACATCAAGCAGAAGAAACAATTAACCATATATATGATAATGGAGAAGAGTA +TTTTAATTATTTTACAAATAAGTTTGAAACGTCATTAAATACTGAAGAAGAATTAATAGTAGATAAACTT +GAAAACATTGTAAATAAACCCGTATTAATCTACTGCTCGGATATTAGACAAGGACATGGTAAACCATACA +ATGACCCTGACAGAGCTGAATTTAGTACGAACCAATTTGTGTATGGTATGGTTTTAGAACTAACAGACGG +AGAAGGATTTATTTCATGGGAAGAAGTTATTAATAGATTAGAGAAATTAGGGGAACAAGATGGATAA +>MW460250_1_194 # 127880 # 128776 # -1 # ID=1_194;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.265 +ATGAAATCGTATACTAAAGTAAAAAATAAAGGTATTGTACTAGATAAATTTAAAGAAAGAGGTCTAGTTG +TACAAGAAAAATTAGATGGAAGTAATGCAAGCTTCACAGTAGAAAATGGTGAATTAGTATGTTTTTCACG +TAGAAAAAAATTAAATGAGAATGAAACTTTAAATGGTTTTTATGATTGGGTACATGAAAATATAAATGTA +AGAAATACGTACGTATCAGCCTTAGAAAAATACATTATTTTTGGTGAATGGTTAGTCAAACATAAGATTC +AGTACAAAGAAGAATTTTACAACAATTTTTATGTATTTGATGTTTATGATAAAGAAAATGAAGTTTATTT +ATCAGTAGAAGACATGAATGTAATTGCACATCATTTAGGGTTAAAAACAGTTAAAACTTTGCTAGTATCT +AAACCATCTCACTACTTAAATGATTTAAAACCTGAAGAAATTCAAGAATTAGTAGGAAAATCTGACATGA 
+CAGTTAAACCTGATAAGGGTGAAGGTATAGTAATTAAATACTTAGATGGTAAATCAGAATATGATGACTA +CTTTAAATTAGTATCTAATGAGTTTAAAGAATTTAGTCGTCAAAAAATGAAAACAGAAGTAAAAAAGAAC +GAGTCAGTGGCAGATTATGCCATTACAAGAGCAAGAATGGAAAAAATGATTTTTAGGGCTATAGAAGAAG +ATAGATTATCTGAAGATGATTTAGAATTAGAAAACTTCGGTCTAATTATGAAACAAGTAGGTCAAAACTT +TGTTGATGATATTATGGAAGAAGAAAAAGAAAATATACTGAAAATAGTAGATAAACAAATTAAGAAAAAA +ATGCCACATATTTTAAGAGAAATTTTAGAAGAAAAAGGAGATACTATAGATGGGTAA +>MW460250_1_195 # 128776 # 129000 # -1 # ID=1_195;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.244 +ATGAATTATTTAGCTAAGGTATTTATTAATAACAATTGGTTGGTGAAACTTATAACAATTGTATTATTAA +CTTTATTTCTAAGCGGTCTAGTTTATGTTATAAGTGCAATATCATTATTCTTATCAACAGTTTTAAATTT +ACCTGGTTTAGTAGTATTAGCATTTTTAGCAAGCGTAAGTCTTATTTTGTTTTCTATAGTACATAATTCA +AAGGAGGATAATTAA +>MW460250_1_196 # 129069 # 129809 # -1 # ID=1_196;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.306 +ATGGCAATACAACTAAAAGAGTTAGACTTTAAGTTAAAAGATTATCCTAATGTAAGATACAACATGGGAG +AACATCTAGTCTTTAATGAATTCCTTGAAAAAGCTACAACCGAGCAGTTAGATTTCTGTGAGGATTTCTT +TAATGATAATGTTGAAATACTTTGGAATGAGAGTCAAGCCGGTACAGGTAAAACAATGTGCTCAGTAGCC +TGTGCTTACGCAGACTATCTTAATAAAAATAGAAAGCTAGTATTTATAATTTCACCAGTATCAGAAGATT +TAGGAAGCAGACCAGGTAATCAGACAGAAAAAGAAATGGCTTATTTCATGGGATTACATGATGCCCTTAT +TGAACTTAATATGAATCCTGAACAACAAATAACTGAAATGTTAATGATGGAAGATAATGTTAAAGAAGAT +AAACTAGGAGATTGTTGGGTATCTCAAATATCCCATCTATTCCTAAGAGGTGGAAATCTAAGAGATGCTA +CTATAATTATAAATGAAGCACAGAACTTTAAACGTAGTGAACTTAAAAAAGTTCTTACAAGGGTTCATAC +AAAAAATTCTACTGTAATAGTAGAAGGTAATTTTAAACAAATAGATTTAAAGAACGAAAGTAAATCAGGT +TTTGGAGATTATATGGAATACTTTAAAAATTATGAAGGAGCAGTATTTCATAATTTTACAGTTAATTTCC +GCTCTAAGCTTGCACAATATGCAGATAATTTTAAATGGTAA +>MW460250_1_197 # 129861 # 130475 # -1 # ID=1_197;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.289 +ATGAAGAAAATTAATTCAGTAATTAAAGGTGAAGGTAAGAAAGTACAAACAGCAGATGTTAGGAAAATTA +GTTATTATGTTAAAGATTATAACCCTTGCATGACAGTAGATGACGCAAACGACTATAATGCAACTAGTCA 
+ATATTTGGTAAGTGACAATGGTAAATTTATTGCAAAATATAATAAAGATATGAATGCAGTAGGATTCTAT +GAAGAATCAGGGGATACTGTAAAACATTTAACACATACTACACCGGAAAGATTAGAAGGAACTGTATTCA +CTATTGAAGAAGAAACAGAGATTGATTTAATTAATGATACCTTACCTCAAGGTGATATTTTAATTAAATT +TTCAGACGGTAGTATTTATTTACCTGATAATGAATCAGTACTAGATAGTGTAAATTACTTGGCAGATAAC +GATTGGGATTCTGTGGATGATATTATTTATACAGGATTATCTAAAGGTAATAGTGAAAATTGTATTGTAG +ATTTTAATTATAATAATTATGATATTGGTTATGATGATGTAGAAGATGAAGATGTTTGTGATAACTACCC +TGAATGTGAATGTAGTAATTATTGCTCTTCAACAGGAGAATATATCGGAAATTAG +>MW460250_1_198 # 130491 # 130916 # -1 # ID=1_198;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.298 +ATGCAGGATAGTGTAAATATATACACAGACGGTAGCTCCTCATATAATAAAGGTAAAGTAGGCTCAGGTG +CTGTCTTGGTAAGTAAAGAAGGAAATATAATATCGGAAATTAGTAAAAGTGTTGACAAACCAGGATTAAT +AAAGTATAATAATGTTGCAGGTGAAATATTGGCTTGTTGTTATGGTATTGAAGAGGCTATAAAACTAGGA +TACAATCAGGCAATAGTTTATATAGATTATATTGGTTTAATACATTGGTATGAAGGTACTTGGTCTGCAA +GAAATATTCTAAGTAAAACATATATTAATATGATACGAGAATACCAAAAAGTAATAGATATAAACTTTGT +AAAAGTAAAGAGCCATTCAAATGACAAATGGAATGACTATGCCGATAATCTTGCAAAAAAATCAATTGAT +ATATAA +>MW460250_1_199 # 130906 # 131097 # -1 # ID=1_199;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.281 +ATGAAAAAAGGAGTATTTACAGTAATAGCTGATGGTTTTAAATTTAATGTGATTGCTAAAGATAAAAAAG +AAGTTCAAGAACACTGTTTCAAATGTTTTGATTTTAACTATATCTCAGTATCTTTTTGCAGAGAAGTCTA +TTCAGATTGTGAATTCCCTCAATTTATGGAGGATTATAAGTATGCAGGATAG +>MW460250_1_200 # 131120 # 131761 # -1 # ID=1_200;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.255 +ATGGAGAATAATAATTTAGTTAATTTTTTAATGACTACAGATGATATAGATGATACTATTGAAATGGTAG +ATTCATTTGAATTACAAGATATAAACAAAGTTCTAGGTGAAGATACGTTTTTAACAATAATGGAAATTAC +AGACAGTCTTCCTGATAACCAATATAAAATAGTATTGTTGTCCTCTTTAGACAAGTTATTGAATACAGAT +AGAAAAGAATTAGTAGAATATGATGAAGAATTTCCTACTATACGAAAACATAATGTATCTGAGCTAAAAA +GAGATACAGTTAACTCTGTAATTGATAGTTATATGAATACTAATGTAGAAATACTTTATACAGAGTATCC +TACTATTAGTAACTACAGTGTAGTTGTAGATTCTGTTAAAGTGTTAAATACTTTATATTTAATTGAAAGT 
+AAAAATGGTAAAATAGAAGCAACACTGTCAGAAGATGGAGAAGACTTACATGAGTATATATCAGAAGAGG +GTTACAGTGTTACAGACATATTAAATAAATTTGATGATGTTGAAGATTTATTTGATGAGGATGACAGTTT +AATTAATTTCTTTTCAGATATTGATGAAGGTAAAAATAAAACTATTAAATCATTTATTGAGTTAGTTATT +AATTTAAAATAA +>MW460250_1_201 # 131751 # 131981 # -1 # ID=1_201;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.329 +ATGGATGAAAAAAAGGAAAGTAAACCTCTAAACCTTCAAAAAATTAGAGTAGAAAAAGGACATACGTTAA +GAAGTCTAGCTTCTGAGATAGGTGTTCATTACTCTCTTATATCTTATTGGGAGTATGGAAAAAAGAAACC +AAGAAGTGCTAATTTAATGCGGTTAGAGAAAGCGTTAAATACTCCAGGAAAAGAGTTATTTAAAGAATTG +GAGGAAGACGATGGAGAATAA +>MW460250_1_202 # 131984 # 132211 # -1 # ID=1_202;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.263 +ATGAATAAATTTAAAAGATGGTTTCGAATTAATGTTTTAAAAAAAGAAACACTACTTTTTAAAGTTTATT +GGAGATACGAGTCACCGTCTTTAAAAAAACCTCATGTATTTCATATAGAGTTATATGCTAAAAGTAAGGC +AGAGGCAAGAAATAAATCACAAGAGTATATACTAAAAAATGCAAAAGCATCTGAGGATTTTAAATTTTTA +AAAGTAGAGGAGAAATAA +>MW460250_1_203 # 132321 # 133013 # -1 # ID=1_203;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.348 +ATGAAGAAAACAATTTTTGCAACATTAGCATTAGGTACAGCTATTACTTTTGGAGGTATTGCTACAAATG +AAGCTAGTGCAGACGAAATTGATTATAATAAGTTAGCAGAACAAGCTAAATCAAATTCAGCAGAAGTAAA +TACAAAACCAATTCAAGCAGGTAATTATGATTTCTCTTTTAGTGATGGTGAATTTACTTATCATTTCTAT +AATTATAATGGTAACTTTGGTTATGAATACCATTCAGGCTCAACTCAAGTAGATAATACAGTATCTAGAT +TAGCAGGAGAAGAACAAACACCTGAACAAAAAGTAGACCAACAACAAGCACAATTTGATACTCAAAATAA +ACAAGATACTAAAAAAGAAGTACAAACAACATCAGCACCAGTTCAAAAGGAAACTAAACAACCTACACAA +TCAACTAGTTCTACAGGAGGCTCTGTAGCAGAACAGATTAGACAAGCAGGTGGAGACGAGGCAATGATTG +AAATTGCTATGCGTGAATCTACAATGAACCCTAATGCTGTTAATGCATCATCAGGAGCTCAAGGATTATT +CCAAGGATTAGGTAAATCATGGAGTGGTGGTTCTATAGCAGAACAAACTAAAGGTGCAAAACAATATATG +ATTGACCGTTACGGTTCAACATCAGGAGCTCTTGCATACCACAACGCACATAATAGTTATTAA +>MW460250_1_204 # 133200 # 133835 # -1 # ID=1_204;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.280 +ATGATTGGAGAAACTATTAATAAACTAAAAGTTATAAAAGAGTCTAGTAAGAGGGATAAATCAAGATGTA 
+AAATGTATGAATGTTTATGTGAATGCGGAGAAGTAATAATAGTTAGAAGCTCCACTTTAAGGCAAGGTAA +AATAAAATCTTGTGGGTGTGAGAGTAATAAAATTCATAGTGAGTTAATGAGAGAAAGGAATACTACCCAT +GGATTATCCAGTAACCCAATGTACCAAAGATGGTTAGGTATGAAACAGCGTTGTTACGATGTTAACGCTA +TAAATTATAAAAACTATGGAGGAAGGGGTATAGAAATATGTGAAGAATGGAAGAATGATTTTAAAAAATT +CTACGACTATATGGGAGACCCTCCGAATGAAAATTATCAAATAGATAGAATAAATAATGATGGTAATTAT +GAACCTGGTAATGTTAAGTGGTCTACTAGGTCTGAAAACTCAACTAATATAAGAAAGAAAAGTACACATA +ATATATACAAAAAATCAAATAATGTATATAATATACAAATTGTAAGAAAAAATAAAGTTAAATATTTTAG +TGCAAAATCTTTAGAAGAGGCTATAGAATTAAGAGATAATGTAATAAATAAATATAATGAAACTGGAGAG +TGGTAA +>MW460250_1_205 # 133902 # 134693 # -1 # ID=1_205;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.298 +ATGAGAAAGTCAGTAGTTATTTCAGGAGTATTAGGGTTTTTAGCAATTATAGGGTTTATTATTTTATTAA +TGTGTATTACTAAGATTCCACAAGGTCATGTTGGAGTTGTATACTCAGTAAATGGTGTTAAGGAAGATAC +TAAATCACCGGGTTGGCATTTAACAGCACCTTTTGATAAGGTAAATAAATACCCAACTAAAACACAAACA +CATAAATATAAAGATTTAAATGTAGCAACTTCAGATGGTAAAAATATTAAATTAGACATTGATGTATCTT +ATAAAGTAGATGCAACTAAGGCAGTAAACCTTTTTAATAGATTCGGAAGTGCTGACATAGAAGAACTTGA +AAAAGGGTATCTTCGTTCTAGAGTACAAGATAATGTTAGACAAGCAATTTCTAAGTACTCTGTAATTGAT +GCTTTTGGTGTAAAAACAGGAGAAATTAAACAAGATACTTTAAATAAACTTAATGATAATTTAGAAAAGC +AAGGTTTTATTATTGATGATATTGCATTATCAAGCCCAACTGCAGATAAAAATACTCAAAAAGCAATTGA +TGAGAGAGTAAAAGCAAACCAAGAACTAGAGCGCACTAAAGTTGATAAGCAAATAGCAGAAGAAAACGCT +AAAAAGAAAGAAATTGAAGCAAAAGGTGAAAAGAAAGCCAATGACATTAGAAGTGAATCCTTAACAGAGG +AAGTTTTACAACAACAATTAATTGAAAAATGGAATGGAAAACAACCTATTAGTATAGGTTCAGATAGTGT +TATTACAAACTTAAATAAATAA +>MW460250_1_206 # 134693 # 135001 # -1 # ID=1_206;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.275 +ATGGCACTACTTTTAACATATTTTGCTATTTTTATTGTTTTTCTAGTCCTTGTAGGTTTTGGTATAAGTT +ATTTATTTGATTTTCTTTCAATGAAAGAGAAGAAGAGTAACATAAGAAAACAATACAGGGAATTAGTTAG +GCAAGGTACATTAGATGAATACGGTTTAGAACAATATGTAAAGTATAAAAAACAATTCTTAAATGACCGT +AGACAATCAATTGTAACTAGAGCCGATAAACAAGAAATAGACCAAGAGGAAAAAGCTTTAAATAGCTTAA +TAAAAGAAATAGAAAAAGGAGAAATGTAA +>MW460250_1_207 # 135114 # 
135743 # -1 # ID=1_207;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.379 +ATGAGTGCTAGTGATGCTCAATTCCTTAAAAATGAACAAGCAGTATTCCAATTTACAGCAGAGAAATTTA +AAGAATGGGGTCTTACTCCTAACCGTAAAACTGTAAGATTGCATATGGAATTTGTACCAACTGCCTGTCC +TCACCGTTCTATGGTTCTTCATACAGGATTTAATCCAGTAACACAAGGAAGACCATCACAAGCAATAATG +AATAAATTAAAAGATTATTTCATTAAACAAATTAAAAACTACATGGATAAAGGAACTTCAAGTTCTACAG +TAGTTAAAGATGGTAAAACAAGTAGCGCAAGTACACCGGCAACTAGACCAGTTACAGGTTCTTGGAAAAA +GAACCAGTACGGAACTTGGTATAAACCGGAAAATGCAACATTTGTCAATGGTAACCAACCTATAGTAACT +AGAATAGGTTCTCCATTCTTAAATGCTCCAGTAGGCGGTAACTTACCGGCAGGGGCTACAATTGTATATG +ACGAAGTTTGTATCCAAGCAGGTCACATTTGGATAGGTTATAATGCTTACAACGGTAACAGAGTATATTG +CCCTGTTAGAACTTGTCAAGGTGTTCCACCTAATCAAATACCTGGCGTTGCCTGGGGAGTATTCAAATAG +>MW460250_1_208 # 136014 # 136514 # -1 # ID=1_208;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.283 +ATGGAAAAGAAATTAAATGAAATACCTGGATTAGAAATATATGAAAATTACACTATTACTGATAAAGGAG +AAGTAATATCTTATAAAGGTAAAGAGCCTAAAAAGTTAAAACTTCAAAAAAATAACAAGGGTTACTTGTT +TGTAAGGTTACGATACCATTCACCTAAAATACATCGTTTAGTTGCTATGGCTTTTATACCTAATCCTGAT +AATAAAGAACAAGTTAACCATTTAAATGGTAAAAATGATAATAGTGTAGGAAATTTAGAATGGGTTTCTA +ATTCCGAGAACAGAGAACATGCAATAAAGACAGGATTAAAAAATGAAATAAATTATAATATAGCTCAGTA +TGACTTAGAAGGTAATTTATTGAATGTCTTTTACACAGCTCAAGAGGCTTTAGAGTTCTTAGGTATTTCT +AATAAAAGAAGTGGTAATATAGGAAGATGTATCAAAGGAGAGAGAAAAACAGCCTACGGATACATTTGGA +AACAATATTAA +>MW460250_1_209 # 136674 # 137477 # -1 # ID=1_209;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.353 +ATGGCTAAGACTCAAGCAGAAATAAATAAACGTTTAGATGCTTATGCAAAAGGAACAGTAGATAGCCCTT +ACAGAGTTAAAAAAGCTACAAGTTATGACCCATCATTTGGTGTAATGGAAGCAGGAGCCATTGATGCAGA +TGGTTACTATCACGCTCAGTGTCAAGACCTTATTACAGACTATGTTTTATGGTTAACAGATAATAAAGTT +AGAACTTGGGGTAATGCTAAAGACCAAATTAAACAGAGTTATGGTACTGGATTTAAAATACATGAAAATA +AACCTTCTACTGTACCTAAAAAAGGTTGGATTGCGGTATTTACATCCGGTAGTTATGAACAGTGGGGTCA +CATAGGTATTGTATATGATGGAGGTAATACTTCTACATTTACTATTTTAGAGCAAAACTGGAATGGTTAT 
+GCTAATAAAAAACCTACAAAACGTGTAGATAATTATTACGGATTAACTCACTTCATTGAAATACCTGTAA +AAGCAGGAACTACTGTTAAAAAAGAAACAGCTAAGAAAAGCGCAAGTAAAACGCCTGCACCTAAAAAGAA +AGCAACACTAAAAGTTTCTAAGAATCACATTAACTATACAATGGATAAACGTGGTAAAAAACCTGAAGGA +ATGGTAATACACAACGATGCAGGTCGTTCTTCAGGACAACAATACGAGAATTCATTAGCTAATGCAGGTT +ATGCTAGATACGCTAATGGTATTGCTCATTACTACGGCTCTGAAGGTTATGTATGGGAAGCAATAGATGC +TAAGAATCAAATTGCTTGGCACACGGGTAAATAA +>MW460250_1_210 # 137477 # 137980 # -1 # ID=1_210;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=11-12bp;gc_cont=0.349 +ATGGCTAATGAAACTAAACAACCTAAAGTTGTTGGAGGAATAAACCTTAGCACAAGAACTAAGAGCAAAA +CATTTTGGGTAGCAATTATATCAGCAGTAGCATTATTTGCTAACCAAATTATAGGTGCTTTCGGTTTAGA +CTACTCAGCTCAAATTGAGCAAGGTGTAAATATTGTAGGTTCTATACTAACACTATTAGCAGGTTTAGGT +ATTATTGTTGATAATAATACTAAAGGTCTTAAAGATAGTGATATTGTTCAAACAGACTATCTTAAACCTC +GTGATAGTAAAGACCCTAATGAATTCGTTCAATGGCAAGCAAATGCAAATAACACTAGTACTTTTGAGAT +AGACAGCTACGAAAACAATGCAGAACCTGACACAGATGATAGTGATGAAGTACCTGCTATTGAAGATGAA +ATTGATGGTGGTTCAGCACCTTCTCAAGATGAAGAAGATACCGAGGAACATGGTAAAGTATTTGCAGAGG +AGGAAGTTAAGTAA +>MW460250_1_211 # 138065 # 138250 # -1 # ID=1_211;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.258 +ATGGCATCAGCAAAACAATTATATTATACGGAAAGTTTAGTTGGTAAGGCAATTATTAATAATAAAGTAT +CTAATAAAGAAGAAGTTTGGGATAAGTTAGAGCTACTTCCTGAAACTAAATTAGAAGATTTAGATAACAA +ACAAATGTCTGAAGTTATCAAAAAACTAAACCAAATTAATGAGTAA +>MW460250_1_212 # 139797 # 140015 # -1 # ID=1_212;partial=00;start_type=ATG;rbs_motif=AGGA/GGAG/GAGG;rbs_spacer=11-12bp;gc_cont=0.274 +ATGAAAAGACAAAAAATGTTTTACTCAAGTTTAATATGTAAAGAATGTGGAAATGTATTCAAAGTACCAA +GAAAAAGAGCAAATAAAAGAGAAGAAGGTCATATCAAAGATATATATTGTATCAAATGTTGTAAAACTAC +AAAACATATAGAAGATAATCGTAGTGAAGCAGAAAGAAGATGGGATGCTATTCAGGAGGAACTAACAAAA +GATAACTAA diff --git a/tests/test_data/overall/Standard_examples/SAOMS1_Output/prodigal_out_aas_tmp.fasta b/tests/test_data/overall/Standard_examples/SAOMS1_Output/prodigal_out_aas_tmp.fasta new file mode 100644 index 0000000..e3442c4 --- /dev/null +++ 
b/tests/test_data/overall/Standard_examples/SAOMS1_Output/prodigal_out_aas_tmp.fasta @@ -0,0 +1,1013 @@ +>MW460250_1_1 # 1 # 159 # -1 # ID=1_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.403 +MRIETKKSKLSRPGQKAEADLLSDYMVGKEDDPILLNGIDLEHSSWIGDGGVG +>MW460250_1_2 # 183 # 392 # -1 # ID=1_2;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.267 +MSKHIEITMSSGAKYFLVSTDEKSYNRQDIDYMLRGMDETSIKVYTESAITSPQVYINPN +RIESFKIVF* +>MW460250_1_3 # 405 # 737 # -1 # ID=1_3;partial=00;start_type=TTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.306 +MDKEINNLVSQVETIKSKIQEGNYIDRGTFKDLEVEVAELRKMIVSIDKDVAVNSEKQSA +IYVQLERLDEKISELAESTKTKDTEKKDTTEKVLLLVLGAILSFVFNKFA* +>MW460250_1_4 # 750 # 1076 # -1 # ID=1_4;partial=00;start_type=TTG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.306 +MIKYKDILKLEFKDALAHFKRDRRYFHVYRIDRVLINGSIIYFDYYYLPSDDPNIVIKEL +DLQSFGKLRFEIDTKTSYGKVVTDNYMEIINDFLENYDIHSESETVRP* +>MW460250_1_5 # 1516 # 1902 # 1 # ID=1_5;partial=00;start_type=TTG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.284 +MNNNIAIFIFKTLVIIIFLLLILSVINSLSLIYSIRPSVVMTYFIFGGIVSNVALTVTDK +FLLKKEDPLPEYVLKKVEINDKEIRIIKKIIESNYGITAEEIKVRAKAQRRVEEDSKKED +YNENKERN* +>MW460250_1_6 # 1880 # 2182 # 1 # ID=1_6;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.290 +MKTKKEIKEQRKELKDGATSVSLVKKGDKRIASPSRICSLCGQQLSGMNYTKGKALSKVN +HFHLQYSKYIYFDICADINNCYKNLRKRGEMDWVQKILEI* +>MW460250_1_7 # 2155 # 2565 # 1 # ID=1_7;partial=00;start_type=TTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.299 +MSAENIRDIINKKKLEEEDTRKYIADGFMNGIGKLMYEFNKKVDNKEIEVKDPNDLYKLF +VIFSQMQNMVNETSEGGAIPQLSRPQQELFDEITTEDSNGESTVDLQKISEMSAEDITAM +ISEKEKVMNEENSETF* +>MW460250_1_8 # 2580 # 2777 # 1 # ID=1_8;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.323 +MDGKELIKIAQETFQTEKITREQIDHIINMLNPSTYMLKYHTLRGHPITFSIPNRDRSKA +QAHRP* +>MW460250_1_9 # 3071 # 4042 # 1 # ID=1_9;partial=00;start_type=TTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.256 
+MTLEKRKQEYLKKLKQIKNDEFELLGGFTKTREKALFKHKVCGYEWYTTPYNLLKSKGTG +CPKCQYRDKSYTTDEFKKKLKDKFGYEYELIEGQEYKNSREKLLFIHNKCGTEFKITSDS +LFRSKVPCHKCSKENRKTKKKTTEQFKNELYNKHKDEYILVEGSEYKTALEKVRIIHTKC +GYTWDVRASHILHTSKCPNCNESKGESLIKDILEDNNFSYIREYTFEDLKNVKKLPFDFA +LFIDNELVGLIEYDGSQHFIPFEHFGGKEKLRKTQYNDRKKNEYCDKNRIPLKRIKYDLD +EKEVIREIEMFLNSIVKSKAESY* +>MW460250_1_10 # 4183 # 5730 # 1 # ID=1_10;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.326 +MGVMEMVHFADMHSYANAKCLYTFPTNEQMKKFVQSRLNPVLEKEYFRDIVDWDKDSLGF +KKIRNSSLFFRTSSKASTVEGVDIDYLSLDEYDRVNLLAESSALESMSSSPFKIVRRWST +PSVPGMGIHKLYQQSDQWYYGHRCQHCDYLNEMSYNDYNPDNLEESGNMLCVNPEGVDEQ +AKTVQNGSYQFVCQKCGKPLDRWYNGEWHCKYPERTKGNKGVRGYLITQMNAVWISADEL +KEKEMNTESKQAFYNYILGYPFEDVKLRVNEEDVYGNKSPIAETQLMKRDRYSHIAIGID +WGNTHWITVHGMLPNGKVDLIRLFSVKKMTRPDLVEADLEKIIWEISKYDPDIIIADNGD +SGNNVLKLINHFGKDKVFGCTYKSSPKSTGQLRPEFNENNNRVTVDKLMQNKRYVQALKT +KDISVYSTVDDDLKTFLKHWQNVVIMDEEDEKTGEMYQVIKRKGDDHYAQASVYAYIGLT +RIKELLKEGNGTSFGSTFVSTDYNQEGNKQFYFDE* +>MW460250_1_11 # 5723 # 6544 # 1 # ID=1_11;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.279 +MNRGEIDLTDKLFYGTISNEEINKSVLNLLLGEELSLDYVSKNSDILDVKYEHVYKSLGF +DNFFDCFLYANREPEIVHKGGDKNLGGLNKVKRTVIRNGKEMEMTVYEDGNKENDSKEKQ +EGKEEVSRSAVGARAISNGEEGKVNPKKVANSLSNLSKKGVDVSHINTNSSLYKEFVDDN +GDTIGITSFKRTENDIILESYASSPDSDGVGARAIMELLRLSIKENKNAVVYDIELPEAI +EYLKTLGFKPNKDGYILRKKDVKQFLGDYSDFI* +>MW460250_1_12 # 6701 # 7180 # 1 # ID=1_12;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.292 +MTLEENKLTLEESITPLSKEEKEDSIKEFSSLLCEMVNRLYKSYNVFRQDPMDETQRLDG +SLMVFQSRLNDPLTGDLHDKMYKLAFSKRIDIFEANKQFRKDVEAGKAIELGDVAIIDTA +LSNILSGNEFQGSISFMLRKDFEEKERIRKEEEEKLNNL* +>MW460250_1_13 # 7222 # 8415 # 1 # ID=1_13;partial=00;start_type=TTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.353 +MKKKPQGNEVIITIITVMIAVFVVIMTIFFNKYQDAKEDKDRYQRLVEIYKKADDNDGET +KKKYVKRLNKAEEELKKVKKETNYKDYNKKSSKERQKEDKETREKIYDVTGDDDLILVKN +NIEFSDKVDKPEILISEDGIGTITVPVDSGYEKQTVGSIITSVLGSPFLSPGSNSIDGLS 
+VINDNVYPNTVDSIVEDTKPSINLPTDNPIITNPVEPTIPSDIIPPIDNPSVPISPENPG +DNNQGNTDNPNPPPPGYTDEDGGRGSGGGGNSEPPSTEEPSDNGNTGGGDWEEKPDPGEE +PSDNGNTGGNGGEVTPEPEPEPEPEPEPEPEPEPSEPSDNPDENGGWETEPTEPESPSEP +DDKVDEEDKNEDTTDDKQSTEQPDDNNIDNEDKTEEE* +>MW460250_1_14 # 8558 # 8842 # 1 # ID=1_14;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.228 +MIFVHSKFSSKNVFVLYVIYAIIGIGTYIVLTMFQTTSVLIKNDVIDSIENTEHYIVFND +PIIIFIISFIGAILGGIWYKMMKIIKKSNFKDKK* +>MW460250_1_15 # 8851 # 9231 # 1 # ID=1_15;partial=00;start_type=GTG;rbs_motif=None;rbs_spacer=None;gc_cont=0.281 +MNRLIFSKDKKWDEAKDFIKGQGMQDNWIEIVDYYRQIGGKHVAVFIALNKVKYMILEAT +KDNKVILVDKDNNILLEDYDIVMESKKMFYYIEEPFEVKINIPQHIRDVTYNNTVVLTTV +RGSRGD* +>MW460250_1_16 # 9235 # 10926 # 1 # ID=1_16;partial=00;start_type=TTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.315 +MADLFKQFRLGKDYGNNSTIAQVPIDEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEP +FIEMMDTNPEFRDKRSYMKNEHNLHDILKKFGNNPILNAIILTRSNQVAMYCQPARYSEK +GLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQ +VNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKIIKGGKRFVQVVDKRVVASFTSREL +AMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQQQ +SQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTANDMQFEKWLNYLINIISA +LYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNRHIIS +EYGDKYTFQFVGGDTKSATDKLNILKLETQIFKTVNEAREEQGKKPIEGGDIILDASFLQ +GTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSSNDDKEIGTDAQIKGDDN +VYRTQTSNKGQGRKGEKSSDFKH* +>MW460250_1_17 # 11120 # 11893 # 1 # ID=1_17;partial=00;start_type=TTG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.323 +MEEIKFNAFVPMDLKKSVSTASDTNEYSIVSGWASTPSMDLQNDIVNPKGIDIEYFKSQG +YINYEHQSDKVVGIPTENCYVDIEKGLFIEAKLWKNDENVVKMLDLAEKLEKSGSGRRLG +FSIEGAVKKRNINDNRVIDEVMITGVALVKNPANPEATWESFMKSFLTGHGTSPDTQVDA +GALRKEEIASSITNLAYVTKIKDLKEFNDVWNGVVEDLSKSNSMGYEESVLTLQLAKGLS +RKDAELAVMDINKQKLE* +>MW460250_1_18 # 11912 # 12868 # 1 # ID=1_18;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.322 +MSKEMQNILEEYDKLNAQEAVSKSVEDDEKNTVESTEEQVAETTEEPAKEPEKVSEEDAK 
+EAQEQGEKVESEEVAEGNEDEEVEKSAKESKDPVDQKDTKTENKDNEKRKNKKDKKEDSD +SDDEDKDTDDDKDKKEDKKEKTSKSISDEDITTVFKSILTSFENLNKEKENFATKEDLSE +VSKSINELSAKISEIQAEDVSKSVDTDEEAVEKSVTSTNGEQEKVEGYVSKSVDTEEQAE +TGEAKSEEAEEVQEDNTFKGLSQEERTKFMDSYKAQAKDPRASKHDLQSAYQSYLNINTD +PTNASEKDIKTVKDFAQI* +>MW460250_1_19 # 12984 # 14375 # 1 # ID=1_19;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=11-12bp;gc_cont=0.372 +MTIEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNE +DLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSD +TKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAK +LIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQ +DNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQ +KGAFENEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSINVNAMYQQQPQFV +SIYRQGKETGMYFLIKRVPVKDAQEDGTIVFVDKNETLPETADVFVGEMSPQVVHLFELL +PMMKLPLAQINASITFAVLWYGALALRAPKKWARIKNVRYIAV* +>MW460250_1_20 # 14467 # 14763 # 1 # ID=1_20;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.310 +MLYYKKLLDKKMATVYGTVEIDKDGVVKGLTKEQEKEFANVPGFEFEEEKKTTRKQSAST +SKEEEPKEEEKKASTRKTTNTTRKSTARKTTAKKDENK* +>MW460250_1_21 # 14776 # 15684 # 1 # ID=1_21;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.355 +MVNSMFGGDLDPYEKSLNYEYPYHPSGNPKHIDVSEIDNLTLADYGWSPDAVKAYMFGIV +VQNPDTGQPMGDEFYNHILERAVGKAERALDISILPDTQHEMRDYHETEFNSYMFVHAYR +KPILQVENLQLQFNGRPIYKYPANWWKVEHLAGHVQLFPTALMQTGQSMSYDAVFNGYPQ +LAGVYPPSGATFAPQMIRLEYVSGMLPRKKAGRNKPWEMPPELEQLVIKYALKEIYQVWG +NLIIGAGIANKTLEVDGITETIGTTQSAMYGGASAQILQINEDIKELLDGLRAYFGYNMI +GL* +>MW460250_1_22 # 15698 # 16576 # 1 # ID=1_22;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.319 +MEKPYMIGANSNPNVINKSTTYTTTTQADEQDKPKYTTRLEFDTIDMIRFINDRGIKVLW +EEAYFCPCLNPDTGHPRVDCPRCHGKGIAYLPPKETIMAIQSQEKGTNQLDIGILDTGTA +IGTTQLEKRISYRDRFTVPEVLMPQQMIYFVNKDRIKKGIPLYYDVKEITYIATQDGTVY +EEDYEIKNNRLYLNEKYENHTVTLKILMTLRYVVSDILKESRYQYTKFNQPKSKFENLPQ +KLLLKREDVIVLQDPYKVNDGIEEDLEIQVDDPKASASNPSNLGGFFGGAFK* +>MW460250_1_23 # 16576 # 
17196 # 1 # ID=1_23;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.320 +MPVHGKRPNLFKNKNYKQVGKRTIDGMRSEVLDKLQATAQQVENTSIKRMPTYLQITEKK +LEKEGVVDLKKAFAHSSKKKTSKDGGWYLTVPIRIKTSRMNNSTYQDMRTLKVDKGTGSV +SKITDYLEGRRKNVSHPSMKPEPMTHNMTKVKRGKQSSYFIFRTVSSKSPASSWILNRDK +VNEDNFSKTTLKTVKQLMNWKMKNLN* +>MW460250_1_24 # 17215 # 18051 # 1 # ID=1_24;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=11-12bp;gc_cont=0.302 +MAITSVDSYLLSEIKPRLNTVLENCYIIDEVLKDFDYQTRESFKEAFCGKNAQHEVTVGF +NFPKFKNNYEAHYLIQLGQGQETKNSLGSIQSSYFEATGDTLVESSTAIREDDKLVFTVS +KPIGELIKVEDIEFAKYDNLQVEGNKVSFKYQTNEDYENYNANIIFTEKKNDSKGLVKGF +TVEEQVTVVGLSFNVDVARCLDAVLKMILISMRDSIEEQQTFQLQNLSFGDIAPIIEDGD +SMIFGRPTIIKYTSSLDLDYTITQDINKLTFKERKDWK* +>MW460250_1_25 # 18053 # 18268 # 1 # ID=1_25;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.306 +MARKKTPENNTPKFNGYVHIDTFLDTAKTLFNMRDSQVAGFKAYMEGSHYLFSEQEFLPS +LEKYLGRKLDI* +>MW460250_1_26 # 18295 # 20058 # 1 # ID=1_26;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.362 +MAVEPFPRRPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGGEPNTVYELRNYSQAK +RLFRSGELLDAIELAWGSNPNYTAGRILAMRIEDAKPASAEIGGLKITSKIYGNVANNIQ +VGLEKNTLSDSLRLRVIFQDDRFNEVYDNIGNIFTIKYKGEEANATFSVEHDEETQKASR +LVLKVGDQEVKSYDLTGGAYDYTNAIITDINQLPDFEAKLSPFGDKNLESSKLDKIENAN +IKDKAVYVKAVFGDLEKQTAYNGIVSFEQLNAEGEVPSNVEVEAGEESATVTATSPIKTI +EPFELTKLKGGTNGEPPATWADKLDKFAHEGGYYIVPLSSKQSVHAEVASFVKERSDAGE +PMRAIVGGGFNESKEQLFGRQASLSNPRVSLVANSGTFVMDDGRKNHVPAYMVAVALGGL +ASGLEIGESITFKPLRVSSLDQIYESIDLDELNENGIISIEFVRNRTNTFFRIVDDVTTF +NDKSDPVKAEMAVGEANDFLVSELKVQLEEQFIGTRTINTSASIIKDFIQSYLGRKKRDN +EIQDFPAEDVQVIVEGNEARISMTVYPIRSFKKISVSLVYKQQTLQA* +>MW460250_1_27 # 20131 # 20559 # 1 # ID=1_27;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.354 +MASEAKQTVHTGNTVLLMIKGKPVGRAQSASGQREYGTTGVYEIGSIMPQEHVYLRYEGT +ITVERLRMKKENFADLGYASLGEEILKKDIIDILVVDNLTKQVIISYHGCSANNYNETWQ +TNEIVTEEIEFSYLTASDKART* +>MW460250_1_28 # 20656 # 20796 # 1 # 
ID=1_28;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.241 +MANKRKTIGKMSNTRATWNINPVTKVKKDKTKYSRKNKHKGLDNYN* +>MW460250_1_29 # 20839 # 21297 # 1 # ID=1_29;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.272 +MSTFWSERRTTNKDRQVKKHYTQMSMYERKKCVELLQETITENRIINFTRHSAKKVKGKP +TTNIPKLIGFIFKNKFAYENIIEYNNTDYNGNIERRIVVKHPKVITVEGKPSYQFLTISL +EDARVITVWYNSVDDTHRTLDLNYYSKDLTIQ* +>MW460250_1_30 # 21310 # 21504 # 1 # ID=1_30;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.246 +MGITIVNSYFILSSIFLIILTILNGKGTVTRESLTMSKILVVITSIQFLACLIINGIYWS +LKFM* +>MW460250_1_31 # 21586 # 21897 # 1 # ID=1_31;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.260 +MSQDKLRAIYTEMKVELHKFPKEVDITSKSTAIAINQILDKFKTLTEQAGKITRKYLEGQ +EILTIDYEYYDSLQEYYIYLLRNSEKIEQSLQEITKRTGEYVK* +>MW460250_1_32 # 22029 # 22487 # 1 # ID=1_32;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=11-12bp;gc_cont=0.305 +MAEEIKKEQDVQETTKEEKKDVSKMTPEEIDKLKYQDKQEKEQVINKVIKGVNDTWEKEY +NFEELDLRFKVKIKLPNAREQGNIFALRSAYLGGMDMYQTDQVIRAYQMLATLQEVGIEV +PKEFQDPDDIYNLYPLTVMYEDWLGFLNSFRY* +>MW460250_1_33 # 22531 # 23067 # 1 # ID=1_33;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.337 +MESIVKQPLSRNLWAIMKEFNVLPTEQRFKDLDDYQIEFIIGNMNRDVYEHNKQLKQAQK +GGKFDSQFEDDDSSWWNESHEDFDPVPDFLDADDLAQQMEAKLSDRDKEERAKRNDAELN +DETEGLTTQHLAMMEYIRQKQQELDDEVGNGKTSEDDATISQDSVNKALEDLDDDWYM* +>MW460250_1_34 # 23123 # 27178 # 1 # ID=1_34;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=11-12bp;gc_cont=0.372 +MAMNDDYRLVLSGDSSDLENSLKAIELYMDSLESKNIDAPLDNFLKKLKVIAKEVKNVQN +AMDKQDGKSVISSKDMDESIKSTQSATKNINELKKALDDLQKENISKGIAPDPEVEKAYA +KMGKVVDETQEKLEKMSSQKIGSDASIQNRIKEMKTLNQVTEEYNKISKDSSATKDYTKR +LRANRNMTRGYMERSEGTGRLTYDQGARVRSELGKISSYESQRKQNQRNLGQAREQYSNY +RNQQQDLTKRRASGQINKEQYEQELASIKQEMKAREELISNYEKLGAELDKTVQYYKGSV +QKDFQSRDVDQQRGTFGRMVQERLPSIGSHAMMGTTAMATGLYMKGASLSETNRPMVTSL +GQNSDNMDIDSVRNAYGDLSIDNKLGYNSTDMLKMATSYEASVGHKSDEDTMAGTKQLAI 
+GGRSLGIKDQEAYQESMGQIMHTGGVNSDNMKEMQDAFLGGIKQSGMVGRQDEQLKALGS +IAEQSGEGRTLTKDQMSNLTAMQSTFAESGSKGLQGEQGANAINSIDQGLKNGMNSSYAR +IAMGWGTQYQGLEGGYDLQKRMDEGISNPENLTDMADIATQMGGSEKEQKYLFNRSMKEI +GANLTMEQSDEIFKDAQSGKLSKEELAKKAKKMEKEGKKEGEDNATDYKESKSGKNDQNK +SKTDDKAEDTYDMAQPLRDAHSALAGLPAPIYLAIGAIGAFTASLIASASQFGAGHLIGK +GAKGLRNKFGRNKGGSSGGNPMAGGMPSGGGSPKGGGSPKGGGTRSTGGKILDSAKGLGG +FLVGGAGWKGMFGGESKGKGFKQTSKEAWSGTRKVFNRDNGRKAMDKSKDIAKGTGSGLK +DIYNDSIFGKERRQNLGEKAKGFGGKAKGLYGKFADKFGDGGKNGILSQSPKAGGSGIGK +LGKLAGGLGKGAGVLGVATSALSLIPALASGDSKAIGGGIGSMGGGMAGASAGASIGALF +GGVGAIPGALIGGAIGSFGGGAVGEKVGDMAKKANTKEGWNLGWTNGDKDGKNKFQDSLL +GKPISKAWSGITGLFDNDAEASEEDSKDKKKGVKGVKGDTKKKEKMTAEQLREKNNQSET +KNLKIYSDLLDRAQKIIESAKGINIDGGTSDSGSDSGGSASDVGGEGAEKMYKFLKGKGL +SDNQVGAVMGNLQQESNLDPNAKNASSGAFGIAQWLGARKTGLENFAKSKGKKSSDMDVQ +LDYLWKEMQSDYESNNLKNAGWSKGGSLEQNTKAFATGFERMGANEAMMGTRVNNAKEFK +KKYGGSGGGGGGGALSSTYQEAMSNPVLTTGSNYRGSNDASNASTTNRITVNVNVQGGNN +PEETGDIIGGRIREVLDSNMDIFANEHKRSY* +>MW460250_1_35 # 27257 # 29683 # 1 # ID=1_35;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=11-12bp;gc_cont=0.331 +MRRIRRPKVRIEIVTDDNTFTLRFEDTRDYNGDEFGAKLLGFQTKNSMEDDSSVFQINMA +GDTYWDKLVMANDIIRIFITPNDDPNDKEGKQERLIQVGMVSQVSKVGSYGNDQTQFRIT +GQSFVKPFMKFGLGVIQEVQAVLPEVGWLIDGDGDNEVKFTGSSAHEVMTGIIRRFIPYM +KYNYTEKTYNTIDNYLDYDDLSSWDEFEKLTEVSAFTNFDGSLKQLMDMVTARPFNELFF +KNSEKTPGKAQLVLRKTPFNPTEWRALDMIKVPTEDFIEEDVGKSDVETYSIFTATPAGM +LKELNGDVFSKPQFHPELTDRYGYTKFEVENIYLSTKSGSATEDSDSSGDDNGTERGTYS +KIMKDLSNYGRDNISKGIDKYTSKLSSKYKNLKKAQAKKIIEKFVKEGKVTEKEYEKITG +NKVDDELTSDNRPKLTKDKLKSILKEKFKTQDDFNNSKKKKKAKTDALKELTTKYRFGNK +THATTLLDEYIKYKGEPPNDEAFDKYLKAIEGVSNVATDTGSDASDSPLVMFSRMLFNWY +HGNPNFYAGDIIVLGDPKYDLGKRLFIEDKQRGDTWEFYIESVEHKFDYKQGYYTTVGVT +RGLKDAILEDGKGSPHRFAGLWNQSSDFMGGLMGEDTSKELKEKGVAEKQSSGDKDGGSD +SGGAQDGGSLDSLKKYNGKLPKHDPSFVQPGNRHYKYQCTWYAYNRRGQLGIPVPLWGDA +ADWIGGAKGAGYGVGRTPKQGACVIWQRGVQGGSPQYGHVAFVEKVLDGGKKIFISEHNY +ATPNGYGTRTIDMSSAIGKNAQFIYDKK* +>MW460250_1_36 # 29697 # 30584 # 1 # 
ID=1_36;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.282 +MATDKEAKDVIDKFIDNVFNFDVLTKERIKEKDEEIKKITTDDMYEKVVYIRPYVGVIQS +LNPQHVQYESFSNNGYDIEAELSFRKVSYLVDKGSIPTDSLSTLTVHLVERNQELLIDYF +DEIQDVLYGEYMEEEYVFDEDVPLSTILALDLNDNLKSLSNIKYMFKGAPKENPFGTDKD +VYIDTYNLLYWLYLGEDEELAYPMNINYFFTEGRFFTIFGKGHKYKVDVSKFIVGDILFF +GRSDTNIGIYVGDGEFISMMGKFPKDETPIGKYKLDDYWNEFNGRVMRFDEEVYI* +>MW460250_1_37 # 30584 # 33130 # 1 # ID=1_37;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.311 +MVVRFQSSMGRSLKRVDSDDLNVKGLVLATVSKINYKYQSVEVKVNNLTLGSRIGDDGSL +AVPYPKSFIGRTPEGSVFGTKPLITEGSVVLIGFLNDDINSPIILSVYGDNEQNKMINTN +PLDGGKFDTESVYKYSSSLYEILPSLNYKYDDGEGTSIRTYNGKSFFSMTSGEEEKPQAT +DFYTGTEYQDLFTSYYGNKTLIEPRIQKAPNMLFKHQGVFYDDGTPDNHITTLFISERGD +IRASVLNTETQKRTTQEMSSDGSYRVIKQDDDLMLDEAQVWIEYGISEDNKFYIKNDKHK +FEFTDEGIYIDDKPMLENLDESIAEAMKNLNEIQKELDDINYLLEGVGKDNLEELIESTK +ESIEASKKATSDVNRLTTQIAEVSGRTEGIITQFQKFRDETFKDFYEDASTVINEVNQNF +PTMKTDVKTLKTKVDNLEKTEIPNIKTRLTELENNNNNADKIISDRGEHIGAMIQLEENV +TVPMRKYMPIPWSKVTYNNAEFWDSNNPTRLVVPKGITKVRVAGNVLWDSNATGQRMLRI +LKNGTYSIGLPYTRDVAISTAPQNGTSGVIPVKEGDYFEFEAFQDSEGDRQFRADPYTWF +SIEAIELETETMEKDFMLIGHRGATGYTDEHTIKGYQMALDKGADYIELDLQLTKDNKLL +CMHDSTIDRTTTGTGKVGDMTLSYIQTNFTSLNGEPIPSLDDVLNHFGTKVKYYIETKRP +FDANMDRELLTQLKAKGLIGIGSERFQVIIQSFARESLINIHNQFSNIPLAYLTSTFSES +EMDDCLSYGFYAIAPKYTTITKELVDLAHSKGLKVHAWTVNTKEEMQSLIQMGVDGFFTN +YLDEYKKI* +>MW460250_1_38 # 33237 # 34028 # 1 # ID=1_38;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.332 +MPQSDGISNLHRIALRFPKEGGGYDMYRFKVNPENYTIDSPQRTTAIKTKSDIVIEDYGK +DIEVINFTGTTGFRPVREADGLKTGKQKMEELQSRVSEYAMQGGSGNVSGSYLQFFNFTD +DSYYKVHLAPQGLKITRSKDEPLLFRYEITLVVIGSLTEADRSAVTTEEFGNVKPNASQR +VDEGIKELDKNARKTRDRNNQEISRRENTIPKSTGDNTNEGNRLKQSFPSSSIYNPRQST +NGLKGNIDNMALIIGYGDGGVSS* +>MW460250_1_39 # 34028 # 34552 # 1 # ID=1_39;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.290 +MNNFIPQPQGLLRFLNALDTDLTSSHMNLLDEEVSFVSKFYTPQLQLSELAKKVLTNIKT 
+DDIPVLEREFNDNTIIHKANDTLLKVQAPRMYMILQSIVLEAYAIVNCFVENPSSLKYLT +EEDVSITRENLNYVADYLGNYDDYNSVVLDLRDLDLCFSAIELQLPLIKKEANV* +>MW460250_1_40 # 34552 # 35256 # 1 # ID=1_40;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.308 +MRFKKHVVQHEETMQAIAQRYYGDVSYWIDLVEHNNLKYPYLVETDEEKMKDPERLASTG +DTLIIPIESDLTDVSAKEINSRDKDVLVELALGRDLNITADEKYFNEHGTSDNILAFSTN +GNGDLDTVKGIDNMKQQLQARLLTPRGSLMLHPNYGSDLHNLFGLNIPEQATLIEMEVLR +TLTSDNRVKSANLIDWKIQGNVYSGQFSVEIKSVEESINFVLGQDEEGIFALFE* +>MW460250_1_41 # 35271 # 36317 # 1 # ID=1_41;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.312 +MKTRKLTNILSKLIDKTMAGTSKITDFTPGSASRSLLEAVSLEIEQFYILTKENIDWGIQ +EGIIEAFDFQKRQSKRAYGDVTIQFYQPLDMRMYIPAGTTFTSTRQEYPQQFETLVDYYA +EPDSTEIVVEVYCKETGVAGNVPEGTINTIASGSSLIRSVNNEYSFNTGTKEESQEDFKR +RFHSFVESRGRATNKSVRYGALQIPDVEGVYVYEETGHITVFAHDRNGNLSDTLKEDIID +ALQDYRPSGIMLDVTGVEKEEVNVSATVTISNKSRIGDTLQKHIESVIRSYLNNLKTSDD +LIITDLIQAIMNIDDVLIYDVSFDNLDENIIVPPQGIIRAGEIKVELK* +>MW460250_1_42 # 36338 # 39397 # 1 # ID=1_42;partial=00;start_type=GTG;rbs_motif=4Base/6BMM;rbs_spacer=13-15bp;gc_cont=0.288 +MANFLKNLHPLLRRDRNKKDNQDPNFALIDALNEEMNQVEKDAIESKLQSSLKTSTSEYL +DKFGDWFGVYRKTDEKDDVYRARIIKYLLLKRGTNNAIIDAIKDYLGRDDIDVSVYEPFT +NIFYTNKSHLNGEDHLMGYYYRFAVINVSIGDYFPVEIIDVINEFKPAGVTLYVTYDGAS +TIRGGAIIKWLDGLPKIETYQEFDRFTGYDDTFYGHINMNQSKDTDNSSSDIFKTNHSLI +NSLDVLTGSSSVGRQYINYGYVTSYVYNPGMTSSVNQISASTKGRGQEVPTDYYMYTSTK +NNNTVELSMQTTSGVSYLYNNFNFRDYMSKYRPQVDLQSDEARRIVSDYIKELSIDYYLS +AVIPPDESIEIKLQVYDFSINRWLTVSINNLSFYEKNIGSNIGYIKDYLNSELNMFTRLE +INAGKRDSVDIKVNYLDLMFYYYERGIYTIKPYKALIENYLDISRETYVEAFKIASLSNG +DIITKTGFQPIGYLKLVGNYENTIPSTINIVAKDTDNNPIESNELDVYNTVENRNLLQSY +KGVNTIAREITSTKEFTVSGWAKEIYSTNYLSKVLKPGKVYTLSFDMEITGNDPTLKSYS +DSHGIYLYSNTKGIVVSGVKSMERTIGNKVSVTQTFTAPTITDHRLLIYTGRYTSDGKAS +TPPVFFNTVKITELKLTEGSSKLEYSPAPEDKPNVIEKGIKFNNILTNIQTLSINSDTIL +KNVTLYYSYYGDSWVELKTLGNISTGETTETNNLIDLYGLQTVDYSNINPMSKVSLRSIW +NVKLGELNNQEGSLYNMPNDYFNAVWQDIDKLSDIELGSMRMVKDTEGGVFDGATGEIIK 
+ATLFNVGAYTDLDMLAYTLTNYTEPLTLGSSRLIIELKEELLTSESFNVDNRIKVIDSIY +EELPNTSIIKNGFVEREVTGSKYLDYGLYEPIEDGTRYKLIVEGEFKDNIEFISLYNSNP +NFNETFIYPSEIINGVAEKEFIAKPSTEDKPRLNTDVRIYIRPYDSTISKVRRVELRKV* +>MW460250_1_43 # 39508 # 40029 # 1 # ID=1_43;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.324 +MAIATYNSHVELAKYLVSKADSVYLTIGKSTPWSNETNPPQPDENATVLQEVIGYKKATK +VTLVRPSKSPEDDNKNLISYGNKSWVEVTPENAKAEGAKWVYLESSIVGDELPLGTYRQV +GFVMDLVAKSGISKFNLVPSEVESTGTLLFFDNKQFQNRSEQTTAKERFIVEV* +>MW460250_1_44 # 40050 # 43508 # 1 # ID=1_44;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.342 +MAINFKGSPYLDRFDPSKDRTKVLFNPDRPLQQAELNEMQSIDQYYLKNLGDAIFKDGDK +QSGLGFTLSEDNVLTVNPGYVYINGKIRYYDNDDSVKITGVGKETIGIKLTERIVTPDED +ASLLDQTSGVPSYFSKGADRLEEKMSLTVNDPTSATIYTFMDGDLYIQSTNAEMDKINKV +LAERTYDESGSYKVNGFELFSEGNAEDDDHVSVVVDAGKAYVKGFKVDKPVSTRISVPKS +YDLGTAENESTIFNKSNNSISLANSPVKEIRRVTGQVLIEKERVTRGAQGDGQDFLSNNT +AFEIVKVWTETSPGVTTKEYKQGEDFRLTDGQTIDWSPQGQEPSGGTSYYVSYKYNKRME +AGKDYEVTTQGEGLSKKWYINFTPSNGAKPIDQTVVLVDYTYYLARKDSVFINKYGDIAI +LPGEPNIMRLVTPPLNTDPENLQLGTVTVLPDSDEAVCISFAITRLSMEDLQKVKTRVDN +LEYNQAVNALDDGAMEGQNPLTLRSVFSEGFISLDKADITHPDFGIVFSFEDAEATLAYT +EAVNQPKIIPGDTTAHIWGRLISAPFTEERTIYQGQASETLNVNPYNIPNKQGVLKLTPS +EDNWIDTENVTITEQKTKKVTMKRFWRHNESYYGETEHYLYSNLQLDAGQKWKGETYAYD +REHGRTGTLLESGGQRTLEEMIEFIRIRDVSFEVKGLNPNDNNLYLLFDGVRCAITPATG +YRKGSEDGTIMTDAKGTAKGKFTIPAGIRCGNREVTLKNANSTSATTYTAQGRKKTAQDI +IIRTRVTVNLVDPLAQSFQYDENRTISSLGLYFASKGDKQSNVVIQIRGMGDQGYPNKTI +YAETVMNADDIKVSNNASAETRVYFDDPMMAEGGKEYAIVIITENSDYTMWVGTRTKPKI +DKPNEVISGNPYLQGVLFSSSNASTWTPHQNSDLKFGIYTSKFNETATIEFEPIKDVSAD +RIVLMSTYLTPERTGCTWEMKLILDDMASSTTFDQLKWEPIGNYQDLDVLGLARQVKLRA +TFESNRYISPLMSSSDLTFTTFLTELTGSYVGRAIDMTEAPYNTVRFSYEAFLPKGTKVV +PKYSADDGKTWKTFTKSPTTTRANNEFTRYVIDEKVKSSGTNTKLQVRLDLSTENSFLRP +RVRRLMVTTRDE* +>MW460250_1_45 # 43557 # 43715 # 1 # ID=1_45;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.283 +MPREVRDPYSQAKLFIPTVEEKSIKELEKTYKEKIDEATKLINELKKERGEK* +>MW460250_1_46 # 
43716 # 45638 # 1 # ID=1_46;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.316 +MAFNYTPLTETQKLKDMYPKVNDIGNFLKTEVNLSDVKQISQPDFNNILASIPDSGNYYV +TNSKGAPSGEATAGFVRLDKRNVNYYKIYYSPYSSNKMYIKTYANGTVYDWISFKLDEGS +LYNEGNTLNVKELTESTTQYATLVNPPKENLNTGWVNYKESKNGVSSLVEFNPVNSTSTF +KMIRKLPVQEQKPNLLKDSLFVYPETSYSNIKTDNWDTPPFWGYSSNSGRSGVRFRGENT +VQIDDGSDTYPSVVSNRFKMGKELSVGDTVTVSVYAKINDPALLKDNLVYFELAGYDTVD +DTSKNPYTGGRREITASEITTEWKKYSFTFTIPENTIGASGVKVNYVSLLLRMNCSSSKG +NGAVVYYALPKLEKSSKVTPFITHENDVRKYDEIWSNWQEFISKDELKGHSPVDIEYNDY +FKYQWWKSEVNEKSLKDLAMTVPQGYHTFYCQGSIAGTPKGRSIRGTIQVDYDKGDPYRA +NKFVKLLFTDTEGIPYTLYYGGYNQGWKPLKQSETSTLLWKGTLDFGSTEAVNLNDSLDN +YDLIEVTYWTRSAGHFSTKRLDIKNTSNLLYIRDFNISNDSKGSSVDFFEGYCTFPTRTS +VQPGMVKSITLDGSTNTTKVASWNEKERIQVYNIMGINRG* +>MW460250_1_47 # 45586 # 46035 # 1 # ID=1_47;partial=00;start_type=ATG;rbs_motif=AGxAG;rbs_spacer=5-10bp;gc_cont=0.302 +MKRNVYRYTILWELIEDKERWNKKTMAVKYDIGNNEIVLHLREGKYITGFTTVGGYDKEL +GQVKVNREILPAYFFDNFAYERYLYYSKPEEVIENKNYVPPQINDDDEESQQITVPKEQY +DSLKEELELMRKQQEAMMEMLQKLLGQKG* +>MW460250_1_48 # 46042 # 47418 # 1 # ID=1_48;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=11-12bp;gc_cont=0.328 +MALNFTTITENNVIRDLTTQVNNIGEELTKERNIFDITDDLVYNFNKSQKIKLTDDKGLT +KSYGNITALRDIKEPGYYYIGARTLATLLDRPDMESLDVVLHVVPLDTSSKVVQHLYTLS +TNNNQIKMLYRFVSGNSSSEWQFIQGLPSNKNAVISGTNILDIASPGVYFVMGMTGGMPS +GVSSGFLDLSVDANDNRLARLTDAETGKEYTSIKKPTGTYTAWKKEFELKDMEKYLLSSI +IDDGSASFPLLVYTSDSKTFQQAIIDHIDRTGQTTFTFYVQGGVSGSPMSNSCRGLFMSD +TPNTSSLHGVYNAIGTDGRNVTGSVVGSNWTSPKTSPSHKELWTGAQSFLSTGTTKNLSD +DISNYSYVEVYTTHKTTEKTKGNDNTGTICHKFYLDGSGTYVCSGTFVSGDRTDTKPPIT +EFYRVGVSFKGSTWTLVDSAVQNSKTQYVTRIIGINMP* +>MW460250_1_49 # 47510 # 49258 # 1 # ID=1_49;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.310 +MRLRIKNLYTYVEFEEDDKYLKDIFLKRVHTTIGARQEGFQYSPAYKRGSWDGYVDFYVY +EEDKFPTGLLFKIELLLGELQSRYNFQFETIDERDESFLSEEDIDDEITLLDNNVGQITL +RDYQYEAVYNSLTFYNGIAHLATNGGKTEVASGIIDQLLPQLEKGERVAFFTGSTEIFHQ 
+SADRLQERLNIPIGKVGAGKFDVKQVTVVMIPTLNANLKDPTQGVKVTPKQNISKKIAQE +ILPKFEGGTNQKKLLKVLLDNTTPKTKVEQNVLSALEIIYQNSKTDAEVLLNLRNHNAHF +QKIVREKNEKKYDKYQDMRDFLDSVTVMIVDEAHHSKSDSWYNNLMTCEKALYRIALTGS +IDKKDELLWMRLQALFGNVIARTTNKFLIDEGHSARPTINIIPVANPNDIDRIDDYREAY +DKGITNNDFRNKLIAKLTEKWYNQDKGTLIIVNFIEHGDTISEMLNDLDVEHYFLHGEID +SETRREKLNDMRSGKLKVMIATSLIDEGVDISGINALILGAGGKSLRQTLQRIGRALRKK +KDDNTTQIFDFNDMTNRFLYTHANERRKIYEEEDFEIKDLGK* +>MW460250_1_50 # 49270 # 50883 # 1 # ID=1_50;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.284 +MATKTQRKLYQYLEENATENKFHISTKKELADSLGVSISALSNNLKKLEEENKVVTVSKR +GKNGGVIITLVREYDTEELKEFNNSTDNIITSDLQYAKALREKHFPSYRYERKEQRRRTK +IEMAQYNAIKDEKRRIIADMNFYSEGLPYPSKDIFNMSYDPEGFYKAYILCKLYDQYAIS +HMDAKHTSHLKAMSKATTKDEYDYHQHMSEYYRNKMIQNLPRNSVSDNFFGSKMFNTFYN +FYLKIKDKNINVFKYMQNVFKNVTFYYENGMQPNPIPSPNFFSSDKYFKNYNNYIKGIKK +GVNSTNRHLGDTDSIINSSDYVKNPAVLHLHQLYTTGLNSTLHDIDTMFEQALDLENASY +GLFGDMKHIILLQYNSMIEEEIKNLPREEKDIINKYVKQCIINDYSPTSISPSARLSMFT +MQKEHIVYNKQLNKGIKREDLLPLSLGGIVNKDLLSGMDIQNLEQNGNEYLYMRQHTSTY +YILRMFGDYLGYEVNLREVKYIVEKYNLIDKIPLTKEGMLDYNKLIHLVEEEVNNYE* +>MW460250_1_51 # 50876 # 52318 # 1 # ID=1_51;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.312 +MSKKIKELILHKSMKDIHFAREVLDNLPKNLFSAESEDMGYLFTAIKRTAHISDKMSNEA +LAIKVEQLMGNNKEDEEKVTKTLTYLEDLYKVDVNEKDESVNYEIEKYIKTEMSKEVLVK +FIAENKQEDSDNLHELVDKLKQIEVSDISGGNGEFIDFFEDTEKKQELLSNLATNKFSTG +FTSIDNHIEGGIARGEVGLIIAPTGRGKSLMASNLAKNYVKSGLSVLYIALEEKMDRMVL +RAEQQMAGAEKSQIVNQDMSLNNKVYDAIQNHYQKNRKLLGDFYISKHMPGEVTPNQLEQ +IIVNTTIKKDKNIDVVIIDYPHLMRNPYAKYHSESDAGGKLFEDIRRLSQQYGFVCWTLA +QTNRGAYGSDVITSEHVEGSRKIVNAVEVSLAVNQKDEEFKSGFLRLYLDKIRNSSNTGE +RFVNLKVEPTKMIVRDETPEEKQEHIQLLSDNGKEDTSKFQNKDNKIEAINNTFGGLPGV +* +>MW460250_1_52 # 52397 # 53434 # 1 # ID=1_52;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.318 +MKFVFFTDSHFHLFTNYAKPDEQYVNDRFREQIQALQKMFDIAREEDATVIFGGDLFHKR +NAVDTRVYNKVFETFQLNRDIEVLMLRGNHDSVTNSLYTDSSIEPFGYLPNVEVCKNLDT 
+LGFLGEEQDINIVMAPYGDETEEIKEFIKNKYVEDRVNILVGHLGVEGSLTGKGSHRLEG +AFGYQDLLPDKYDFILLGHYHRRQYFQNPNHFYGGSLMQQSFSDEQEANGVHLIDTEKMT +TEFIPIHTRRFITIQGEDIPENFEQLIEEDNFIRVIGTANHAKVLEMDDSMKDKNVEVQI +KKEYTVEKRIDSDVSDDPLTIASTYAKQYSPESEQEILECLKEVL* +>MW460250_1_53 # 53434 # 53811 # 1 # ID=1_53;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.280 +MKKYREYLNKTDAENLAEDWEKVTEDLWKVFKDMKPKINTLDISNVVSKDLDKSKPILQF +QDSDGVIENICNVEGLEDGLSKMKKIFDDSNFEKHYYNRVVDHDEYYWIDYGSHHCFFRV +TKGDK* +>MW460250_1_54 # 53811 # 55730 # 1 # ID=1_54;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.297 +MVVFKQVEVNNFLAIKEATLELDNRGLILIEGENKSNESFHSNGSGKSTLISAITYALYG +KTEKGLKADDVVNNIEKKNTSVKLKFDIGEDSYLIERYRKDKENKNKVKLFVNEKEITGS +TNDVTDKQIQDLFGIEFNTYVNAIMYGQGDIPMFSQATDKGKKEILESITKTDVYKQAQD +VAKEKVKEVEEQQNNIRQEIYKLGYQLSTKDEYFQREIEQYNQYKEQLVQIENSNKEKDR +LREQEEKQIEAQIEQLASQIPTIPEDEFKHSEEYNKASQSLDLLSNKLTELNQVYSEYNT +KEQVLKSEIATLSNSLNQLDTNDHCPVCGSPIDNSHKLKEQENINNQIENKKQEITSVLE +MKDTYKEAIDKVKDKSQEIKDKMSQEDQQEREHNNKINSIIQEASRIKSDISSLENNKTY +LKVKYQHQSVQGLEREEPSKEKHEEDKKELQESIDKHEENIVQLETKKGKYQQAVDAFSN +KGIRSVVLDFITPFLNEKANEYLQTLSGSDIEIEFQTQVKNAKGELKDKFDVIVKNSKGG +GSYKSNSAGEQKRIDLAISFAIQDLIMSKDEISTNIALYDECFDGLDTIGCENVIKLLKD +RLNTVGTIFVITHNTELKPLFEQTIKIVKENGVSKLEQK* +>MW460250_1_55 # 55730 # 56326 # 1 # ID=1_55;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.281 +MKLKILDKDNATLNVFHRNKEHKTIDNVPTANLVDWYPLSNAYEYKLSRNGEYLELKRLR +STLPSSYGLDDNNQDIIRDNNHRCKIGYWYNPAVRKDNLKIIEKAKQYGLPIITEEYDAN +TVEQGFRDIGVIFQSLKTIVVTRYLEGKTEEELRIFNMKSEESQLNEALKESDFSVDLTY +SDLGQIYNMLLLMKKISK* +>MW460250_1_56 # 56341 # 57408 # 1 # ID=1_56;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.290 +MRFEDFLTQELGEPKENTIGELRYCCPFCGEKSYKFYVKQALDSSNGQYHCKKCDESGNP +ITFMKTYYNITGKQAFDLLESKNIDIERAPLLTTNNKDLTESEKLILMLRGVHQDKGNTS +IKPPRLPEGYKLLKDNLNNKEIIPFLKYLKGRGITLEQIINNNIGYVINGSFYKVDGESK +VSLRNSIIFFTYDNDGNYQYWNTRSIEKNPYIKSINAPAKQDEVGRKDVIFNLNIARKKK 
+FLVITEGVFDALTFHEYGVATLGKQVTENQIKKIIDYVSIDTSIYIMLDTDALDNNIDLA +YKLKTHFNKVYFVPHGDEDANDMGTRKAFELLKQNRVLVTPESIQSYKIQQKLKL* +>MW460250_1_57 # 57475 # 57813 # 1 # ID=1_57;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.274 +MSNNKKDILEFVDEYITALRVGNEQRQHQLEEMGKEETATLTDVAKAITNLMLGVNEQMT +DLEYNNELNLNILIDALYKAELINEDVLDYIQESIDKSQEEPKNEEEKGEQE* +>MW460250_1_58 # 57813 # 58265 # 1 # ID=1_58;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.331 +MEKNISTHTKGISQADMEKWIEAVVQGTVDGKQVDEKTAKQLDRIGSRSVSLEEATRIAK +VLNAVTAQEVTGDFNDAFNAIDLMMIIMEDELGVTQEKVGKAKDKLNEKREAYLKEKQEE +LRQKQQEEAQKKTESDSNEKVIQLKKNDEQ* +>MW460250_1_59 # 58252 # 58860 # 1 # ID=1_59;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.328 +MTNSKKKGDTFERKIAKELTAWWGYQFNRSPQSGGASWGKDNNAVGDIVVPQEANFPLVV +ECKHREEWTIDNVLLNNREPHTWWEQVINDSSKVNKTPCLIFTRNRAQSYVALPYDEKVY +EDLRNNEYPVMRTDFIIDNIRKDKFFYDVLITTMNGLTSFTPSYIISCYDKKDIKPYKKV +ESNLSEVSKHEDELINDLLSDI* +>MW460250_1_60 # 58877 # 59269 # 1 # ID=1_60;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.293 +MTSKERPLIVYFSGTGQTERLVNKININNSFETFRVKSGKEKVNKPFILITPTYKKGAIP +KQIERFLEINGSPKEVIGTGNKQWGSNFCGASKKISEMFKIPLIAKVEQSGHFNEIQPIL +EHFSNKYKVA* +>MW460250_1_61 # 59284 # 61398 # 1 # ID=1_61;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.312 +MATYGKWIELNNEITQLDDNGKNKLYKDQEALDEYLKYIEDNTRKFNSEVERIRVLTKEG +TYDKIFDNVPDTIIDEMTKLAYSFNFKFPSFMAGQKFYESYASKQYDENKKPIFVEDYEQ +HNVRVALYLFQNDYVKARELLVQLMEQTFQPSTPTYNNSGQANRGELSSCYLFVVDDSIE +SLNFVEDSVANASSNGGGVAIDLTRIRPKGAPVRNRPNSSKGVIAFAKAIEHKVSIYDQG +GVRQGSGAVYLNIFHNDILDLLSSKKINASESVRLDKLSIGVTIPNKFMELVKEGKPFYT +FDTYDINKVYGKYLDELNIDEWYDKLLNNDSIGKVKHDAREVMTDIAKTQLESGYPYVFY +IDNANDNHPLKNLGKVKMSNLCTEISQLQEVSEIYPYSYSNQNVINRDVVCTLGSLNLVN +VVEKGLLNESVDIGTRALTKVTDIMDLPYLPSVQKANDDIRAIGLGSMNLHGLLAKNMIS +YGSREALDLVNSLYSAINFQSIKTSMLMAKETGKPFKGFEKSDYATGEYFVRYIRESNQP +KTDKAKKVLNKVYIPTQDDWDELAKAVKVHGLYNGYRKAEAPTQSISYVQNATSSIMPVP 
+SAIENRQYGDMETYYPMPYLSPITQFFYEGETAYKIDNKRIINTSAVVQKHTDQAVSTIL +YVESEIPTNKLVSLYYYAWEQGLKSLYYTRSRKLSVIECETCSV* +>MW460250_1_62 # 61412 # 62461 # 1 # ID=1_62;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.320 +MDITQKVKQHNKNAVLKATNWNIEDDGMSDIYWEQGISQFWTPEEFDVSRDLSSWNSLTE +SEKNTYKKVLAGLTGLDTKQGGEGMNLVSYHEPRPKYQAVFAFMGGMEEIHAKSYSHIFT +TLLSNKETSYLLDTWVEENDFLKVKAQFIGYYYDQLLKPNPTIFDRYMAKVASAFLESAL +FYSGFYYPLLLAGRGQMTQSGAIIYKITQDEAYHGSAVGLTAQYDYNLLTEEEKKQADKE +TYELLDILYTNEVAYTHSLYDPLELSEDVINYVQYNFNRALQNLGREDYFNPEPYNPIVE +NQTNVDRLRNVDFFSGKADYEKSTNIKDIKDEDFSFLDSKEYSTAKEFL* +>MW460250_1_63 # 62479 # 62808 # 1 # ID=1_63;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.306 +MDRKEAMDLLSKAEILFKKHDEFSCVSDINDPMKLFSNSKDAKADDTSNSFQLEFMHDMT +MYTLSYGSGQLKLIDLAEGYEAQKATIVNSFPEIIKTLEKDDSEDGKNE* +>MW460250_1_64 # 62792 # 63112 # 1 # ID=1_64;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=11-12bp;gc_cont=0.293 +MEKMNSLVDLNTAIRQKKDVIVMITQDNCGKCEILKSVIPMFQESGDIKKPILTLNLDAE +DVDREKAVKLFDIMSTPVLIGYKDGQLVKKYEDQVTPMQLQELESL* +>MW460250_1_65 # 63172 # 63915 # 1 # ID=1_65;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.277 +MEGRWHIAKNKTLTIYNSDRYFNIHTKDKDKINEAIKVTHGNEEEIEKNMDELISKSRRY +IMRDEKHYMLFNEKYNNDRLIEKVCKHGGKVTYYTDSVLPYYVLKDLSSHPDSEVVYRMR +NGFTAKEVDNIALSFMGTKVIIDISVVFPYVNPYDIIRSLHDIKTNVDEVHLSFPRILGV +DEKQEKFYFFDGEAYDLKPEYKVDFADKIRVSLSVWKMYIYILTSSRDFEDVDNVITKLK +QQRKIKI* +>MW460250_1_66 # 63925 # 64230 # 1 # ID=1_66;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.307 +MSTANRRDIARKISENTGYYIQDVEEILSAETDAISDLLEEGYTKVKNHKFMQIEVIERK +GKKAWDGLNKEYFHLPNRKAIKFKPLKELEEVIDRLNEEEK* +>MW460250_1_67 # 64306 # 65214 # 1 # ID=1_67;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.315 +MKVLILFDHIREEHFSVSKDGSVKSNVLNTPNGKTLKKLLEKCSNLKRDKTNRDYDIDFL +YNAVPTPIRNDYGKIIKYQDVKQAEVKPYYERMNNIIIDNSYDMVIPVGKLGVKYLLNVT +AIGKVRGVPSKVTIENGTSSHDVWVLPTYSIEYTNVNKNSERHVVSDLQTVGKFVEQGEE 
+AFKPKEVSYELVDNIERVREIFNKEVKNDNYDGVDITAWDLETNSLKPDKEGSKPLVLSL +SWRNGQGVTIPLYKSDFNWENGQDDIDEVLELLKNWLASKEDIKVAHNGKWFAVVKSLSY +RA* +>MW460250_1_68 # 65371 # 65856 # 1 # ID=1_68;partial=00;start_type=GTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.278 +MKKDYMTSVKNNKKVCRRCNEELDLSNFKTYKKNDKTYYQSMCIPCRKEYNKLDKTKNTI +KKCYEKNGDKYRRQSNEYNTSDRGRELNKNRSRKYRENNSLKSKARSSVRTALRNGSLIR +PDKCSECNKDCIPEAHHPDYTKPLEIKWLCKSCHEDTHHKK* +>MW460250_1_69 # 65992 # 67335 # 1 # ID=1_69;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.303 +MSTENFKDFESIQDTKVGWYLAVTQEVKESLRLSDLAYEVTDVGGYDKPLEDFKLWFVTK +LLRFFSDKIKEIQKENKKIAKKEYDVKAPEYKEWLENKLNETVVELDDTEKKFRVSELEK +KYIQLGLSPEIVNMNLVMDNDEFINIAEQSPEYMGLSDYAKSYTLNTAINLINEYRDVKD +VVNDIDGGNFNYDWFPIELMHPYASGDTDVCRRIYCDVIKKLKEQDRPKSMHLLEVNYPR +LTKSLARIESNGLYCDLDYMKENDESYESEMAKNHATMREHWAVKEFEEYQYNLYQMALE +EHEKKPKDRDKDIHQYRDKFKDGKWMFSPSSGDHKGRVIYDILGIQLPYDKEYVKEKPFN +ANVKEADLTWQDYKTDKKAIGYALDNLELKDDVKELLELLKYHASIQTKRNSFTKKLLNM +INKQKRTLHGSFSETGTETSRLSSSNP* +>MW460250_1_70 # 67603 # 68310 # 1 # ID=1_70;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.275 +MKEIWKKVVGFENYEVSNKGKVRNIKTNYILKPWIINSGYEQVSIGIANVLVHRLVAMTF +IPTDSYSIVNHIDNNKLNNCVENLEWVSYKGNSAHANKQGRLNTYSAREKLSSVSKKAIY +QKDMEGNIIKLWDSPSEAEKESNGYFKSTKISSVAHGKRKHHRSYTWEYVYKDSKRSLNK +SINMYDLNNNLLYEDLTMNKIMGILEMNNHKTLRDKLRNTDDFVEYRGYKFKNNN* +>MW460250_1_71 # 68544 # 69404 # 1 # ID=1_71;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.304 +MRIIGLFTKDPDMLQSFLNGEDIHKATASIVYNKPVEEVTKEERQATKAVNFGLAFGESP +FSFAGKNNMEVSEAEEIFEKYFQTKPSVKTSIDNVHEFVQQYGYVDTMHGHRRFIRSAQS +TDKKIKNEGLRQSFNTIIQGSGSFLTNMSLTYLDDFIQSRNLKSKVIATVHDSILIDCPP +EEAKIMAKVTIHIMENLPFDFLKAEIDGKEVQYPIEADMEIGLNYNDMVEYDEEEIDTFN +SYQGYIKYMMNLQTLEDYKESGKLTDEQFEKATNVVKSEKHIYQEI* +>MW460250_1_72 # 69473 # 69715 # 1 # ID=1_72;partial=00;start_type=GTG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.337 +MNTGEIRFNRSMDEWIITSMYQDELGGMNIVVTFYNREENKHGSTVLPTESSTGEVTEEL +ASLEEEYPLALPLSSISVNI* 
+>MW460250_1_73 # 69732 # 70214 # 1 # ID=1_73;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.300 +MEIHIDSLDFTNFTIKDRNGNSQEFDITDELRITEYTIQEDFMQQSAKYAFWASILEKVR +AYSEMEQRNLETIGSKLNLTIRQEYEQQGKKPTKDMIESSVYIHDSYQQQLKVVEAWNYK +VKQLQYVVKAFETRRDMMIQLGAELRQTNKNGGITNPFSH* +>MW460250_1_74 # 70301 # 71572 # 1 # ID=1_74;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.355 +MDFNQFINNEASKLESNNSSFNNNVESYKPKNPVLRLGNIKDANGNKVVKENAFVRVLPP +AQGTNVFFKEFRTTGINYSKKDGSQGFTGLTLPAEEGSSVLDPYIQDWITNGVQFSRFPN +KPGVRYYIHVIEYFNNNGQIQPKTDAQGNVMIQPMELSNTGYKELLANLKDTMLKPSPNA +PHSFISATEAFLVNIVKAKKGEMSWKVSVYPNAPLGALPQGWEQQLSDLDQLAKPTEEQN +PNFVNFLINNVNNTELSHDNFKFNRETNVLGEEPSEPKQAPTQQDVDSQMPSNMGGQPNQ +PQQGQVGQYAQQGQSNGQGQQLQGTQQPINNTQFGQGTPSGQQPSNTGSVDWDNLAQQQS +QPDSNPFNDFDVSSVDDSQVPFETQPQNTQQAPEPQQTTQEPPKQKQTQSIDDVLGGLDL +DNL* +>MW460250_1_75 # 71632 # 71856 # 1 # ID=1_75;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.333 +MARAKKGKEVDLTDLNTIDLGKELGLTLLSDTNRADIKNVIPTMVPQYDYILGGGIPLGR +LTEVYGLTGSGCLK* +>MW460250_1_76 # 72201 # 73169 # 1 # ID=1_76;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.268 +MVKRVWTNEEKQDIVKSFQEGKTFKELQDKYNAHYFTIKKILDEFNIDTNKKRRWTNKQK +QDILRMYTKESMTIAEIKKVYTTHAREIGRILKDFGVDTSYYQTRSVNRNINRNFFEVID +TEEKAYILGLLMADGCVRYRREGQCYLTLELIDKEIVKRVQKELNSDSKIYESHRKRDYI +KNEKQTYTFSVTDEKLCNDLAKYGIVPMKSKKTECLTQDIPYDLRKHYLRGLFDGDGSIG +YYNNRWFITLINNHPEFLKDVGTWINDLLGLKCPKVSKTSTSYRIGYTGKKAKELMKLLY +QDNNIHIDRKQKLADQAIQDIV* +>MW460250_1_77 # 73317 # 74264 # 1 # ID=1_77;partial=00;start_type=ATG;rbs_motif=AGxAG;rbs_spacer=11-12bp;gc_cont=0.344 +MEQLGVDVSKLFSIQSGEGRLKNTVELSVEQVGKELEYWIDTFNEKIPGVPIVFIWDSLG +ATRTQKEIDGGIDEKQMGLKASATQKVINAVTPKLNDTNTGLIVINQARDDMNAGMYGDP +IKSTGGRAFEHSASLRIKVHKASQLKQKSELTGKDEYHGHIMRIETKKSKLSRPGQKAEA +DLLSDYMVGKEDDPILLNGIDLEHTVYKEAVERGLITKGAWRNYVTLNGEEIKLRDAEWV +PVLKDNKELYLELFSRVYGEHFPNGYSPLLNNKVIVTQLEEYQALENYYKEWATDNKQEE +QEEELKGESQEKDSE* +>MW460250_1_78 # 74268 # 74621 # 1 
# ID=1_78;partial=00;start_type=ATG;rbs_motif=4Base/6BMM;rbs_spacer=13-15bp;gc_cont=0.314 +MDNLIDKNMNQVKESLGNANSSDVLPLPYKDIAKKFEEVKEKGESIIIEEGGFPYTDSTV +MYIEHVTDRWAGGYSLIRHEGEEVKVPKTIHFSDIYVKDKSHKVRIIFEGANPYEES* +>MW460250_1_79 # 74608 # 75270 # 1 # ID=1_79;partial=00;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=5-10bp;gc_cont=0.275 +MKKANNGNRYVIDIDGIPVDFERDLDSLLNRYKNLRWSLYHRYAGILSNDFERQELREYI +DEQFIKLVKEYNIRSKVDFPGYIKAKLTLRVQNSYVKKNEKYKRTEIIGKKDYTVESLTE +DLNEDFEDNQIMSYVFDDIEFTEVQSELLKELLINPEREDDAFIVSQVAEKFDMKRKEVA +SELTELRDYVRFKINAYHEYYAKKELNNHRVNTENHIWEN* +>MW460250_1_80 # 75398 # 76021 # 1 # ID=1_80;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.361 +MAKKNVNDVLQQESVTVADKYLQVKVNRDGYTRTHEGQYAYKVVSEGEELFLYPVQTDGK +GTLNVMKKSPIAYTDGDNIHFVVNTVVDPYNHSFIRTEDIKGLDKGKQLIQAFLAFVEDR +FKFGVYNVFVANNKEDVLSIVDPTDNDADEVKDSLEHAHEDVIADFPASPARKDVKGVDS +GEGQGDTSEPSAPKNVQVTPKEDVSAE* +>MW460250_1_81 # 76044 # 76556 # 1 # ID=1_81;partial=00;start_type=TTG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.365 +MAKLNLYKGNELLNSVEKTEGKSTITIENLDANTDYPKGTFKVSFSNDSGESEKVDVPQF +KTKAIKVISVTLDVDSLDLTVGDTHQLSTTITPSEASNKNVSFESDKSGVASVTSEGLIE +AVSAGTANVTVTTEDGSHTDIVVVTVKEPIPEAPADVTVEPGENSADITV* +>MW460250_1_82 # 76571 # 76798 # 1 # ID=1_82;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.355 +MEKTLKVYSNGEVVGSQVANNDGATTVSITGLEAGKTYAKGDFKVAFANDSGESEKVDVP +EFTTKTPTEEPSGDA* +>MW460250_1_83 # 76894 # 77154 # 1 # ID=1_83;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.295 +MDIPTILFRNPYDYTKVKKLMENKEQYIVVKFDSVSVHNLNVQGMMNVIQDYLHIYGYRV +KEYGQENSSKDDERDVKGYLYERVGE* +>MW460250_1_84 # 77158 # 77913 # 1 # ID=1_84;partial=00;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=5-10bp;gc_cont=0.279 +MGIIVNSNHIQSDTLYEYDSFFDIEKVDTFEEGLLSIQDEPTVLAGFIYDDITFNKVINS +NSDIDDYIKNNDIYYVSDIGLLPDTFITVDSDRKYYSLLQQITELSKDPFPKWVEDDAKG +LTKYYNFQDFEDVFDLNSFYKKEVDMVREKCYNNGNVYLLYEVLPDYKLPLAYSLLSNKE +HGIVIIGSQTRSNNDILTFYVKGMDAKAIASMFNVEHDYDSNIFHTFVNSHINILGNQIT 
+KFIREKGSSYE* +>MW460250_1_85 # 77906 # 79156 # 1 # ID=1_85;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.289 +MSNYKTIEEVQAVIIGVLFKDEGKIVTSKFNKITKEFGLDRIGKDDLKEIVEDIRQDAYL +NELKNKAIKGKVTLGDLKDVADNQVFEGNNYHEEVSTYVVAKEKELSHLREQRKHNRHTA +YPQIMFDELKEHMVKELQGETLVEHHGSKANINDTELIVLLSDFHIGSIVSDMTNGKYDF +EVLKSRLNHFINTTVKEIEDREISNVTVYFVGDLVEHINMRDVNQAFETEFTLAEQISKG +TRLLIDILNVLSNVVSGELRFGIIGGNHDRMQGNKNQKIYNDNIAYVVLDSLLLFQEQGL +LNGVDIIDNREDIYTIRDTFGGKSIIINHGDGLKGKGNHINKFILDSHIDLLITGHVHHF +SVKQEDFNRMHIVASSPMGYNNYAKELHLSKTKPSQQLLFVNKENKDIDIKTVFLD* +>MW460250_1_86 # 79170 # 79559 # 1 # ID=1_86;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.303 +MDTIFIIGVAFITFATFNIVFRLFDLWTTEKKMVSQGQPPLSNFEYYHVIVPYLVGVIVI +ILSIIFRDSLYSAQSGFGVIITSFIYMLVYVIIGLVGSFVLTIFQARKARQYQTQEDNNE +VQWYLWAIN* +>MW460250_1_87 # 79525 # 79836 # 1 # ID=1_87;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.272 +MKFNDIYEQLIKNDTVQNIHESQDDKGNIYTIQFDKGNDKYLFNVINDGFLKEMTNGMVD +HPEGQPYSVSLINKETPSMSVKQYLTDVEDIVPTIRKMEKDFL* +>MW460250_1_88 # 79900 # 80436 # 1 # ID=1_88;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.333 +MDFNFSAFDNSSLAMRISEGVYYFNDTPYYFIEHVEEEMSEYVIVYDIHDREEKENPQKK +YRIEPYQRTIPGGTPLSNLIKSMMPQRKYPKKVTEDPIFVANVIPLGTDTVTGKTGKGFF +ERDKDRTIYSQKEPTKVVHGQYTGVFIGLTSVKWNRTYTPLESVVEYYKRVKGDRLNV* +>MW460250_1_89 # 80429 # 81196 # 1 # ID=1_89;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.303 +MSNDVVKFYEKDIKDLIRTKKHMFKDDEITSDINDIRIFNEKVICQGKCRTDCLVLDRNG +TVMGIEIKTERDSTQRLNNQLKYYSLVCKYVYVMCHDKHVPKVEQILKRYKHNHVGIMSY +ISFKGKPVVGKYKDATPSPHRSPYHTMNILWKTNLMTILRLIRDPHTYRTGYSYNVSGRY +SGGEGNFSQTTQSKRMKKPAIINQIIHYVGVDNTYKLFTRGVIYGYNNRWEVIEEDFFNT +MKNGVRVINEQRQTK* +>MW460250_1_90 # 81174 # 81620 # 1 # ID=1_90;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.295 +MSKDKPNRRKEIQHQPVNFAPTNTLTGANNSFFAKKPSEPKDATSVIEYRILFIKRFDNV +TSTDVKLQKKYALNLISEALDVKETYLSLKQKGKKTESILHTDRVYYVHRGKKLIGKCSI 
+REQRTFKGKHLIFIFKTRHRVKAERKDK* +>MW460250_1_91 # 81620 # 82483 # 1 # ID=1_91;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.302 +MLKGFSEHVDKPTTIKTLYKTLTSGKVELLGVSYDSDYFPSGVTVQSYIEDIGNEDEGLQ +FVNKVNVVESMKQAVVGMNNQLGSSGLGYVRTEQLKKELEETGLMTDLLARGTNLTSTKK +VDIVSTFIEPEVTYQNITIAKDIKLRLYKVEEESPLNGYTHIVYLLTTEKLYDGQTLFGM +LSKKDKLSKGDTDKLLAFFRNNSLISKSVFCVKLLSKDYYFNLYNTHETGIFFLEDTDVI +TIACGQSYVKVNTKDIKSSYVKIEDKTHKLTELVINLKGDDTLTILF* +>MW460250_1_92 # 82855 # 83586 # 1 # ID=1_92;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.276 +MARKKNLRNKNSDIKVVPDKEKESILSKLYHNKLLRSKVDNALDEDMSYDDIIELCKEYD +LELSKSAITRYKSKRKEAIENGWDLGELIDKRKKTSVKDIKEKETPILEEEQLSPFEQSK +HHTQTIYDDIQVLDMIISKGAKGLEFVETLDPALMIRAMETKDKITGNQLKGMSFIGLRE +LQLKQTAQDTAMSEVLLEFIPEEKHEEVLQRLEELQNEFYKNLDLDEESRKLKEALDRVG +YTI* +>MW460250_1_93 # 83604 # 84062 # 1 # ID=1_93;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.296 +MADEISLNPIQDAKPIDDIVDIMTYLKNGKVLRVKQDNQGDILVRMSPGKHKFTEVSRDL +DKESFYYKRHWVLYNVSVNSLITFDVYLDEEYSETTKVKYPKDTIVEYTREDQEKDVAMI +KEILTDNNGNYFYALTGETILFDENKLNKVKD* +>MW460250_1_94 # 84127 # 84570 # 1 # ID=1_94;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.277 +MFISLNQEEKELLTKEESKYTPLETSREFNTPKEEFIVTSYNEGKPLDYIAKEAKVSMGL +IYTVLNYYKVGKRNKKSPVEERIAHILKDKNLVKEIIKDYQYMNLQDIYSKYNLHKNGLY +YILDLYHVERKSELKDKALEEDNIVVE* +>MW460250_1_95 # 84587 # 85291 # 1 # ID=1_95;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.306 +MRNKKSFQEQLNDMRNKEKWVSEEEFTEEVAPPEEPEVEEEKLYTLNELKESLLDAQGLK +DVVADFPASKDLYEPNKLYICTIPKGYQSTEVQPGQYIGISTGLLSESEDFSHLRGQMPR +NLYETSHVLKPLIRINNTNIEYQQHELLEDIKDDKKIYDVELEDLRLATGEEVSHLEIVD +NKFFESRINEVLDRYTELTDSNDLLKYYSKLRELVGSDKMIYCSLLDKCVKIID* +>MW460250_1_96 # 85353 # 85751 # 1 # ID=1_96;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.321 +MSRKASIFYILVVIVLAFSISSYYISSFMYHDKAKNEVSTELSNTGKIKEEKNVEFVGDY +TLKKVEDNKAYFMETLPTYLPGRTGDNSIDMRYYKTSRFKEGVNFKLIRVYTEDGEDNPI 
+HKYRFEAVPTKK* +>MW460250_1_97 # 85898 # 86140 # 1 # ID=1_97;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.333 +MEMADLERFDAFVRLISDDELSEERILELSVDLLNPILEGGTAYKAKKRIKSKFGKLEAK +NFKRNYKFLLKSIAQIDQRR* +>MW460250_1_98 # 86145 # 86336 # 1 # ID=1_98;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.271 +MTEREKLIKDIEEANRDIQLQLKEVDNYKDSIRSKGTRNYISTKVLDSIMVGFIWWSRYC +IKT* +>MW460250_1_99 # 86511 # 86687 # 1 # ID=1_99;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.294 +MVIPSIKAQNKFKNELEYYKQGHISESKMLELAFDYIQELEQNNEYVTNLLEEERYGE* +>MW460250_1_100 # 86677 # 87210 # 1 # ID=1_100;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.287 +MVSKFIGVYLFNLLIAIILTLTLIGTITDSIESTLAQIIVGMFIIITIYGILSALIPILV +HKAVSPGWSYTEWNESYYIRLPGEENYKYYSKWYLDLLGVKEFYYKRDNGEEVKEKNISW +AFQAEVKRPEDVNHWKNQLLTNRPLTILEYKKLKKLDKESEIRKQEDLEEYKQYNSN* +>MW460250_1_101 # 87225 # 87473 # 1 # ID=1_101;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.273 +MISSFDSILLVIYIIIAFAVAMAIIYLVFKGMTILLDKLMMLLLSKTTLDVEACSMIMAV +ISTIVFGIIVLLIWLAVNNILL* +>MW460250_1_102 # 87485 # 87661 # 1 # ID=1_102;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.288 +MDFNDFINSESDRVGKPKQKKKVENKLPSSTPIEDKEKKLKEIRKKSLYIDLRRKRND* +>MW460250_1_103 # 87654 # 87950 # 1 # ID=1_103;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.303 +MTKETNVLYKDKYRDYTIVVRLAGNIIVTEVDKKHKTAFTPIIFDNGVEGVELVMRIGSV +ELNMTDLREFTKEVSTAQKALEYFNKKLYIKGLTDEAF* +>MW460250_1_104 # 87998 # 88180 # 1 # ID=1_104;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.317 +MLLGILWFIWGFVSYFVLMFGIEFWKDRWMPGVIGAGTLLLFLFWIMKSIHNAMTVVYLY +* +>MW460250_1_105 # 88193 # 88561 # 1 # ID=1_105;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.279 +MDILIIHYKETNKRVLKETIQTIQNHLNDEHGLVKMTATKLSRENIEKRFNNYNIVIAED +DPDNSYHYGEAVEDADFIIDIPISYLDIHAGIEWDVDNPVDMLDRNPDFIEAVNKLNEDL +ML* +>MW460250_1_106 # 88574 # 88921 # 1 
# ID=1_106;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.313 +MLNEKLKNLEDTKVYMINSIASLLSASTGKSSKVFFDEGTIKIVSGETKAVEVIDNLVHP +HSGRLPIKTTERIALGRLTDSLQFVISEIEVVKDQIIDEENEAYIDFVMEDWNWD* +>MW460250_1_107 # 88921 # 89199 # 1 # ID=1_107;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.308 +MPMDLLTIASVAFIAVVIVDLINDDMSYMLTGTAILINIWAGFYGWFFLLQAGMLLFLLL +ARKVKDDKESILYSSASLICALGMIINLLSFS* +>MW460250_1_108 # 89269 # 89574 # 1 # ID=1_108;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.252 +MSKETIRRQFSNAIEIMATTKEWWNFPKSFDTNKEFKIKTFKNDTLVFEVREGSRNLGSF +VVFTNIDFDYDKLEGTSTQYMINYFAKKLTKDMFNYHKLQL* +>MW460250_1_109 # 89589 # 89939 # 1 # ID=1_109;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.313 +MREELKPFNRKQVNVKGYLDDVKYSKRRRHKGNQHGCVKITVTDVKINGIPIDHVNIEVG +ISFYEKLKELQGKRIQFVGTVYKYVKHARGRKGRIKGFYKEDYSVTLDKKLQKEEK* +>MW460250_1_110 # 89939 # 90541 # 1 # ID=1_110;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.300 +MIKRRKHLDHSLQPEKGWRTVPFNGYYEAHPTGLIRNKVTKKLIKGTQTRKNHPKWTAHE +IVYLINPKKTSYSRGVVIAHTFPEMISQSRGDLKNGHVCFKDGDRSNCHVDNMFIGKGNV +NKNIYKLNDSYLTRKDIEEDVNNLVNERLFSQLELLIKKNEPERITPSNHFIKRDNNVFS +ITDLSKNSLVEFELEIKNIK* +>MW460250_1_111 # 90555 # 90734 # 1 # ID=1_111;partial=00;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=5-10bp;gc_cont=0.261 +MNEWYALCYYNKIGKKKIPRQIKAHRDVSVLEDLKDRLEEQNPKEEYKIKTTKEFDKER* +>MW460250_1_112 # 90961 # 91362 # 1 # ID=1_112;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.286 +MKLEDKVLERIDSLGNKAGNLSNQVMESLVKYQITYGIIDIVVSILVIALTIFLGKVYLK +EYKKVKMDLKESLLYDDYDDLSGIGWCYTILLILLTLFSLYAIVAGIPTDIMRLINPEVY +AVKDLIEQVKGGN* +>MW460250_1_113 # 91364 # 91651 # 1 # ID=1_113;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.333 +MKQRDFEFEEDFVLTYECEDCKHFEDWGHDEEPEECSECGSSDLISIIQVMKILSVICVE +GILICGKMDIDIWEIIKSILKKRNQVWFVKIVMRN* +>MW460250_1_114 # 91676 # 91963 # 1 # 
ID=1_114;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.323 +MNKAVEQASNALGQGFSAMVWHQVLVGLGFILLGLVLSLLVWVLVKKFHVPFNHPTAFVV +YSIMLVSIVASFIWGGLHVINPEYYAILELKGFIK* +>MW460250_1_115 # 91974 # 92090 # 1 # ID=1_115;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.316 +MTKEELEQKVKELEAENKELKKQIERFEDEGGKTKDEQ* +>MW460250_1_116 # 92080 # 92343 # 1 # ID=1_116;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=11-12bp;gc_cont=0.280 +MNSREKKILTLTVNNFLMLALDIVALVRYKKGKIKQENYNTGQISRTIVTTANSLGILYL +EEQERKEKKSVKIGTLESGTLRGFKNK* +>MW460250_1_117 # 92420 # 92599 # 1 # ID=1_117;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.283 +MKHFILILGIVILVIALGIVLPAWILQLVLSAFGVKVSIWVCIGIFILISAIGSMFSRN* +>MW460250_1_118 # 92614 # 92877 # 1 # ID=1_118;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.330 +MAKYESNINGENYIATPSQALREALAKLITEEKSFAEYQTKGEEQYESQLQLRHFDTMIS +QYEEAIRVLEDKYRPQIFIPKDNKEEN* +>MW460250_1_119 # 92880 # 93197 # 1 # ID=1_119;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.283 +MKAESIARFFNDKVLQIEGYKVRFLQASSSYILDIDTIDESVLFLEAQVSTLSGKHLLDT +AITIERPETLSAKELYTEISNKLQAIVGDQTKTTIELSRYFKEEK* +>MW460250_1_120 # 93198 # 93878 # 1 # ID=1_120;partial=00;start_type=GTG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.267 +MSNKTITNYLLNLEGIKGETYSIIAHINKQTGWGDKGDYFEISISYKADKDPRTTRYITT +EIFVDYGSNNPKEILLQLRDKIFSIVEEQVETDNDFIESIKEINSTKELEKLKPYINNEY +YSMFKSSIEKEIPVALSSEVLNRCTGKTSTLAYLALEKDLPLVVSNEPMRKMLKNKFPHL +RVASAEDYSNYDIKGEIVLIDEVDIDQLYSADKVSVDALLVGIIKN* +>MW460250_1_121 # 93967 # 94125 # 1 # ID=1_121;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.277 +MIPVIVILIGLILFLSSGYKLVLGKYYDDVDLKILFTIFGVGIALLLGGFIL* +>MW460250_1_122 # 94160 # 94360 # 1 # ID=1_122;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.318 +MNYRDFITDCISGGYNVHISVTEKRVHIISEMTSASYPKKEINLDELQAYVYYMNNFGSQ +ITTEGL* +>MW460250_1_123 # 94361 # 94651 # 1 # 
ID=1_123;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.278 +MELVINIVAVLVGMYAIYFYVTKFSTGLSGILIVLGMAIGLYFYLDYLNVRENVIRLVSV +MFGAFLFSIEMIYNKIMFEIKKSNVQKTVRVYDKEQ* +>MW460250_1_124 # 94743 # 95072 # 1 # ID=1_124;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.267 +MYPEIDVEELAYKLKSTREYLESITTKEVEIYEIYHLKTGKLVFKGEYIEVKELLRKMYK +ENLTLVDVDTMLSIGKGFIDVIKNISAENVFQITYKKELSTKWLKYFQK* +>MW460250_1_125 # 95048 # 95956 # 1 # ID=1_125;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.285 +MIKIFSEVDKEYKPIITEKFPNGEINFKYDDLKYLVEEDLRFDVFFKWENDADLMHLYMF +TKYLEQLGIKDKAEFLEIAYLPYSRMDRVEEGHNNMFSLKYITEFINNLNYKSVWVAEPH +SPVTEELLTNSFAIDVTLKLLNQYIEMSEEPVTIVLPDKGAYDRYLFDVERILMESNIES +YSIVYGEKKRDFETGKIKGIKIIKDKNTLYDNCIILDDLTSYGGTFVGCKKALDKLKVSS +VSLILTHAERAFAEGALLSSGFKDIIVTDSMFPKNNWEKAIAKHRARINGTELQIKDIER +YL* +>MW460250_1_126 # 95974 # 97443 # 1 # ID=1_126;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.323 +MLNPTLMCDFYKLSHREQYPEGTEIVYSTLVPRSNKYYEHSDNIVVFGIQSLVKKYFIDM +FNKEFFNRPKEEVINEYKRTVKFTLGQENPDAKHLEQLHDLGYLPIDVRALKEGTVVHPN +TPVMTIENTHSDFFWLTNYLETIISTQTWQAMTSATLAYDMRKMLDKYAMETVGNIEAVD +FQGHDFSMRGMSSLETAQLSSAGHAISFKGSDTVPVVDFLESYYNADVEKEMVVASIPAT +EHSVMCANGNYETMDEYETYKRMLTEIYPTGIFSIVSDTWDFWGNMTKTLPRLKDIIMER +NGKVVIRPDSGDPVKIICGDPDADTEYERKGAVEVLWDTFGGTETEKGYKVLDEHVGLIY +GDSINYERAQQICEGLKEKGFASINVVLGVGSFSYQFNTRDTHGFAIKATYAKIKNEEKL +IYKNPKTDSGKRSHKGRVAVYKDGSWEDNLTLHQWLNKQNVNQLERVFEDGKLYRDQSLS +EIREIIKNN* +>MW460250_1_127 # 97522 # 97767 # 1 # ID=1_127;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.305 +MIYKISKHNYYSRFEHSTYPPDEGFAYVDYVDVILIGVDNPRKRKIITLKVNEFNPDDYR +VGHKYNIIKILWFEKWEWLKP* +>MW460250_1_128 # 97787 # 98179 # 1 # ID=1_128;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.272 +MIIDKLNGVKLEIGGHVVSFSVRKFNTINGERQLIDYHHIKRNRQQYFRTTEEFYNEYKE +IKPDKNEIDEMFESLGYVDTELDDVVRNQEKVTEILGVSEQYLNQLSYKAIEEYVDKVVT +LEIKELKGEK* +>MW460250_1_129 # 98181 # 
98402 # 1 # ID=1_129;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.342 +MNNNWEKEGVNYWEKEGVNYWENEDCPREYLEKAFIDLVEYVEGVTVPPKDVKQLREDKL +REDIGFYEYVADK* +>MW460250_1_130 # 98468 # 98779 # 1 # ID=1_130;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.317 +MKKLIVLLTITISLLLGGCSPDNHEGKVVGVGEYREPTTYIKSGSVTVPVIGEMKYYVDL +ETDKGEDRVYLNKEVYHKFDKGDDFSNVGEKVYKNDELIYKGD* +>MW460250_1_131 # 98782 # 99291 # 1 # ID=1_131;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.269 +MYLNDYVGKFIKEDNYYGYQSTDLVSNYVQRLTLGRYKTKLNANKMKYERLPSSWKIIKA +KDLLRTDDYREGDIFVSERISVFGFNGIIVYNHDFNNVTVITQNRDGKATNPVEEHLYPK +KDIDYIIRPIERDYREYFKKSDSKEKVTLSKQEYKKLLEAYNKMKEVFK* +>MW460250_1_132 # 99293 # 99622 # 1 # ID=1_132;partial=00;start_type=ATG;rbs_motif=4Base/6BMM;rbs_spacer=13-15bp;gc_cont=0.294 +MNSTKLVEYFTNKQGKSLILPDENKVELYRVDVTPYTMRLNFTYNTEVVAIDIDKLHSDS +IEMHIPQGLYITTVVKITSTQSISSVLHKVLEEWVRQVQNDGIFGFVWE* +>MW460250_1_133 # 99628 # 99822 # 1 # ID=1_133;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.226 +MISIEHDYTIRTVDNRKYTYYSKYESLVTLYENIMSKDCIEVTKYGKDKKVIIDTRHIVS +IERW* +>MW460250_1_134 # 99846 # 100160 # 1 # ID=1_134;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=11-12bp;gc_cont=0.273 +MINAGHAKYLSEIYEDDVHYETIDSIVEDILDNINDGIIEEAMKGNTSYQYVLRDLRVDN +EVEYRVIEELTNQGYSVNHISNDIEYPSISTNNLAGLDYLNIKW* +>MW460250_1_135 # 100175 # 100342 # 1 # ID=1_135;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.238 +MINKYKKLWDEITQQIVNVEIINFKNETVTIESTDDSGLSEIRGFEEVEFIDYYG* +>MW460250_1_136 # 100379 # 100480 # 1 # ID=1_136;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.284 +MDLFAKIIIMSIGVVPLLTIIVAQLITDYHDNH* +>MW460250_1_137 # 101353 # 101652 # 1 # ID=1_137;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.290 +MIDIYLGEGYNKEYLSKALRLINDHAPRELSYDFNNVEADVNIHTMLYVKPEDRFIYKDI +SYYFPGDLIICIVDDDAIVYHQGEQISGISILRILEEIF* +>MW460250_1_138 # 101668 # 101853 # 1 # 
ID=1_138;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.247 +MIGITILITIMSISTISMYIYFLVDLIQSIRYNSFDKVINVITFVLMTVIIASGILAILG +I* +>MW460250_1_139 # 101960 # 102250 # 1 # ID=1_139;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.309 +MIHIFVKEDYNKETLRSLLEYINDTVGRELTYGINTDYDKDVVIETDDPIDEEDTIELSG +TNMFKDDLCILIEELYCKAFVNGEPVIIRKYVEEML* +>MW460250_1_140 # 102250 # 102537 # 1 # ID=1_140;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.281 +MIIIFLTEKYDAKALKKVLEHIDNCSSRGLSYLMGKGEADVCIEKNVFRERDDVRINSNI +IDEGKLCILINRHGLECSYYRGISCNIGSFVKERL* +>MW460250_1_141 # 102537 # 102830 # 1 # ID=1_141;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.327 +MIEIYLSENYDKNLLKAELKWIKETASRELTYDVNRSPGLDVYVNPYRCTKDEVEEWSTL +PPFEDDILVFIAETWIHEYLKGESIGVDSMEEYVKEM* +>MW460250_1_142 # 102834 # 103091 # 1 # ID=1_142;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.283 +MFKVYYTVYHRGSMKTIKDKLDRSSLIYFLYDTWYKDISNVFPNHYNKEFGSKSDDIDID +KLIEAVNEEGILLINRGNYVTIREW* +>MW460250_1_143 # 103169 # 103408 # 1 # ID=1_143;partial=00;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=5-10bp;gc_cont=0.312 +MDTLTYTIIHKESDRVIASGLNETETMNLVQRMINTNLVTDISLDDYKRRPHGKIDVVNL +LVDIRRQGVFDFNHIWHVG* +>MW460250_1_144 # 103419 # 103784 # 1 # ID=1_144;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.311 +MIVIYTDVSKDYLKDEFLPWLNERDRYLEYYKDELPEDIDSSYIVSVVYCKDMEGLLERK +DIVLDNSYNEPVALLGVPEFFGNYSNYFYYRGESISKHDLGEIVRLKAWQRMGGDWLSSS +P* +>MW460250_1_145 # 103975 # 104313 # -1 # ID=1_145;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.198 +MKINYIPMWDNEDVLQYAKSQLLVNELETKEIIFKNYQISDDLDGGTDKKYYEIYESKFY +VDEETTKEEFNNLIIENEKLIKEYKTQNGLIKNLIKSQHEVNEFEYNVINIL* +>MW460250_1_146 # 104624 # 104932 # 1 # ID=1_146;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.366 +MDVKEIANTIMELWQMDGYRCAEPPLYESTLNHTRTHTALIVSINGNYDTVQMFRKTPIM +SMRGQSQPASMLVNVIDDVIIIVYENVVYGVQNKEIKFIEEI* +>MW460250_1_147 
# 105138 # 105422 # 1 # ID=1_147;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.337 +MTNKNYLYEETHTVQGQDITAFRIPNDANGNPRYVVHFMDLDIKLADYDNINKLYGFKKY +TAKWFGGGVVFQSYNIADTLEYAYTQVKTNRISQ* +>MW460250_1_148 # 105497 # 105688 # 1 # ID=1_148;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.177 +MKFKIEKNNSDIKTLWNLAKNGYMSYQTVHNIFKNESDEFIIFNSKQTYNKFMKLRYNRS +AIQ* +>MW460250_1_149 # 106005 # 106493 # -1 # ID=1_149;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.276 +MVENKINEIWKPIKKEYFNKYNFYVSNLGRVKINDRLSKVHQDRDGYLTVRVNNKKHMVH +RLVYEYFGNDFIKSNHVHHIDGNKQNNCIDNLECISPSEHNKRHHKDNTFNRYNRGYALT +EDERKAIASKYKPRKYTQPMLAKEYNISEITVRRIIKKYKKD* +>MW460250_1_150 # 106661 # 106819 # 1 # ID=1_150;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.245 +MIKFKWKNKTIKSTQKTDNILLLIIGGLVATVTPKLVNWFLLLQDNINIFLR* +>MW460250_1_151 # 106889 # 107020 # 1 # ID=1_151;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.242 +MKKITTTLNLIGMKNNERFTEELKNYRQDVTFLKANKIVKYSK* +>MW460250_1_152 # 107188 # 107424 # 1 # ID=1_152;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.350 +MKLINRDNEIVISIATLESVKQALIWEYIDHLDNNILDKEIHDQEAVVITSDTLQSLKFA +DTMEELEEYVNDIGWKLV* +>MW460250_1_153 # 107504 # 107974 # 1 # ID=1_153;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.380 +MTNTIQAFLQGQEASTVKDVATHGVQSGAIGKLIYTSDVVNFFDSYEQDIEAVITEYIEE +VTGQQYYDLLNYELMRDLENYANVEFEDEDEYNNIQFDLAENIASDEVEGFEDMDEADRA +EAIYEAMDDVELELQETDKVQYVNLAVEIVAQRMAL* +>MW460250_1_154 # 108004 # 108129 # 1 # ID=1_154;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.310 +MNNNTTSYSNSPYGSLEELREAYDLSSLSTGEIKELIQTFV* +>MW460250_1_155 # 108214 # 108393 # 1 # ID=1_155;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.256 +MRNLLEQEQLEKDVKDIIWVLDRMIAKGEQYTEAYDILVNKLERQEKRIVEIKKQNGIF* +>MW460250_1_156 # 108727 # 108963 # -1 # 
ID=1_156;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.287 +MKLYQVEHDNCEPYEDNFHFREDKIYTDKENLIKRIKEEGYKEETNHRGEQEFIKGDPRD +FYGMDMITIHELEFVNNT* +>MW460250_1_157 # 108965 # 109450 # -1 # ID=1_157;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.300 +MNKEQAKLKLETSIINYENQIKFLDPASMYTRGLIDAKGYSKLALKELEDTGKHSYEDTT +WKDSYAKVFTDEEILEFLLSKPRVTFKGNQEKLDEIKKEREKIQKEATKDLPKGSPLGDL +SKENYEKFWGALQWSREEREKLTQESRAYYENYLKKIKENK* +>MW460250_1_158 # 109463 # 109870 # -1 # ID=1_158;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.282 +MGLDFEVIGVTLSNRKVEQKGLQHFINNARYRHILEKNYYKGFNFEDDFRKPGYFMDLLL +RDAETYYDEFEEWCEGVFVLTKDKLVNLMKNEFNEKTFKGTHDAEYYYRLMSHIYNVEQY +EGKFYDFYLIMSVNV* +>MW460250_1_159 # 109870 # 110301 # -1 # ID=1_159;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=11-12bp;gc_cont=0.315 +MENYKNFIIEEMNKAHILVTKAEQIKRNRKLAETELEEVYKKAEAFDEIVNELLYQLQNL +ESWDTLDQKDCQTLKQILEENIKEEKQLKRYKVKRTITTEEVRYIDAETEEDAWYSVEYE +DEGADTAHYNAEYGTWSYEEEEK* +>MW460250_1_160 # 110304 # 110495 # -1 # ID=1_160;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.328 +MIEISISWTYLISFLLLWSAGILYINYLVYRIRLTNKERKEMSKEHHRNREEIKQRIENR +RDK* +>MW460250_1_161 # 110489 # 110977 # -1 # ID=1_161;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.286 +MNKTFFKFLGKNTLEYSKQGLGFLVALPIMLIIFSVFLAFIIGIPAVIIYALHALNVDND +FIIQLVPVMWFIILYGIVRTGEHKKPFVKLKLKDYLLSILYLTTITAISVLENYLLFQSL +PFTGDVRAVITLLSFIVFVAVNRGICKIAIKSYKEYKEDSQW* +>MW460250_1_162 # 110970 # 111401 # -1 # ID=1_162;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.282 +MNIKYIDLVLENCDVVRLEPKDVSRFHISGITEGIDYYGTYKGTSSISRTRHCTYFGILI +DKPMEIPQVGFAYPDNTNAYEMITAYSDITAIDIIYENDANEYIYVDFNEYNDNYNINQK +NDYYNNMLEITITESNSKEEEDE* +>MW460250_1_163 # 111415 # 111957 # -1 # ID=1_163;partial=00;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=5-10bp;gc_cont=0.243 +MDKINLNKKHEGSTVVNISNNITLKIQCTDLRKECDDSEAPTTYTHFKAYIVYNIFIVVN 
+DRKQKKKVKYDCYNDHVGRGNVKDLLKVKDVIFQLSTQLNTNEIIKISGADERRYKIYKY +FIEKDIRFEDNMYYSKSNIWIINNFSLLQKFQWNTVVTKDGDYNKKELKKVDKEWKELLI +* +>MW460250_1_164 # 111969 # 112457 # -1 # ID=1_164;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.315 +MRETREYIMFWGKEDIYSNFYPIKFKHQGRTFNNSEQAFMWRKARYFNDFQIAGEILNAK +NPNHAKSLGRKVRNFNEEQWNKVRYNIMVEVVKDKFMTTHLKQRILDTDVRKDFVEASPY +DKIWGVGLKANDPKILEQSNWKGQNLLGKVMEDVRVHCIYNK* +>MW460250_1_165 # 112470 # 112838 # -1 # ID=1_165;partial=00;start_type=TTG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.257 +MNDFEKEVFGIKKNKKYKKMKKKLGRNEPKYWNYDMSFFIQLYADLNAFIESSNHVDMEY +HTFVDVDGKERTQIDMIKHILSLIKYYHKEMDDFDMDKYDELEQVQSKILDNFKIVLPSL +WN* +>MW460250_1_166 # 112805 # 113572 # -1 # ID=1_166;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.289 +MAIYVVPDIYGEYQKLLTIMDKINNERKPEETIVFLGDYVDRGKRSKDVVNYIFDLMSND +DNVVTLLGNHDDEFYNIMENVDRLSIYDIEWLSRYCIETLNSYGVSTVTLKYSSVEENLR +NNYDFIKSELKKLKESDDYRKFKILMVNCRKYYKEDKYIFSHSGGVSWKPVEEQTIDQLI +WSRDFQPRKDGFTYVCGHTPTDSGEVEINGDMLMCDVGAVFRNIDFPFIKLEVKKWRKNI +LKVLNWMTLKKKFLG* +>MW460250_1_167 # 113672 # 114226 # -1 # ID=1_167;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.283 +MMVNVLPSVYDAEKGEWVTLLAKPIAEEVLKIMKADYLEHKGNIGFFISKYKDGDSSIEQ +PNVVVFYNEKDYDTMELTESELTNALNEYIDYTLDGKYKPFSLNNFINYLEDYGYRLPVN +FEVDVTIILSDGQKFTYPRTSSITNNASIVDALKSEDQYIEVKYIYNDHAIDDKKLAHGN +DTLK* +>MW460250_1_168 # 114242 # 114559 # -1 # ID=1_168;partial=00;start_type=GTG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.283 +MERTLNLYDSKGKLLKSSEKITGASAKIIIEKLTPNTVYSQGSFKISWTINGKESILTDV +PEFTTKSNEDKQEIVFNTLNIDSNSFVVSETEPSDKSKLWFKPIN* +>MW460250_1_169 # 115539 # 116093 # -1 # ID=1_169;partial=00;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.238 +MKKIYILEEEIEEMDYDLWEEDTVYTTSYEILGYTDSLEDAEYIRDNYGTSNPIFINEYP +YITKEKLIEEQRYFRYNSYIELKRVNGYFEISEINDLQVTEDFSINKDDKNFDSPFSINM +FSHNRNSIGIEFIMFSEYDDKEDIIEKEKNSFLMKLKYLLKHSKEADIRSTSKIIDSIDK +LTWH* +>MW460250_1_170 # 116097 # 116315 # -1 # 
ID=1_170;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.260 +MKNIINFLVDYNINFSYSEDSLNVMNNSYLVDKHGTQDYEIVGNYEHITGVFSYQTEEEV +IAKLKNLIGVWE* +>MW460250_1_171 # 116316 # 116510 # -1 # ID=1_171;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.272 +MRDKRIHSELLYDIIGKHIQEEENITPYIEAIYVDMMNIIVVEYTFYNENGTRMLGQYPI +GEVM* +>MW460250_1_172 # 116500 # 117237 # -1 # ID=1_172;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.255 +MNLEKSFLLSTIEFGSTYQGTSDEHSDKDYMSLVVQPLSDTIFRNNEKASKHTEVSRYYA +VERFISLVLKSGFDNVLNLCAQLEQAKNTRFNKTVLDLFYDDFIFLTYVRANFKPIAYSV +IGNINNILKKGELTGKDLVKFYTFYNHLEYYNDLLDDLDNLNVSYKDFAKVKYMPKEVLD +NKRSNVSIEKKKDLVNKVEPLIQEVKDKLKSNESNIKHYKDAMELVEKSLKDKTVEFLTE +VYNER* +>MW460250_1_173 # 117300 # 117404 # -1 # ID=1_173;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.238 +MKYILGLITLGIILFKVYEHFKYKQDEVDTEEDI* +>MW460250_1_174 # 117416 # 117655 # -1 # ID=1_174;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.279 +MDFYQFLNHENVRVNSITPSQKNFIRENLELTNLEDTDIDFISSKQAKEEIEKIIRIKNE +EEYDIAMDALAGWVTKHGY* +>MW460250_1_175 # 117657 # 118046 # -1 # ID=1_175;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.318 +MFKKAPQYIMEKVEKENNILGEDLSLDIYYKGVKLTVKRHPETGHLNGYITLPSDINEKE +YDSLERRAHRGITYDDYDYEGKRVLGFDCAHAWDMTPYAIIGSLDDQYRDLEYVLSILKD +MAEYVKKDE* +>MW460250_1_176 # 118145 # 118318 # -1 # ID=1_176;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.328 +MEKVNHEFLAELAKSNSPVLNSKPLQDGDYNIEFDYDGFHFEFSQKNGYWRWSYNAK* +>MW460250_1_177 # 118272 # 118841 # -1 # ID=1_177;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.289 +MANEKEIIRMVNYLIDNMSMWHINYARAVLIPSEVEKIIKEHEKFDDLLKKRGEWLVKGS +DTDNIDDLETYNQIMNNQKDEMMIQEIDIYTQGKTITIDNEHYSSDDLGEVLNKLEQSED +IKIKSNYKSLYVGYTNVVGYEVTYASSYEETFKNDLEKDLWYISTVRIIRRNKPYGKSKS +WVSSRIGKE* +>MW460250_1_178 # 118891 # 119433 # -1 # 
ID=1_178;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.276 +MDRIIGKHNLTQDLRLGDKVEVYDAHKFKENEDGTIELGDKITEGIVVDYKGDFTGNTSG +LVTLDSSEKELIIGEYNFKLIEEGNLQAVYDSVSKNKVESLSEDYDMYRKLLGVKSGELA +GIEDELEYLVRQYNSKVDNYNGLLTLSKEKARELSLLTGDKKMIPHMKNRRLELGTEADF +* +>MW460250_1_179 # 119433 # 119966 # -1 # ID=1_179;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.270 +MVYDSIISRTMAVSILNKWIAELITDVDLDKCKFTEEEYGKVVTNSINKIQDVLIEKNYE +VTDGELYDIVCTELINPIKNNTEEEKHNEKNDLLEHLEDLAFRHDIDLGYVSDGSYNLTV +THWLMQDEFTDVNIKVNNDEDFYTVTIPESKYFWLPITKENLEMFLTQDPINKGEVK* +>MW460250_1_180 # 119969 # 120133 # -1 # ID=1_180;partial=00;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.267 +MKNLIKLLSMVVVTILTFSLTYVILKKETNNKRNGVAPFDFSLEDHIHLNKEIK* +>MW460250_1_181 # 120136 # 120411 # -1 # ID=1_181;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.264 +MANNIWAVVLSIVILLIILLILWFLFRKKVNGSSKNVEIQKAEEDNDNKEQEVEEAQYRE +LNEEEKEKNENSSKDYKYDKEKVKNKLKELE* +>MW460250_1_182 # 120411 # 121256 # -1 # ID=1_182;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.307 +MGRRLIDNSELNVIKYDGLPDFFSALKKNRVSGRDNSSDTGSYDFTGTHSFQEAYNLMVK +GDRESYDMVVKLKKMTDALFRMDKSVKRKPVVAPEGYQPHVPNAIKGLPNSMMSQQRVKA +EKKVIDVFYNSSISWMEDPENLAYRGAIMLSAIQTLETKGYSINLYLGKLSNSGYEDKLT +GFVVNIKHSYQRLNVFKSSFYLVNPSFLRRISFRVLEVEPDMVDLTNHGYGSVVSKSSYG +NKLTEHILDNAVIFDSSVGIDINNDSSENLRAVKKLFGGRL* +>MW460250_1_183 # 121268 # 122386 # -1 # ID=1_183;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.323 +MAKQDTIERLERLVEQQMETTKDLADKLGEKNSNPYEQAIVDAIVEKAGTESREIIITDV +KKQIEEYVEEQLSNLPVKIELQQEGKTIKDISGIFHYRYQDILKLVNQNIPVFLKGGAGS +GKNHVLEQVAEALDLDFYFSNAITQEFKLTGFIDANGKFHETQFYKAFTKGGLFFLDEMD +ASIPEVLLILNSAIANKYFDFPIGRVTAHEDFRVVSAGNTMGTGADHIYVGRQQLDGATL +DRFAQVEFDYDTKVEHQLSSNEDLVNFVQQLRHENDEKGLPYVFSMRAIINGSKLDGVME +DEFVVESIIFKSVPKDEINQFISSLPEGNRYTEATRKLLGMQQEPKQEPRKSDSTSKDSM +DFDTIMDKLGLE* +>MW460250_1_184 # 122540 # 122866 # -1 # 
ID=1_184;partial=00;start_type=GTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.254 +MSKRTDNFIYFCKYYFSEYLPSLGVEVLNHNETSHGTMEGVRKYYIANILYEGQELTVTI +DLEEFNNATSMHNMLEIMNNHTYNCMFMYDMDTHETKDIDDFFKLMYF* +>MW460250_1_185 # 122859 # 123275 # -1 # ID=1_185;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.273 +MNAKEFMKTQAQVEDYLDKLKVTIIEDALSVSKEWSNDSNDLGYALSSLGESIGLLEDYY +NIQVDAHLPEHYKGSKDVISFLEEHFSYDGFVDSMIFNIVKYTTRLGRKDAVDKEVQKIK +TYYVRLERNIKYGDSTRV* +>MW460250_1_186 # 123409 # 123711 # -1 # ID=1_186;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.300 +MEKVELIKQWAKDRNLQTGKPEGQMLKLLEEAGELASGIAKSNDHVTRDSVGDIFVVLTV +LCLQLDIDIEECIDMAYDEIKDRKGKLINGVFVKEEDLKK* +>MW460250_1_187 # 123711 # 123899 # -1 # ID=1_187;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.254 +MEKFQEDYVNIDIRVKAYVRVGYRYEEDITNNLHELVEDNLNVTSDSDNLIIKDTEIKGD +IE* +>MW460250_1_188 # 123943 # 124104 # -1 # ID=1_188;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.290 +MVKPVITLEPEDVKVLLDYLSFLEDDMRNYEGMRELYEELHKKYQLAKGNYSD* +>MW460250_1_189 # 124104 # 126152 # -1 # ID=1_189;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.257 +MAITYKQKGLTEQEIINLPKVNKGCIYIGEEDVFLKKKKNNIINLGSKELFRDIHNIFSF +DTATEIHLFLALCGNKEVTNFEGNPYETVEKLVEGVVDDNKGRSYKEYIESNREERKDFP +VYGYKSRRRIQSKGYVEEKIKELEGNDHLWRNESRQLEEYKKVVDSLNNDIMDVLDQGKY +GLIKSSIIVMNEDIEKGSSEYYSAMTDELYSRVWYMHPSTENYSSFGLKVKHIRDKHNMG +NKWVLENKSSFDVKTGEVKVFLTDSLVNKEITLNLYKDDISKSEYKNELTLSVLLNVILK +NYAQPNLTRGIIIKIIEQTLEHHNFDFSSWCPDNTDVYGHINYRGDKYRIFIGENSTSNY +LITLTDIVKNIDKINNLEEFGLFERNALLFHIPKKPKWKVHEAFNLTKQTYKKLLTLNKF +EQGNYLRFANILYKHYNHLHNEVNLHQLFDDTFLMVRDSRDVTDALKVKPIVNQILSISF +ANYKKMTHYLDVDAQDRQRITGYALDNYYLDYLHDLSILIREGYRTLESVSLTPFSLKLE +HDIVTDEKQSIQQQLDDAELKAKYDNKLEKIIDKTYKLKDGRKVKFLPADTVSKLKDEGK +MLSHCVGGYANRILKNSCLILLARLEEDLDNSWFTVEIRITDNGYVLGQQQSIDAYKLPN +ELKEALEKDIKKINKEEFKEVA* +>MW460250_1_190 # 126230 # 126493 # -1 # 
ID=1_190;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.258 +MSIEKKEEVIAHNEVVFRSLTQGLYVKEVDIYSDVVSYTKDVDEALAMPNTINFKNSRKY +KKLIMNLDLEPLNKIQKVIYETHLEGL* +>MW460250_1_191 # 126510 # 126683 # -1 # ID=1_191;partial=00;start_type=TTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.270 +MNDLIKEGNKYYHKVRAGETLWTISKNYDVEIKKLQELNNIKSVSLTNLEYVLVCVE* +>MW460250_1_192 # 126690 # 127268 # -1 # ID=1_192;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.273 +MDNLSHYLSILYAILITVGYIPGLVALVKAESVKGVSNYFWYLIVATVGISFYNLLLTDA +SVFQIVSVGLNLTLGIVCLLVASYRKKDYFSIPFIIVFSLLLFLLSDFTALTQTVATITI +ILAYVTQITTFYKTKSAEGTNRFLFLIIGLGLASLIVSMVLTHTYVHIIATEFVNFVLIL +ICYLQANYYSRG* +>MW460250_1_193 # 127261 # 127887 # -1 # ID=1_193;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.308 +MGNKIKDKVIYMGGHILNEAMVDYRDKQHKEVDGIVGVTPYSPHKDKSINDKANAEQTKL +AERILTNDFKAMQESDIFVFDILNEGLGTIAELGILLGMKHQAEETINHIYDNGEEYFNY +FTNKFETSLNTEEELIVDKLENIVNKPVLIYCSDIRQGHGKPYNDPDRAEFSTNQFVYGM +VLELTDGEGFISWEEVINRLEKLGEQDG* +>MW460250_1_194 # 127880 # 128776 # -1 # ID=1_194;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.265 +MKSYTKVKNKGIVLDKFKERGLVVQEKLDGSNASFTVENGELVCFSRRKKLNENETLNGF +YDWVHENINVRNTYVSALEKYIIFGEWLVKHKIQYKEEFYNNFYVFDVYDKENEVYLSVE +DMNVIAHHLGLKTVKTLLVSKPSHYLNDLKPEEIQELVGKSDMTVKPDKGEGIVIKYLDG +KSEYDDYFKLVSNEFKEFSRQKMKTEVKKNESVADYAITRARMEKMIFRAIEEDRLSEDD +LELENFGLIMKQVGQNFVDDIMEEEKENILKIVDKQIKKKMPHILREILEEKGDTIDG* +>MW460250_1_195 # 128776 # 129000 # -1 # ID=1_195;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.244 +MNYLAKVFINNNWLVKLITIVLLTLFLSGLVYVISAISLFLSTVLNLPGLVVLAFLASVS +LILFSIVHNSKEDN* +>MW460250_1_196 # 129069 # 129809 # -1 # ID=1_196;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.306 +MAIQLKELDFKLKDYPNVRYNMGEHLVFNEFLEKATTEQLDFCEDFFNDNVEILWNESQA +GTGKTMCSVACAYADYLNKNRKLVFIISPVSEDLGSRPGNQTEKEMAYFMGLHDALIELN +MNPEQQITEMLMMEDNVKEDKLGDCWVSQISHLFLRGGNLRDATIIINEAQNFKRSELKK 
+VLTRVHTKNSTVIVEGNFKQIDLKNESKSGFGDYMEYFKNYEGAVFHNFTVNFRSKLAQY +ADNFKW* +>MW460250_1_197 # 129861 # 130475 # -1 # ID=1_197;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.289 +MKKINSVIKGEGKKVQTADVRKISYYVKDYNPCMTVDDANDYNATSQYLVSDNGKFIAKY +NKDMNAVGFYEESGDTVKHLTHTTPERLEGTVFTIEEETEIDLINDTLPQGDILIKFSDG +SIYLPDNESVLDSVNYLADNDWDSVDDIIYTGLSKGNSENCIVDFNYNNYDIGYDDVEDE +DVCDNYPECECSNYCSSTGEYIGN* +>MW460250_1_198 # 130491 # 130916 # -1 # ID=1_198;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.298 +MQDSVNIYTDGSSSYNKGKVGSGAVLVSKEGNIISEISKSVDKPGLIKYNNVAGEILACC +YGIEEAIKLGYNQAIVYIDYIGLIHWYEGTWSARNILSKTYINMIREYQKVIDINFVKVK +SHSNDKWNDYADNLAKKSIDI* +>MW460250_1_199 # 130906 # 131097 # -1 # ID=1_199;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.281 +MKKGVFTVIADGFKFNVIAKDKKEVQEHCFKCFDFNYISVSFCREVYSDCEFPQFMEDYK +YAG* +>MW460250_1_200 # 131120 # 131761 # -1 # ID=1_200;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.255 +MENNNLVNFLMTTDDIDDTIEMVDSFELQDINKVLGEDTFLTIMEITDSLPDNQYKIVLL +SSLDKLLNTDRKELVEYDEEFPTIRKHNVSELKRDTVNSVIDSYMNTNVEILYTEYPTIS +NYSVVVDSVKVLNTLYLIESKNGKIEATLSEDGEDLHEYISEEGYSVTDILNKFDDVEDL +FDEDDSLINFFSDIDEGKNKTIKSFIELVINLK* +>MW460250_1_201 # 131751 # 131981 # -1 # ID=1_201;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.329 +MDEKKESKPLNLQKIRVEKGHTLRSLASEIGVHYSLISYWEYGKKKPRSANLMRLEKALN +TPGKELFKELEEDDGE* +>MW460250_1_202 # 131984 # 132211 # -1 # ID=1_202;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.263 +MNKFKRWFRINVLKKETLLFKVYWRYESPSLKKPHVFHIELYAKSKAEARNKSQEYILKN +AKASEDFKFLKVEEK* +>MW460250_1_203 # 132321 # 133013 # -1 # ID=1_203;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.348 +MKKTIFATLALGTAITFGGIATNEASADEIDYNKLAEQAKSNSAEVNTKPIQAGNYDFSF +SDGEFTYHFYNYNGNFGYEYHSGSTQVDNTVSRLAGEEQTPEQKVDQQQAQFDTQNKQDT +KKEVQTTSAPVQKETKQPTQSTSSTGGSVAEQIRQAGGDEAMIEIAMRESTMNPNAVNAS +SGAQGLFQGLGKSWSGGSIAEQTKGAKQYMIDRYGSTSGALAYHNAHNSY* 
+>MW460250_1_204 # 133200 # 133835 # -1 # ID=1_204;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.280 +MIGETINKLKVIKESSKRDKSRCKMYECLCECGEVIIVRSSTLRQGKIKSCGCESNKIHS +ELMRERNTTHGLSSNPMYQRWLGMKQRCYDVNAINYKNYGGRGIEICEEWKNDFKKFYDY +MGDPPNENYQIDRINNDGNYEPGNVKWSTRSENSTNIRKKSTHNIYKKSNNVYNIQIVRK +NKVKYFSAKSLEEAIELRDNVINKYNETGEW* +>MW460250_1_205 # 133902 # 134693 # -1 # ID=1_205;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.298 +MRKSVVISGVLGFLAIIGFIILLMCITKIPQGHVGVVYSVNGVKEDTKSPGWHLTAPFDK +VNKYPTKTQTHKYKDLNVATSDGKNIKLDIDVSYKVDATKAVNLFNRFGSADIEELEKGY +LRSRVQDNVRQAISKYSVIDAFGVKTGEIKQDTLNKLNDNLEKQGFIIDDIALSSPTADK +NTQKAIDERVKANQELERTKVDKQIAEENAKKKEIEAKGEKKANDIRSESLTEEVLQQQL +IEKWNGKQPISIGSDSVITNLNK* +>MW460250_1_206 # 134693 # 135001 # -1 # ID=1_206;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.275 +MALLLTYFAIFIVFLVLVGFGISYLFDFLSMKEKKSNIRKQYRELVRQGTLDEYGLEQYV +KYKKQFLNDRRQSIVTRADKQEIDQEEKALNSLIKEIEKGEM* +>MW460250_1_207 # 135114 # 135743 # -1 # ID=1_207;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.379 +MSASDAQFLKNEQAVFQFTAEKFKEWGLTPNRKTVRLHMEFVPTACPHRSMVLHTGFNPV +TQGRPSQAIMNKLKDYFIKQIKNYMDKGTSSSTVVKDGKTSSASTPATRPVTGSWKKNQY +GTWYKPENATFVNGNQPIVTRIGSPFLNAPVGGNLPAGATIVYDEVCIQAGHIWIGYNAY +NGNRVYCPVRTCQGVPPNQIPGVAWGVFK* +>MW460250_1_208 # 136014 # 136514 # -1 # ID=1_208;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.283 +MEKKLNEIPGLEIYENYTITDKGEVISYKGKEPKKLKLQKNNKGYLFVRLRYHSPKIHRL +VAMAFIPNPDNKEQVNHLNGKNDNSVGNLEWVSNSENREHAIKTGLKNEINYNIAQYDLE +GNLLNVFYTAQEALEFLGISNKRSGNIGRCIKGERKTAYGYIWKQY* +>MW460250_1_209 # 136674 # 137477 # -1 # ID=1_209;partial=00;start_type=ATG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.353 +MAKTQAEINKRLDAYAKGTVDSPYRVKKATSYDPSFGVMEAGAIDADGYYHAQCQDLITD +YVLWLTDNKVRTWGNAKDQIKQSYGTGFKIHENKPSTVPKKGWIAVFTSGSYEQWGHIGI +VYDGGNTSTFTILEQNWNGYANKKPTKRVDNYYGLTHFIEIPVKAGTTVKKETAKKSASK +TPAPKKKATLKVSKNHINYTMDKRGKKPEGMVIHNDAGRSSGQQYENSLANAGYARYANG 
+IAHYYGSEGYVWEAIDAKNQIAWHTGK* +>MW460250_1_210 # 137477 # 137980 # -1 # ID=1_210;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=11-12bp;gc_cont=0.349 +MANETKQPKVVGGINLSTRTKSKTFWVAIISAVALFANQIIGAFGLDYSAQIEQGVNIVG +SILTLLAGLGIIVDNNTKGLKDSDIVQTDYLKPRDSKDPNEFVQWQANANNTSTFEIDSY +ENNAEPDTDDSDEVPAIEDEIDGGSAPSQDEEDTEEHGKVFAEEEVK* +>MW460250_1_211 # 138065 # 138250 # -1 # ID=1_211;partial=00;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.258 +MASAKQLYYTESLVGKAIINNKVSNKEEVWDKLELLPETKLEDLDNKQMSEVIKKLNQIN +E* +>MW460250_1_212 # 139797 # 140015 # -1 # ID=1_212;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=11-12bp;gc_cont=0.274 +MKRQKMFYSSLICKECGNVFKVPRKRANKREEGHIKDIYCIKCCKTTKHIEDNRSEAERR +WDAIQEELTKDN* diff --git a/tests/test_data/overall/Standard_examples/SAOMS1_subset.fasta b/tests/test_data/overall/Standard_examples/SAOMS1_subset.fasta new file mode 100644 index 0000000..48c10fa --- /dev/null +++ b/tests/test_data/overall/Standard_examples/SAOMS1_subset.fasta @@ -0,0 +1,24 @@ +>MW460250_1 +CCCCACCCCGCCATCCCCGATTCATGAGCTATGTTCTAAGTCGATACCATTTAATAAGATAGGGTCATCT +TCTTTACCTACCATATAATCAGATAGTAAGTCTGCTTCAGCTTTTTGCCCTGGTCGTGATAGTTTAGATT +TCTTAGTTTCAATACGCATAATGTGACCATTGTATTAAATAATTAGAATACTATTTTAAAAGATTCTATT +CTGTTTGGATTAATATATACTTGAGGTGAAGTTATAGCACTTTCAGTATATACTTTTATAGAGGTTTCAT +CCATTCCTCTTAACATATAATCTATATCTTGCCTATTGTAACTCTTTTCATCAGTAGATACTAAAAAGTA +TTTAGCTCCACTTGACATTGTTATTTCAATATGTTTTGACATCTACAATCTCTCCTATGCAAATTTGTTA +AAGACAAAGGATAATATAGCTCCTAGAACAAGTAAAAGAACCTTCTCAGTTGTATCCTTTTTCTCAGTAT +CCTTAGTTTTTGTACTTTCAGCAAGTTCTGAAATCTTTTCATCAAGTCTTTCTAATTGGACGTAAATTGC +TGATTGTTTTTCACTATTGACAGCTACATCTTTATCTATACTAACTATCATTTTTCTTAGTTCAGCTACC +TCAACTTCTAAATCTTTGAAAGTTCCTCTATCTATATAATTACCTTCTTGTATCTTAGACTTAATAGTTT +CTACTTGAGAAACAAGGTTGTTTATCTCCTTATCCAACTAGAATCACCTCTAAGGTCTAACCGTTTCAGA +TTCAGAATGGATATCATAATTTTCTAAGAAATCATTGATAATCTCCATATAATTATCCGTAACGACTTTT +CCGTAAGATGTTTTTGTATCAATTTCAAACCTAAGCTTACCAAAACTTTGGAGGTCTAATTCTTTTATTA 
+CAATATTAGGGTCATCAGAAGGAAGGTAATAATAGTCGAAGTATATAATTGAGCCATTTATTAATACTCT +GTCTATTCTATAGACGTGGAAATAGCGTCTGTCTCTTTTAAAATGGGCTAGTGCATCTTTAAACTCTAAC +TTAAGGATATCCTTATATTTAATCAAAGTGGTAACCTCCTTACTATTAATTTTTAAATTTACTTATTTTG +TGGTATAATAGTTATGATAAAGGCAGTTATTATAATTATATTAAGAATAATGATAATAATTATTTTTTCT +GAGAAAATAAGCCAAATACTAAAAACAGATAAAGCATAGATAGCTGATAGATATACTATATTAAGAGTTA +CCTTACTTTTATCTTTTCTATAGATAGAATAACCTAAAGACGTTGTAACACCACTAAGTATAAAATAATA +GAAACAAAAAAGAGGTATAGACAGAAAAAAAGATACGATAATCATTGTTAAACACCTATTTCTTTTTGAC +CTATTATTTCTAGAACTTTTAGATTACACCACTAATATAACATTAAAAGCCAGTCATAAAAGTCAATTGT +TAGATTAATAATATAATAAAAAAAGACAATAGGAGGTTAAAGTGGTTGAATAATAACATAGCTATATTCA +TATTCAAAACACTGGTTATCATTATATTCTTACTACTAATTTTGTCTGTTATTAATTCCTTGTCCCTTAT diff --git a/tests/test_data/overall/Standard_examples/dupe_header.fasta b/tests/test_data/overall/Standard_examples/dupe_header.fasta new file mode 100644 index 0000000..e165187 --- /dev/null +++ b/tests/test_data/overall/Standard_examples/dupe_header.fasta @@ -0,0 +1,34 @@ +>MW460250_1 +CCCCACCCCGCCATCCCCGATTCATGAGCTATGTTCTAAGTCGATACCATTTAATAAGATAGGGTCATCT +TCTTTACCTACCATATAATCAGATAGTAAGTCTGCTTCAGCTTTTTGCCCTGGTCGTGATAGTTTAGATT +TCTTAGTTTCAATACGCATAATGTGACCATTGTATTAAATAATTAGAATACTATTTTAAAAGATTCTATT +CTGTTTGGATTAATATATACTTGAGGTGAAGTTATAGCACTTTCAGTATATACTTTTATAGAGGTTTCAT +CCATTCCTCTTAACATATAATCTATATCTTGCCTATTGTAACTCTTTTCATCAGTAGATACTAAAAAGTA +TTTAGCTCCACTTGACATTGTTATTTCAATATGTTTTGACATCTACAATCTCTCCTATGCAAATTTGTTA +AAGACAAAGGATAATATAGCTCCTAGAACAAGTAAAAGAACCTTCTCAGTTGTATCCTTTTTCTCAGTAT +CCTTAGTTTTTGTACTTTCAGCAAGTTCTGAAATCTTTTCATCAAGTCTTTCTAATTGGACGTAAATTGC +TGATTGTTTTTCACTATTGACAGCTACATCTTTATCTATACTAACTATCATTTTTCTTAGTTCAGCTACC +TCAACTTCTAAATCTTTGAAAGTTCCTCTATCTATATAATTACCTTCTTGTATCTTAGACTTAATAGTTT +CTACTTGAGAAACAAGGTTGTTTATCTCCTTATCCAACTAGAATCACCTCTAAGGTCTAACCGTTTCAGA +TTCAGAATGGATATCATAATTTTCTAAGAAATCATTGATAATCTCCATATAATTATCCGTAACGACTTTT +CCGTAAGATGTTTTTGTATCAATTTCAAACCTAAGCTTACCAAAACTTTGGAGGTCTAATTCTTTTATTA 
+CAATATTAGGGTCATCAGAAGGAAGGTAATAATAGTCGAAGTATATAATTGAGCCATTTATTAATACTCT +GTCTATTCTATAGACGTGGAAATAGCGTCTGTCTCTTTTAAAATGGGCTAGTGCATCTTTAAACTCTAAC +TTAAGGATATCCTTATATTTAATCAAAGTGGTAACCTCCTTACTATTAATTTTTAAATTTACTTATTTTG +>MW460250_1 +CCCCACCCCGCCATCCCCGATTCATGAGCTATGTTCTAAGTCGATACCATTTAATAAGATAGGGTCATCT +TCTTTACCTACCATATAATCAGATAGTAAGTCTGCTTCAGCTTTTTGCCCTGGTCGTGATAGTTTAGATT +TCTTAGTTTCAATACGCATAATGTGACCATTGTATTAAATAATTAGAATACTATTTTAAAAGATTCTATT +CTGTTTGGATTAATATATACTTGAGGTGAAGTTATAGCACTTTCAGTATATACTTTTATAGAGGTTTCAT +CCATTCCTCTTAACATATAATCTATATCTTGCCTATTGTAACTCTTTTCATCAGTAGATACTAAAAAGTA +TTTAGCTCCACTTGACATTGTTATTTCAATATGTTTTGACATCTACAATCTCTCCTATGCAAATTTGTTA +AAGACAAAGGATAATATAGCTCCTAGAACAAGTAAAAGAACCTTCTCAGTTGTATCCTTTTTCTCAGTAT +CCTTAGTTTTTGTACTTTCAGCAAGTTCTGAAATCTTTTCATCAAGTCTTTCTAATTGGACGTAAATTGC +TGATTGTTTTTCACTATTGACAGCTACATCTTTATCTATACTAACTATCATTTTTCTTAGTTCAGCTACC +TCAACTTCTAAATCTTTGAAAGTTCCTCTATCTATATAATTACCTTCTTGTATCTTAGACTTAATAGTTT +CTACTTGAGAAACAAGGTTGTTTATCTCCTTATCCAACTAGAATCACCTCTAAGGTCTAACCGTTTCAGA +TTCAGAATGGATATCATAATTTTCTAAGAAATCATTGATAATCTCCATATAATTATCCGTAACGACTTTT +CCGTAAGATGTTTTTGTATCAATTTCAAACCTAAGCTTACCAAAACTTTGGAGGTCTAATTCTTTTATTA +CAATATTAGGGTCATCAGAAGGAAGGTAATAATAGTCGAAGTATATAATTGAGCCATTTATTAATACTCT +GTCTATTCTATAGACGTGGAAATAGCGTCTGTCTCTTTTAAAATGGGCTAGTGCATCTTTAAACTCTAAC +TTAAGGATATCCTTATATTTAATCAAAGTGGTAACCTCCTTACTATTAATTTTTAAATTTACTTATTTTG \ No newline at end of file diff --git a/tests/test_external_commands.py b/tests/test_external_commands.py index 6ddda6c..e3c72b2 100644 --- a/tests/test_external_commands.py +++ b/tests/test_external_commands.py @@ -15,7 +15,7 @@ from loguru import logger from bin.processes import (run_aragorn, run_mash_sketch, run_minced, - run_phanotate, run_pyrodigal) + run_phanotate, run_pyrodigal, run_pyrodigal_gv) # import functions from bin.util import remove_directory @@ -49,19 +49,45 @@ def test_run_pyrodigal(self): fasta: Path = f"{standard_data}/SAOMS1.fasta" coding_table = 11 meta = False - run_pyrodigal(fasta, 
standard_data_output, meta, coding_table) # meta = False + threads = 2 + run_pyrodigal( + fasta, standard_data_output, meta, coding_table, threads + ) # meta = False + + def test_run_pyrodigal_small_fasta(self): + """ + handle instance where input is under 20000 nucleotides + """ + fasta: Path = f"{standard_data}/SAOMS1_subset.fasta" + coding_table = 11 + meta = False + threads = 2 + run_pyrodigal( + fasta, standard_data_output, meta, coding_table, threads + ) # meta = False def test_run_pyrodigal_meta(self): fasta: Path = f"{standard_data}/SAOMS1.fasta" coding_table = 11 meta = True - run_pyrodigal(fasta, standard_data_output, meta, coding_table) # meta = False + threads = 2 + run_pyrodigal( + fasta, standard_data_output, meta, coding_table, threads + ) # meta = True def test_run_pyrodigal_c4(self): fasta: Path = f"{standard_data}/SAOMS1.fasta" coding_table = 4 meta = False - run_pyrodigal(fasta, standard_data_output, meta, coding_table) # meta = False + threads = 2 + run_pyrodigal( + fasta, standard_data_output, meta, coding_table, threads + ) # meta = False + + def test_run_pyrodigal_gv(self): + fasta: Path = f"{standard_data}/SAOMS1.fasta" + threads = 2 + run_pyrodigal_gv(fasta, standard_data_output, threads) def test_run_minced(self): fasta: Path = f"{standard_data}/SAOMS1.fasta" diff --git a/tests/test_overall.py b/tests/test_overall.py index f4078f6..f68df33 100755 --- a/tests/test_overall.py +++ b/tests/test_overall.py @@ -93,16 +93,16 @@ def test_overall_crispr(tmp_dir): def test_overall_vfdb(tmp_dir): - """test pharokka overall crispr""" + """test pharokka overall on a phage with vfdb hits.
Also include --skip_extra_annotations""" input_fasta: Path = f"{VFDB_data}/NC_004617.fasta" - cmd = f"pharokka.py -i {input_fasta} -d {database_dir} -o {tmp_dir} -t {threads} -f" + cmd = f"pharokka.py -i {input_fasta} -d {database_dir} -o {tmp_dir} -t {threads} -f --skip_extra_annotations" exec_command(cmd) def test_overall_amr(tmp_dir): - """test pharokka overall amr""" + """test pharokka overall amr; also includes --skip_mash""" input_fasta: Path = f"{AMR_data}/NC_007458.fasta" - cmd = f"pharokka.py -i {input_fasta} -d {database_dir} -o {tmp_dir} -t {threads} -f" + cmd = f"pharokka.py -i {input_fasta} -d {database_dir} -o {tmp_dir} -t {threads} -f --skip_mash" exec_command(cmd) @@ -120,6 +120,13 @@ def test_meta(tmp_dir): exec_command(cmd) +def test_meta_prodigal_gv(tmp_dir): + """test pharokka meta with prodigal-gv""" + input_fasta: Path = f"{meta_data}/combined_meta.fasta" + cmd = f"pharokka.py -i {input_fasta} -d {database_dir} -o {tmp_dir} -t {threads} -f -m -g prodigal-gv" + exec_command(cmd) + + def test_meta_dnaapler_all_bug(tmp_dir): """test pharokka meta dnaapler bug and split""" input_fasta: Path = f"{meta_data}/combined_meta.fasta" @@ -190,6 +197,11 @@ def test_meta_no_cds_contig(tmp_dir): exec_command(cmd) +###### +# pharokka CI was timing out (>6 hours) +# These are covered by other rules anyway +# so just run as is + # def test_meta_hmm(tmp_dir): # """test pharokka meta hmm""" # input_fasta: Path = f"{meta_data}/fake_meta.fa" @@ -228,6 +240,13 @@ def test_overall_genbank_meta(tmp_dir): class testFails(unittest.TestCase): """Tests for fails""" + def test_dupe_header(self): + """tests that pharokka exits if a duplicate header is passed""" + with self.assertRaises(RuntimeError): + input_fasta: Path = f"{standard_data}/dupe_header.fasta" + cmd = f"pharokka.py -i {input_fasta} -d {database_dir} -o {temp_dir} -t 1 -f -m" + exec_command(cmd) + def test_meta_with_single_contig(self): """tests that pharokka exits if single contig is passed to meta""" with
self.assertRaises(RuntimeError):