-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
BACANNOT:SUMMARY, KeyError: 'tmRNA' #131
Comments
Hi @JavariaAshraf , I see this Thus, I want to confirm whether you are running the latest and there was a problem when I pushed the docker image. |
I have tried both 3.2 and 3.3. |
Now I am running with version v3.4.1, and got following |
This warning is fine. |
got following error: Error executing process > 'BACANNOT:SOURMASH_LCA (SH33)'
# classify
sourmash \
lca classify \
--db bacannot_dbs/sourmash_db/genbank-31.lca.json.gz \
--query SH33.fna.sig
# summarize
sourmash \
lca summarize \
--db bacannot_dbs/sourmash_db/genbank-31.lca.json.gz \
--query SH33.fna.sig > SH33.fna_sourmash.summary.txt
Command exit status:
255
Command output:
(empty)
Command error:
== This is sourmash version 4.8.2. ==
Caused by:
Process `BACANNOT:SOURMASH_LCA (SH33)` terminated with an error exit status (255)
Command executed:
# get version file
sourmash --version > sourmash_version.txt
# sketch input genome
sourmash \
sketch dna \
-p scaled=1000,k=31 \
--name-from-first \
SH33.fna
== Please cite Brown and Irber (2016), doi:10.21105/joss.00027. ==
computing signatures for files: SH33.fna
Computing a total of 1 signature(s) for each input.
... reading sequences from SH33.fna
...SH33.fna 800 sequences
calculated 1 signatures for 801 sequences in SH33.fna
saved 1 signature(s) to 'SH33.fna.sig'. Note: signature license is CC0.
== This is sourmash version 4.8.2. ==
== Please cite Brown and Irber (2016), doi:10.21105/joss.00027. ==
Error! Could not find the following files. Make sure the file paths are specified correctly.
bacannot_dbs/sourmash_db/genbank-31.lca.json.gz
Work dir:
/home/cdc-bioinfo/work/52/94dc78825a7223d7e962705ec6f956
Tip: you can try to figure out what's wrong by changing to the process work dir and showing the script file named `.command.sh` |
Hi @JavariaAshraf , The options are the following:
The option of skipping sourmash is the quickest one. Hope that helps. |
I have used the option and got the following error: '/....rror executing process > BACANNOT:GFF2GBK (SH32)
Caused by:
Process BACANNOT:GFF2GBK (SH32) terminated with an error exit status (255)
Command executed:
# Run emboss seqret
seqret \
-sequence SH32.fna \
-feature \
-fformat gff \
-fopenfile SH32.gff \
-osformat genbank \
-osname_outseq SH32 \
-ofdirectory_outseq gbk_file \
-auto
Command exit status:
255
Command output:
(empty)
Command error:
Warning: GFF3 tag parsing failed on '_predicted_tail_fibre_assembly_protein' in
'ID=ALKKIJME_03776;Name=tfaE;gene=tfaE;inference=ab initio prediction:Prodigal:002006,similar to AA sequence:UniProtKB:P09153;locus_tag=ALKKIJME_03776;product=Prophage tail fiber assembly protein TfaE;Additional_database=KEGG;KO=K09810,K02003;Method=KOfamscan;Additional_database=PHAST;PHAST:Target=PHAST_16129505;PHAST:Product=Qin_prophage;_predicted_tail_fibre_assembly_protein'
Warning: GFF3 tag parsing failed on '_essential_for_genome_maintenance' in
'ID=ALKKIJME_02014;eC_number=2.3.1.234;Name=tsaD;db_xref=COG:COG0533;gene=tsaD;inference=ab initio prediction:Prodigal:002006,similar to AA sequence:UniProtKB:P05852;locus_tag=ALKKIJME_02014;product=tRNA N6-adenosine threonylcarbamoyltransferase;Additional_database=KEGG;KO=K07406;Method=KOfamscan;Additional_database=PHAST;PHAST:Target=PHAST_16130960;PHAST:Product=glycation_binding_protein,_predicted_protease/chaperone;_essential_for_genome_maintenance'
Warning: GFF3 tag parsing failed on '_predicted_prophage_CPS_53_integrase' in
'ID=ALKKIJME_03671;Name=intS_1;db_xref=COG:COG0582;gene=intS_1;inference=ab initio prediction:Prodigal:002006,similar to AA sequence:UniProtKB:P37326;locus_tag=ALKKIJME_03671;product=Prophage integrase IntS;Additional_database=PHAST;PHAST:Target=PHAST_16130281;PHAST:Product=CPS_53_(KpLE1)_prophage;_predicted_prophage_CPS_53_integrase'
Warning: GFF3 tag parsing failed on '_predicted_tail_fibre_assembly_protein' in
'ID=ALKKIJME_03776;Name=tfaE;gene=tfaE;inference=ab initio prediction:Prodigal:002006,similar to AA sequence:UniProtKB:P09153;locus_tag=ALKKIJME_03776;product=Prophage tail fiber assembly protein TfaE;Additional_database=KEGG;KO=K09810,K02003;Method=KOfamscan;Additional_database=PHAST;PHAST:Target=PHAST_16129505;PHAST:Product=Qin_prophage;_predicted_tail_fibre_assembly_protein'
Warning: GFF3 tag parsing failed on '_essential_for_genome_maintenance' in
'ID=ALKKIJME_02014;eC_number=2.3.1.234;Name=tsaD;db_xref=COG:COG0533;gene=tsaD;inference=ab initio prediction:Prodigal:002006,similar to AA sequence:UniProtKB:P05852;locus_tag=ALKKIJME_02014;product=tRNA N6-adenosine threonylcarbamoyltransferase;Additional_database=KEGG;KO=K07406;Method=KOfamscan;Additional_database=PHAST;PHAST:Target=PHAST_16130960;PHAST:Product=glycation_binding_protein,_predicted_protease/chaperone;_essential_for_genome_maintenance'
Warning: GFF3 tag parsing failed on '_predicted_prophage_CPS_53_integrase' in
'ID=ALKKIJME_03671;Name=intS_1;db_xref=COG:COG0582;gene=intS_1;inference=ab initio prediction:Prodigal:002006,similar to AA sequence:UniProtKB:P37326;locus_tag=ALKKIJME_03671;product=Prophage integrase IntS;Additional_database=PHAST;PHAST:Target=PHAST_16130281;PHAST:Product=CPS_53_(KpLE1)_prophage;_predicted_prophage_CPS_53_integrase'
Warning: GFF3 tag parsing failed on '_predicted_tail_fibre_assembly_protein' in
'ID=ALKKIJME_03776;Name=tfaE;gene=tfaE;inference=ab initio prediction:Prodigal:002006,similar to AA sequence:UniProtKB:P09153;locus_tag=ALKKIJME_03776;product=Prophage tail fiber assembly protein TfaE;Additional_database=KEGG;KO=K09810,K02003;Method=KOfamscan;Additional_database=PHAST;PHAST:Target=PHAST_16129505;PHAST:Product=Qin_prophage;_predicted_tail_fibre_assembly_protein'
Warning: GFF3 tag parsing failed on '_essential_for_genome_maintenance' in
'ID=ALKKIJME_02014;eC_number=2.3.1.234;Name=tsaD;db_xref=COG:COG0533;gene=tsaD;inference=ab initio prediction:Prodigal:002006,similar to AA sequence:UniProtKB:P05852;locus_tag=ALKKIJME_02014;product=tRNA N6-adenosine threonylcarbamoyltransferase;Additional_database=KEGG;KO=K07406;Method=KOfamscan;Additional_database=PHAST;PHAST:Target=PHAST_16130960;PHAST:Product=glycation_binding_protein,_predicted_protease/chaperone;_essential_for_genome_maintenance'
Warning: GFF3 tag parsing failed on '_predicted_prophage_CPS_53_integrase' in
'ID=ALKKIJME_03671;Name=intS_1;db_xref=COG:COG0582;gene=intS_1;inference=ab initio prediction:Prodigal:002006,similar to AA sequence:UniProtKB:P37326;locus_tag=ALKKIJME_03671;product=Prophage integrase IntS;Additional_database=PHAST;PHAST:Target=PHAST_16130281;PHAST:Product=CPS_53_(KpLE1)_prophage;_predicted_prophage_CPS_53_integrase'
Warning: GFF3 tag parsing failed on '_predicted_tail_fibre_assembly_protein' in
'ID=ALKKIJME_03776;Name=tfaE;gene=tfaE;inference=ab initio prediction:Prodigal:002006,similar to AA sequence:UniProtKB:P09153;locus_tag=ALKKIJME_03776;product=Prophage tail fiber assembly protein TfaE;Additional_database=KEGG;KO=K09810,K02003;Method=KOfamscan;Additional_database=PHAST;PHAST:Target=PHAST_16129505;PHAST:Product=Qin_prophage;_predicted_tail_fibre_assembly_protein'
Warning: GFF3 tag parsing failed on '_essential_for_genome_maintenance' in
'ID=ALKKIJME_02014;eC_number=2.3.1.234;Name=tsaD;db_xref=COG:COG0533;gene=tsaD;inference=ab initio prediction:Prodigal:002006,similar to AA sequence:UniProtKB:P05852;locus_tag=ALKKIJME_02014;product=tRNA N6-adenosine threonylcarbamoyltransferase;Additional_database=KEGG;KO=K07406;Method=KOfamscan;Additional_database=PHAST;PHAST:Target=PHAST_16130960;PHAST:Product=glycation_binding_protein,_predicted_protease/chaperone;_essential_for_genome_maintenance'
Warning: GFF3 tag parsing failed on '_predicted_prophage_CPS_53_integrase' in
'ID=ALKKIJME_03671;Name=intS_1;db_xref=COG:COG0582;gene=intS_1;inference=ab initio prediction:Prodigal:002006,similar to AA sequence:UniProtKB:P37326;locus_tag=ALKKIJME_03671;product=Prophage integrase IntS;Additional_database=PHAST;PHAST:Target=PHAST_16130281;PHAST:Product=CPS_53_(KpLE1)_prophage;_predicted_prophage_CPS_53_integrase'
Warning: GFF3 tag parsing failed on '_predicted_tail_fibre_assembly_protein' in
'ID=ALKKIJME_03776;Name=tfaE;gene=tfaE;inference=ab initio prediction:Prodigal:002006,similar to AA sequence:UniProtKB:P09153;locus_tag=ALKKIJME_03776;product=Prophage tail fiber assembly protein TfaE;Additional_database=KEGG;KO=K09810,K02003;Method=KOfamscan;Additional_database=PHAST;PHAST:Target=PHAST_16129505;PHAST:Product=Qin_prophage;_predicted_tail_fibre_assembly_protein'
Warning: GFF3 tag parsing failed on '_essential_for_genome_maintenance' in
'ID=ALKKIJME_02014;eC_number=2.3.1.234;Name=tsaD;db_xref=COG:COG0533;gene=tsaD;inference=ab initio prediction:Prodigal:002006,similar to AA sequence:UniProtKB:P05852;locus_tag=ALKKIJME_02014;product=tRNA N6-adenosine threonylcarbamoyltransferase;Additional_database=KEGG;KO=K07406;Method=KOfamscan;Additional_database=PHAST;PHAST:Target=PHAST_16130960;PHAST:Product=glycation_binding_protein,_predicted_protease/chaperone;_essential_for_genome_maintenance'
Warning: GFF3 tag parsing failed on '_predicted_prophage_CPS_53_integrase' in
'ID=ALKKIJME_03671;Name=intS_1;db_xref=COG:COG0582;gene=intS_1;inference=ab initio prediction:Prodigal:002006,similar to AA sequence:UniProtKB:P37326;locus_tag=ALKKIJME_03671;product=Prophage integrase IntS;Additional_database=PHAST;PHAST:Target=PHAST_16130281;PHAST:Product=CPS_53_(KpLE1)_prophage;_predicted_prophage_CPS_53_integrase'
Warning: GFF3 tag parsing failed on '_predicted_tail_fibre_assembly_protein' in
'ID=ALKKIJME_03776;Name=tfaE;gene=tfaE;inference=ab initio prediction:Prodigal:002006,similar to AA sequence:UniProtKB:P09153;locus_tag=ALKKIJME_03776;product=Prophage tail fiber assembly protein TfaE;Additional_database=KEGG;KO=K09810,K02003;Method=KOfamscan;Additional_database=PHAST;PHAST:Target=PHAST_16129505;PHAST:Product=Qin_prophage;_predicted_tail_fibre_assembly_protein'
Warning: GFF3 tag parsing failed on '_essential_for_genome_maintenance' in
'ID=ALKKIJME_02014;eC_number=2.3.1.234;Name=tsaD;db_xref=COG:COG0533;gene=tsaD;inference=ab initio prediction:Prodigal:002006,similar to AA sequence:UniProtKB:P05852;locus_tag=ALKKIJME_02014;product=tRNA N6-adenosine threonylcarbamoyltransferase;Additional_database=KEGG;KO=K07406;Method=KOfamscan;Additional_database=PHAST;PHAST:Target=PHAST_16130960;PHAST:Product=glycation_binding_protein,_predicted_protease/chaperone;_essential_for_genome_maintenance'
Warning: GFF3 tag parsing failed on '_predicted_prophage_CPS_53_integrase' in
'ID=ALKKIJME_03671;Name=intS_1;db_xref=COG:COG0582;gene=intS_1;inference=ab initio prediction:Prodigal:002006,similar to AA sequence:UniProtKB:P37326;locus_tag=ALKKIJME_03671;product=Prophage integrase IntS;Additional_database=PHAST;PHAST:Target=PHAST_16130281;PHAST:Product=CPS_53_(KpLE1)_prophage;_predicted_prophage_CPS_53_integrase'
Warning: GFF3 tag parsing failed on '_predicted_tail_fibre_assembly_protein' in
'ID=ALKKIJME_03776;Name=tfaE;gene=tfaE;inference=ab initio prediction:Prodigal:002006,similar to AA sequence:UniProtKB:P09153;locus_tag=ALKKIJME_03776;product=Prophage tail fiber assembly protein TfaE;Additional_database=KEGG;KO=K09810,K02003;Method=KOfamscan;Additional_database=PHAST;PHAST:Target=PHAST_16129505;PHAST:Product=Qin_prophage;_predicted_tail_fibre_assembly_protein'
Warning: GFF3 tag parsing failed on '_essential_for_genome_maintenance' in
'ID=ALKKIJME_02014;eC_number=2.3.1.234;Name=tsaD;db_xref=COG:COG0533;gene=tsaD;inference=ab initio prediction:Prodigal:002006,similar to AA sequence:UniProtKB:P05852;locus_tag=ALKKIJME_02014;product=tRNA N6-adenosine threonylcarbamoyltransferase;Additional_database=KEGG;KO=K07406;Method=KOfamscan;Additional_database=PHAST;PHAST:Target=PHAST_16130960;PHAST:Product=glycation_binding_protein,_predicted_protease/chaperone;_essential_for_genome_maintenance'
Warning: GFF3 tag parsing failed on '_predicted_prophage_CPS_53_integrase' in
'ID=ALKKIJME_03671;Name=intS_1;db_xref=COG:COG0582;gene=intS_1;inference=ab initio prediction:Prodigal:002006,similar to AA sequence:UniProtKB:P37326;locus_tag=ALKKIJME_03671;product=Prophage integrase IntS;Additional_database=PHAST;PHAST:Target=PHAST_16130281;PHAST:Product=CPS_53_(KpLE1)_prophage;_predicted_prophage_CPS_53_integrase'
Warning: GFF3 tag parsing failed on '_predicted_tail_fibre_assembly_protein' in
'ID=ALKKIJME_03776;Name=tfaE;gene=tfaE;inference=ab initio prediction:Prodigal:002006,similar to AA sequence:UniProtKB:P09153;locus_tag=ALKKIJME_03776;product=Prophage tail fiber assembly protein TfaE;Additional_database=KEGG;KO=K09810,K02003;Method=KOfamscan;Additional_database=PHAST;PHAST:Target=PHAST_16129505;PHAST:Product=Qin_prophage;_predicted_tail_fibre_assembly_protein'
Warning: GFF3 tag parsing failed on '_essential_for_genome_maintenance' in
'ID=ALKKIJME_02014;eC_number=2.3.1.234;Name=tsaD;db_xref=COG:COG0533;gene=tsaD;inference=ab initio prediction:Prodigal:002006,similar to AA sequence:UniProtKB:P05852;locus_tag=ALKKIJME_02014;product=tRNA N6-adenosine threonylcarbamoyltransferase;Additional_database=KEGG;KO=K07406;Method=KOfamscan;Additional_database=PHAST;PHAST:Target=PHAST_16130960;PHAST:Product=glycation_binding_protein,_predicted_protease/chaperone;_essential_for_genome_maintenance'
Warning: GFF3 tag parsing failed on '_predicted_prophage_CPS_53_integrase' in
'ID=ALKKIJME_03671;Name=intS_1;db_xref=COG:COG0582;gene=intS_1;inference=ab initio prediction:Prodigal:002006,similar to AA sequence:UniProtKB:P37326;locus_tag=ALKKIJME_03671;product=Prophage integrase IntS;Additional_database=PHAST;PHAST:Target=PHAST_16130281;PHAST:Product=CPS_53_(KpLE1)_prophage;_predicted_prophage_CPS_53_integrase'
Warning: GFF3 tag parsing failed on '_predicted_tail_fibre_assembly_protein' in
'ID=ALKKIJME_03776;Name=tfaE;gene=tfaE;inference=ab initio prediction:Prodigal:002006,similar to AA sequence:UniProtKB:P09153;locus_tag=ALKKIJME_03776;product=Prophage tail fiber assembly protein TfaE;Additional_database=KEGG;KO=K09810,K02003;Method=KOfamscan;Additional_da'"
;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;; please have a look. |
Hi @JavariaAshraf , checking the error We can see here that indeed this seems to be the case: ❯ grep ';' bacannot_dbs_2024_jul_05/phast_db/sequences | head -n 2
>PHAST~~~Qin_prophage;_predicted_tail_fibre_assembly_protein~~~NP_416064.1~~~PHAST_16129505~~~PROPHAGE_Escher_MG1655-gi|16129505|ref|NP_416064.1| Qin prophage; predicted tail fibre assembly protein [Escherichia coli str. K-12 substr. MG1655]
>PHAST~~~Qin_prophage;_predicted_side_tail_fibre_assembly_protein~~~NP_416065.1~~~PHAST_16129506~~~PROPHAGE_Escher_MG1655-gi|16129506|ref|NP_416065.1| Qin prophage; predicted side tail fibre assembly protein [Escherichia coli str. K-12 substr. MG1655] This is an error that I only saw now because I never executed with this species. Thanks to you to have noticed such scenario. The proper fix The correct fix for the pipeline would be the following:
This round would take a while and we probably can finish next week. The quick fix If you really need to have some results the soonest possible, I would suggest you run the same command line, but add the following parameter Let me know which one you are more inclined to. Anyways, I will start (in this branch) with the ""the proper fix"" solution and will ping here in this thread once I would have something for you to test. |
Also, is this a private genome or a public genome? Was just wondering if I could test with the same sequence while implementing. |
Hi @JavariaAshraf , Assuming you have the bacannot database in the following path:
To assess whether the code added fixes the bug or not. If it does, then I can go for a release. If it does not, or something else appears, I have to check the code again. |
Hi fmalmeida, WARN:Cannot read project manifest -- Cause: Remote resource not found: https://api.github.com/repos/nextflow/bacannot/contents/nextflow.config?ref=131-bacannotsummary-keyerror-tmrna
Remote resource not found: https://api.github.com/repos/nextflow/bacannot/contents/main.nf?ref=131-bacannotsummary-keyerror-tmrna It occurred while I was following your defined steps in the proper fix. |
Hi @JavariaAshraf , Was just a misspelling in the command line. I did not have the proper name I have just updated the comment with the correct spelling. Can you try again? |
getting the same error still. |
What command exactly you executed? You mean the last error or the first? It should be |
Hi fmalmeida,
i am experiencing the following error with the new version.
please help me fix the error.
below is the error:
help needed.
thank you,
Javaria Ashraf
The text was updated successfully, but these errors were encountered: