-
Notifications
You must be signed in to change notification settings - Fork 57
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
agat_sp_extract_sequences.pl WARNINGs & 13 exons created that were missing #273
Comments
Hi, Yes it is because the "cDNA_match" are level1 feature (l1 feature does expect child features linked to it, thus need to be identifieable with a uniq ID).
Will give
So you will create a chimere group of feature of cDNA_match, instead to get 3 cDNA_match group of features with each its own child. The cDNA_match expect a child which does exist in your file. Consequetly AGAT get rid of them. As you are interested in exon and none are present representing the cDNA_match, you can ignore the message. In case you want to parse those feature properly, you will have to handle them as level2 feature. Either you change the feature level json file by moving cDNA_match from level1 to level2 file, either you change |
Describe the bug
I was trying to extract exon fasta from a gff3 annotation and reference genome by
agat_sp_extract_sequences.pl -f GCF_014356525.1_ASM1435652v1_genomic.fna -g GCF_014356525.1_ASM1435652v1_genomic.gff -t exon --split -o nl_exons.fasta
.AGAT was like:
and in the end I noticed this:
I wonder if it's an AGAT bug that causes these warning messages and some exons to be missing. Any advice would be appreciated.
General (please complete the following information):
To Reproduce
I downloaded genomic fna and gff files from ncbi and unzipped them.
https://ftp.ncbi.nlm.nih.gov/genomes/refseq/invertebrate/Nilaparvata_lugens/latest_assembly_versions/GCF_014356525.1_ASM1435652v1/GCF_014356525.1_ASM1435652v1_genomic.fna.gz
https://ftp.ncbi.nlm.nih.gov/genomes/refseq/invertebrate/Nilaparvata_lugens/latest_assembly_versions/GCF_014356525.1_ASM1435652v1/GCF_014356525.1_ASM1435652v1_genomic.gff.gz
AGAT v0.9.2 installation by Conda.
And I run AGAT with this command
agat_sp_extract_sequences.pl -f GCF_014356525.1_ASM1435652v1_genomic.fna -g GCF_014356525.1_ASM1435652v1_genomic.gff -t exon --split -o nl_exons.fasta > agat.log
agat.log
The text was updated successfully, but these errors were encountered: