Group exercise #1

The data folder contains a FASTA file, Oryza_sativa.IRGSP-1.0.dna_rm.chromosome.10, with the genomic sequence of chromosome 10 of the japonica rice genome, and a GFF3 file with the gene features for each gene on chromosome 10. Using these files, along with the aquaporin files in the aquaporin folder, write a shell or Python script that creates a FASTA file containing the CDS of every aquaporin transcript on rice chromosome 10. Use git to collaborate on this project.
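One possible starting point, offered as a rough sketch rather than the expected solution: the Python below groups the CDS rows of a GFF3 file by their parent transcript and then stitches the matching pieces out of a chromosome sequence. The function names `parse_cds_segments` and `build_cds`, the toy sequence, and the transcript IDs `T1` and `T2` are invented for illustration. The sketch also assumes that each CDS row carries a `Parent=` attribute naming its transcript, which you should check against the actual file.

```python
from collections import defaultdict

# Translation table used to complement a DNA string (N stays N).
_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def parse_cds_segments(gff3_lines):
    """Group CDS segments by the transcript named in their Parent attribute."""
    segments = defaultdict(list)
    for line in gff3_lines:
        # Skip directives, comments and blank lines.
        if line.startswith("#") or not line.strip():
            continue
        cols = line.rstrip("\n").split("\t")
        # A GFF3 feature row has nine tab-separated columns; column 3 is the type.
        if len(cols) != 9 or cols[2] != "CDS":
            continue
        start, end, strand = int(cols[3]), int(cols[4]), cols[6]
        # Column 9 holds key=value pairs separated by semicolons.
        attrs = dict(f.split("=", 1) for f in cols[8].split(";") if "=" in f)
        parent = attrs.get("Parent")  # assumption: CDS rows name their transcript here
        if parent:
            segments[parent].append((start, end, strand))
    return segments


def build_cds(chrom_seq, parts):
    """Join CDS segments in genomic order; reverse-complement minus-strand CDS."""
    ordered = sorted(parts)
    # GFF3 coordinates are 1-based and inclusive, hence the start - 1.
    seq = "".join(chrom_seq[start - 1:end] for start, end, _ in ordered)
    if ordered[0][2] == "-":
        seq = seq.translate(_COMPLEMENT)[::-1]
    return seq


# Toy data (hypothetical): a 12-base "chromosome" and three CDS rows.
toy_chrom = "ATGCCAGGGTAA"
toy_gff3 = [
    "##gff-version 3",
    "10\tdemo\tCDS\t1\t3\t.\t+\t0\tID=CDS:a;Parent=transcript:T1",
    "10\tdemo\tCDS\t7\t9\t.\t+\t0\tID=CDS:a;Parent=transcript:T1",
    "10\tdemo\tCDS\t4\t6\t.\t-\t0\tID=CDS:b;Parent=transcript:T2",
]
for transcript_id, parts in parse_cds_segments(toy_gff3).items():
    print(transcript_id, build_cds(toy_chrom, parts))
```

In the real exercise, `toy_chrom` would be replaced by the chromosome 10 sequence read from the FASTA file, and the transcripts would be filtered down to the aquaporin ones before writing any output.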

Hints

Familiarise yourself with the format of a GFF3 file. See the Ensembl documentation for more details: https://www.ensembl.org/info/website/upload/gff3.html
A CDS contains only exon sequence, with no introns.
You can append new output to an existing file in Python by opening it with the "a" mode argument. E.g. a file opened with output_file = open("filename.txt", "a") will take in new output without overwriting the existing content.
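The append hint above might look like this in practice. This is only a minimal sketch: the helper name `append_fasta_record`, the 60-character line width, and the demo headers and sequences are assumptions made for illustration, not part of the exercise.

```python
import os
import tempfile


def append_fasta_record(path, header, sequence, width=60):
    """Append one FASTA record to `path`, wrapping the sequence at `width`."""
    # Mode "a" creates the file if it is missing and otherwise adds to the
    # end, leaving existing records untouched.
    with open(path, "a") as output_file:
        output_file.write(f">{header}\n")
        for i in range(0, len(sequence), width):
            output_file.write(sequence[i:i + width] + "\n")


# Demo in a temporary directory so no real file is modified.
demo_path = os.path.join(tempfile.mkdtemp(), "demo_cds.fa")
append_fasta_record(demo_path, "transcript_A", "ATGGCC")
append_fasta_record(demo_path, "transcript_B", "ATGTAA")
with open(demo_path) as handle:
    print(handle.read())
```

Because append mode never clears the file, running the same script twice writes every record twice. Deleting the output file at the start of a run, or opening it once with "w", avoids that.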
