Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

use 64bit int for genomic coordinates everywhere (GSeg & up) #13

Open
gpertea opened this issue Sep 13, 2024 · 0 comments
Open

use 64bit int for genomic coordinates everywhere (GSeg & up) #13

gpertea opened this issue Sep 13, 2024 · 0 comments
Assignees

Comments

@gpertea
Copy link
Owner

gpertea commented Sep 13, 2024

Genomes with contigs larger than 2GB cause unpredictable errors/crashes due to the core GSeg data structure using 32bit unsigned int (the unsigned part doesn't help with consistently increasing the limit to 4GB due to inevitable signed int arithmetic).

GFaSeqGet is likely the first to cause a failure in gffread so it needs to be migrated to 64bit coordinates at the same time.
This requires gffread/gffcompare updates as well.

Proper tests should be written for these gff parsing and GFaSeqGet on small and large genomes.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant