read compressed alignment files? #323
Would Zip suit? I know it's not hip, even a little bit, but we are already using it. https://github.com/matsen/pplacer/blob/dev/pplacer_src/refpkg_parse.ml
camlzip supports zip and gzip; we could use that without adding On Mon, Nov 18, 2013 at 11:12 AM, Erick Matsen [email protected]:
Ah, nice. I'll have a go on this some afternoon. On Mon, Nov 18, 2013 at 11:16 AM, Connor McCoy [email protected]:
Frederick "Erick" Matsen, Assistant Member |
gzip would probably be preferable for single files, if available.
Camlzip knows how to read bytes, characters, and sets thereof. When we read in fasta (e.g.) files, they get read in line by line and tokenized (see ppatteries for the definition of |
Maybe we could hook into the Batteries I/O interface? IO.create_in (for camlzip |
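For illustration, the idea of exposing a gzip stream through the Batteries I/O interface might look roughly like the following. This is a minimal sketch, not necessarily what was committed to pplacer: it assumes a recent camlzip (`Gzip` module working on `bytes`) and a recent Batteries (`BatIO`, referred to as `IO` in the thread), and the names `input_of_gzip` and `read_lines` are hypothetical.

```ocaml
(* Sketch only: wrap a camlzip gzip channel as a Batteries input.
   Assumes the camlzip and batteries libraries are linked in. *)
let input_of_gzip path =
  let gz = Gzip.open_in path in
  BatIO.create_in
    (* Gzip signals end of stream with End_of_file; BatIO expects No_more_input. *)
    ~read:(fun () ->
      try Gzip.input_char gz with End_of_file -> raise BatIO.No_more_input)
    (* Gzip.input returns 0 at end of stream rather than raising. *)
    ~input:(fun buf pos len ->
      let n = Gzip.input gz buf pos len in
      if n = 0 && len > 0 then raise BatIO.No_more_input else n)
    ~close:(fun () -> Gzip.close_in gz)

(* Hypothetical helper: read every line of a gzipped text file. *)
let read_lines path =
  let ic = input_of_gzip path in
  let rec loop acc =
    match BatIO.read_line ic with
    | line -> loop (line :: acc)
    | exception BatIO.No_more_input -> List.rev acc
  in
  let lines = loop [] in
  BatIO.close_in ic;
  lines
```

With a wrapper of this kind, existing line-by-line parsers that consume a Batteries input would not need to know whether the underlying file was compressed.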
Excellent! On Mon, Nov 18, 2013 at 5:19 PM, Connor McCoy [email protected]:
e.g. `pplacer -c vaginal_16s.refpkg src/p4z1r36.fasta.gz` generates p4z1r36.jplace
checked via: `pplacer -p -c vaginal_16s.refpkg src/p4z1r36.fasta.gz -o test.jplace.gz`
Same as Ppatteries.safe_chop_suffix
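As a rough illustration of the kind of suffix handling involved, here is a minimal sketch using only the OCaml standard library. The `safe_chop_suffix` below is a hypothetical reimplementation (the actual Ppatteries definition is not shown in this thread), and `is_gzipped` and `jplace_name_of_input` are illustrative names. Stripping `.fasta` specifically is an assumption made for this example.

```ocaml
(* Hypothetical helper: drop [suffix] from [name] when present, otherwise
   return [name] unchanged. Filename.chop_suffix on its own is only
   well-defined when the suffix is actually there. *)
let safe_chop_suffix name suffix =
  if Filename.check_suffix name suffix
  then Filename.chop_suffix name suffix
  else name

(* Illustrative check: treat a file as gzipped if its name ends in ".gz". *)
let is_gzipped name = Filename.check_suffix name ".gz"

(* Illustrative: derive a .jplace output name from a query file that may
   or may not be gzipped. *)
let jplace_name_of_input path =
  let base = Filename.basename path in
  let base = safe_chop_suffix base ".gz" in
  let base = safe_chop_suffix base ".fasta" in
  base ^ ".jplace"
```

Under these assumptions, `jplace_name_of_input "src/p4z1r36.fasta.gz"` yields `"p4z1r36.jplace"`, matching the output name in the example command above.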
This is working for me on the microbiome demo - sequence files ending in I added |
Whiplash. Nice work, and glad to see tests.
A low priority request/question: can pplacer and friends be made to read compressed alignment files (eg, .gz or .bz2) natively? This could potentially save a lot of room on disk for work in progress.