-
Notifications
You must be signed in to change notification settings - Fork 80
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[MRG] Fix multigather
so that the output CSV contains all matches.
#2322
Conversation
Codecov Report
@@ Coverage Diff @@
## latest #2322 +/- ##
==========================================
+ Coverage 84.84% 92.14% +7.30%
==========================================
Files 131 100 -31
Lines 15689 11420 -4269
Branches 2190 2191 +1
==========================================
- Hits 13311 10523 -2788
+ Misses 2083 602 -1481
Partials 295 295
Flags with carried forward coverage won't be shown. Click here to find out more.
📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more |
multigather
so that the output CSV contains all matches.multigather
so that the output CSV contains all matches.
multigather
so that the output CSV contains all matches.multigather
so that the output CSV contains all matches.
…o fix/multigather_indent
@sourmash-bio/devs @dkoslicki ready for review! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks great to me! When I have a chance, I'll probably give a shot at parallelizing this. I have plans for 1000's of gather
s
Note: PR into #2065. This PR updates #2065 with all the changes from `latest`. This includes the fix & update to multigather in #2322, which: - fixes the output of multigather to include more than one line 😆 - prints out the output filename - supports `--output-dir` No functional changes are made beyond the merge, so some tests are still failing; will discuss fixes in yet a new PR :). --------- Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: Keya Barve <[email protected]> Co-authored-by: ccbaumler <[email protected]> Co-authored-by: Tessa Pierce Ward <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Taylor Reiter <[email protected]> Co-authored-by: Erik Young <[email protected]> Co-authored-by: David Koslicki <[email protected]> Co-authored-by: Luiz Irber <[email protected]> Co-authored-by: Colton Baumler <[email protected]> Co-authored-by: Luiz Irber <[email protected]> Co-authored-by: N. Tessa Pierce-Ward <[email protected]> Co-authored-by: Peter Cock <[email protected]> Co-authored-by: Francesco Beghini <[email protected]> Co-authored-by: Jason Stajich <[email protected]> Co-authored-by: Katrin Leinweber <[email protected]>
This PR fixes #2321 so that more than one output line is placed in the CSV. Oops!
It also adds a notification of what the CSV output file name is.
Last but not least, it supports
--output-dir
as a way to set the base path for all output files.Fixes #2321.
TODO:
multigather
CSV output uses signaturefilename
as basename. #2328