-
Notifications
You must be signed in to change notification settings - Fork 4k
ARROW-15281: [C++] Implement ability to retrieve fragment filename #12560
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
python/pyarrow/_dataset.pyx
Outdated
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
C++ may return an error, but Python will raise an exception, right?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes I think so, I checked that again.
If we see here
https://github.com/apache/arrow/blob/master/python/pyarrow/_dataset.pyx#L2014
then this goes to here,
https://github.com/apache/arrow/blob/master/python/pyarrow/error.pxi#L99
which raises an ArrowInvalid exception. So, it might be better to mention that it raises an exception here.
cc: @westonpace
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yikes, yes this was totally my fault. My original suggestion was bad. Thank you for switching it back.
…gment, used ToString()
56fa37f to
55b6aea
Compare
lidavidm
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks, just a minor suggestion
|
Failure in hash-join-node-test is unrelated, but interesting. I've added it to ARROW-15221. |
|
Benchmark runs are scheduled for baseline = 600288e and contender = 3eb1bac. 3eb1bac is a master commit associated with this PR. Results will be available as each benchmark for each run completes. |
This PR adds the feature to include the filename of a fragment if it is a FileFragment as a scalar value and includes the augmented fields in the dataset schema