GH-14872: [R] arrow returns wrong variable content when multiple group_by/summarise statements are used #14905
Wouldn't it be better to add this example to the tests?
|
I did add a test that covers that case (and made sure it failed on current master); however, the test I added matches the style and content of the tests around it (e.g., they all use |
I thought an example of defining a new column in
|
Good catch there, though that looks like a bug which exists separately from this PR, as I was able to reproduce it on the tip of the master branch. @paleolimbot - wanna take a look at this here, or shall we open a separate issue and come back to this?
|
It's definitely worth adding a test! Thanks @eitsupi for the catch. It's definitely related to this PR...the overlapping groups thing results in some very odd arguments to the |
|
Is there any opposition to merging this?
OK, I see now that @eitsupi's comment was referring to the behaviour before these changes, which gave the surprising outcome; that has now been fixed by the changes here.
LGTM!
|
Benchmark runs are scheduled for baseline = 16d0eb4 and contender = 9753a67. 9753a67 is a master commit associated with this PR. Results will be available as each benchmark for each run completes.
|
['Python', 'R'] benchmarks have a high level of regressions.
Reprex using CRAN arrow:
Created on 2022-12-09 with reprex v2.0.2
After this PR:
Created on 2022-12-09 with reprex v2.0.2