fix: Calling count on a pyarrow dataset results in an error #843

Merged: 1 commit, Sep 22, 2024

@Michael-J-Ward (Contributor) commented Aug 27, 2024

Which issue does this PR close?

Closes #800.

Rationale for this change

This PR adds a test that demonstrates the issue. The underlying bug was fixed by an upstream PR: apache/arrow-rs#6318.

I have now rebased this PR onto the latest main, which includes the upstream fix, so the tests pass.

What changes are included in this PR?

Are there any user-facing changes?

df.count() now succeeds on DataFrames backed by pyarrow datasets instead of raising an error.

@Michael-J-Ward marked this pull request as ready for review on September 18, 2024, 18:24
@timsaucer merged commit f6261b0 into apache:main on Sep 22, 2024
15 checks passed
Development

Successfully merging this pull request may close these issues.

Error on dataframe count using arrow dataset