Skip to content

Implement row group pruning with stats in experimental PQ reader#18543

Merged
rapids-bot[bot] merged 33 commits intorapidsai:branch-25.06from
mhaseeb123:fea/filter-row-groups-with-stats
May 6, 2025
Merged

Implement row group pruning with stats in experimental PQ reader#18543
rapids-bot[bot] merged 33 commits intorapidsai:branch-25.06from
mhaseeb123:fea/filter-row-groups-with-stats

Conversation

@mhaseeb123
Copy link
Member

@mhaseeb123 mhaseeb123 commented Apr 22, 2025

Description

Contributes to #17896. Part of #18011.

This PR implements row group pruning with stats in the experimental Parquet reader optimized for hybrid scan queries

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

@copy-pr-bot
Copy link

copy-pr-bot bot commented Apr 22, 2025

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

@github-actions github-actions bot added libcudf Affects libcudf (C++/CUDA) code. CMake CMake build issue labels Apr 22, 2025
@mhaseeb123 mhaseeb123 added feature request New feature or request 2 - In Progress Currently a work in progress cuIO cuIO issue DO NOT MERGE Hold off on merging; see PR for details non-breaking Non-breaking change labels Apr 22, 2025
@mhaseeb123 mhaseeb123 changed the title Impl Row group pruning with stats in experimental PQ reader Implement row group pruning with stats in experimental PQ reader Apr 22, 2025
@mhaseeb123 mhaseeb123 changed the title Implement row group pruning with stats in experimental PQ reader 🚧 Implement row group pruning with stats in experimental PQ reader Apr 22, 2025
@mhaseeb123 mhaseeb123 removed the DO NOT MERGE Hold off on merging; see PR for details label Apr 30, 2025
@github-actions github-actions bot removed the CMake CMake build issue label Apr 30, 2025
@mhaseeb123 mhaseeb123 requested a review from davidwendt May 1, 2025 20:51
@mhaseeb123 mhaseeb123 added 4 - Needs Review Waiting for reviewer to review or respond and removed 3 - Ready for Review Ready for review by team labels May 1, 2025
@GregoryKimball GregoryKimball moved this to Burndown in libcudf May 5, 2025
Copy link
Contributor

@vuule vuule left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

as usual, some nitpicks and questions, nothing truly blocking

@mhaseeb123 mhaseeb123 requested a review from vuule May 5, 2025 22:28
Copy link
Contributor

@vuule vuule left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

one more!

Copy link
Contributor

@vuule vuule left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🔥

@mhaseeb123 mhaseeb123 added 5 - Ready to Merge Testing and reviews complete, ready to merge and removed 4 - Needs Review Waiting for reviewer to review or respond labels May 6, 2025
@mhaseeb123
Copy link
Member Author

/merge

@rapids-bot rapids-bot bot merged commit e5e8164 into rapidsai:branch-25.06 May 6, 2025
109 of 110 checks passed
vyasr added a commit to vyasr/cudf that referenced this pull request May 6, 2025
@mhaseeb123 mhaseeb123 deleted the fea/filter-row-groups-with-stats branch May 6, 2025 18:01
@GregoryKimball GregoryKimball moved this from Burndown to Landed in libcudf May 6, 2025
@GregoryKimball GregoryKimball removed this from libcudf Jul 11, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

5 - Ready to Merge Testing and reviews complete, ready to merge cuIO cuIO issue feature request New feature or request libcudf Affects libcudf (C++/CUDA) code. non-breaking Non-breaking change

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants