Support multi-partition `Select` operations with aggregations#18492

Merged

rapids-bot[bot] merged 20 commits intorapidsai:branch-25.06from

rjzamora:complex-aggregations-ir

Apr 29, 2025

Member

rjzamora commented Apr 14, 2025 •

edited

Loading

Description

This PR supersedes #17941

In contrast to 17941, this PR does not introduce any new task-graph logic. Instead, complex expression graphs (expression graphs containing non-pointwise nodes) are decomposed into multiple IR nodes.

The design used in this PR is probably more intuitive than FusedExpr concept.

Illustration

TODO:

Test that performance is similar to FusedExpr.

Checklist

I am familiar with the Contributing Guidelines.
New or existing tests cover these changes.
The documentation is up to date with these changes.


          Refactored Select decomposition - IR rewrites only - No new task-grap…

72017e0

…h logic.

rjzamora added feature request 2 - In Progress non-breaking cudf-polars labels

rjzamora self-assigned this

github-project-automation bot added this to cuDF Python

copy-pr-bot bot commented Apr 14, 2025

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

github-actions bot added the Python label

rjzamora commented

View reviewed changes

python/cudf_polars/cudf_polars/dsl/utils/__init__.py

Member Author

rjzamora Apr 14, 2025

This file comes from #18369

rjzamora commented

View reviewed changes

python/cudf_polars/cudf_polars/dsl/utils/naming.py

Member Author

rjzamora Apr 14, 2025

This file comes from #18369

Member Author

rjzamora commented Apr 14, 2025

/ok to test

copy-pr-bot bot commented Apr 14, 2025

/ok to test

@rjzamora, there was an error processing your request: E1

See the following link for more information: https://docs.gha-runners.nvidia.com/cpr/e/1/

This was referenced Apr 14, 2025

Enable multi-partition Select operations containing basic aggregations #17941

Closed

[DMN][WIP] Experimental multi-GPU Polars testing #18335

Closed


          Merge branch 'branch-25.06' into complex-aggregations-ir

a3494b7

rjzamora marked this pull request as ready for review

April 14, 2025 20:11

rjzamora requested a review from a team as a code owner

April 14, 2025 20:11

rjzamora requested review from brandon-b-miller and wence-

April 14, 2025 20:11

rjzamora added 4 commits

April 15, 2025 14:03


          Merge remote-tracking branch 'upstream/branch-25.06' into complex-agg…

b4d4507

…regations-ir


          improve coverage

657ebae


          Merge branch 'branch-25.06' into complex-aggregations-ir

121ec5f


          Merge remote-tracking branch 'upstream/branch-25.06' into complex-agg…

e7ed92a

…regations-ir

This was referenced Apr 17, 2025

Rewrite groupby aggregations in cudf-polars to simplify evaluation #18369

Merged

[FEA] Streaming Sort in cudf-polars #18527

Closed

rjzamora added 2 commits

April 18, 2025 17:38


          Merge branch 'branch-25.06' into complex-aggregations-ir


          Merge branch 'branch-25.06' into complex-aggregations-ir

0c26572

wence- reviewed

View reviewed changes

Contributor

wence- left a comment

This looks pretty good I think. I have relatively minor comments

python/cudf_polars/cudf_polars/dsl/ir.py Outdated Show resolved Hide resolved

python/cudf_polars/cudf_polars/experimental/select.py

Comment on lines +106 to +109

+                          # Try decomposing the underlying expressions
+                          return decompose_select(
+                              ir, child, partition_info, rec.state["config_options"]
+                          )

Contributor

wence- Apr 23, 2025

I was thinking that you could do the rewrite as a pass over the IR representation before lowering and assigning partitioning (which I think would make things a bit simpler), but I guess you don't want to do the "complicated" thing if there's only a single partition so we need to do things here.

Member Author

rjzamora Apr 23, 2025

There is also the fact that we don't handle iterative lowering at the moment, so it would take a larger diff to make something like that work. We would run into Select(Agg) nodes during the lowering stage and need to know (or deduce) that those nodes are already decomposed.

Member Author

rjzamora Apr 23, 2025

However, I do think your right that the key problem is that you don't want to do the decomposition unless you need to.

python/cudf_polars/cudf_polars/experimental/expressions.py Outdated Show resolved Hide resolved

python/cudf_polars/cudf_polars/experimental/expressions.py Outdated Show resolved Hide resolved

python/cudf_polars/cudf_polars/experimental/expressions.py Outdated Show resolved Hide resolved

python/cudf_polars/cudf_polars/experimental/expressions.py Outdated

Comment on lines 198 to 207

+                  See Also
+                  --------
+                  _add_select_ir
+                  _decompose_expr_node
+                  Notes
+                  -----
+                  This function is called by ``_decompose_expr_node`` to decompose
+                  an Agg node into multiple IR nodes. The new IR nodes are added
+                  with ``_add_select_ir``.

Contributor

wence- Apr 23, 2025

nit: As above, I am not sure these xrefs add much to locally understanding how to use this function/what it does, and would perhaps be better served by an overview module-level docstring/comment.

python/cudf_polars/cudf_polars/experimental/expressions.py Show resolved Hide resolved

python/cudf_polars/cudf_polars/experimental/expressions.py Outdated Show resolved Hide resolved

python/cudf_polars/cudf_polars/experimental/expressions.py Outdated Show resolved Hide resolved

python/cudf_polars/cudf_polars/experimental/expressions.py Outdated Show resolved Hide resolved


          Merge remote-tracking branch 'upstream/branch-25.06' into complex-agg…

51eb142

…regations-ir

rjzamora added 8 commits

April 23, 2025 12:20


          address partial code review

69e6168


          remove _maybe_shuffle and refactor _decompose_expr_node

2ed1e26


          fix unique_input_irs

7f25e4b


          tweak comments

77b372f


          remove comment

9764a65


          Merge remote-tracking branch 'upstream/branch-25.06' into complex-agg…

69642c1

…regations-ir


          Merge branch 'branch-25.06' into complex-aggregations-ir

02343a3


          Merge remote-tracking branch 'upstream/branch-25.06' into complex-agg…

36c8551

…regations-ir

rjzamora added 4 - Needs Review and removed 2 - In Progress labels

rjzamora requested a review from wence-

April 25, 2025 16:47

rjzamora added 3 commits

April 28, 2025 06:50


          Merge remote-tracking branch 'upstream/branch-25.06' into complex-agg…

ffa5964

…regations-ir


          high-level notes

45ab83b


          Merge branch 'branch-25.06' into complex-aggregations-ir

a6e8ea6

wence- reviewed

View reviewed changes

python/cudf_polars/cudf_polars/experimental/expressions.py

Comment on lines +240 to +255

+                      columns, input_ir, partition_info = select(
+                          [Cast(agg.dtype, agg)],
+                          input_ir,
+                          partition_info,
+                          names=names,
+                          repartition=True,
+                      )
+                      # Combined stage
+                      (column,) = columns
+                      columns, input_ir, partition_info = select(
+                          [Agg(agg.dtype, "sum", None, column)],
+                          input_ir,
+                          partition_info,
+                          names=names,
+                      )

Contributor

wence- Apr 29, 2025 •

edited

Loading

To understand this, is there a reason we can't put the cast into the final sum aggregation? Ah, it's the repartitioning step?

wence- reviewed

View reviewed changes

python/cudf_polars/cudf_polars/experimental/expressions.py

Comment on lines +372 to +374

+                      schema: MutableMapping[str, Any] = {}
+                      for ir in unique_input_irs:
+                          schema.update(ir.schema)

Contributor

wence- Apr 29, 2025

nit (maybe a followup): should we check that none of the column names overlap?

wence- approved these changes

View reviewed changes

Contributor

wence- left a comment

Thanks for all the work here @rjzamora! Looks good to me

Contributor

wence- commented Apr 29, 2025

/merge

rapids-bot bot merged commit c40ca62 into rapidsai:branch-25.06

111 checks passed

github-project-automation bot moved this to Done in cuDF Python

rjzamora deleted the complex-aggregations-ir branch

April 29, 2025 13:48

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

4 - Needs Review cudf-polars feature request non-breaking Python