
Unify treatment of Expr and IR nodes in cudf-polars DSL #17016

Merged

Conversation

@wence- wence- (Contributor) commented Oct 8, 2024

Description

As part of in-progress multi-GPU work, we will likely want to:

  1. Introduce additional nodes into the IR namespace;
  2. Implement rewrite rules for IR trees to express needed communication patterns;
  3. Write visitors that translate expressions into an appropriate description for whichever multi-GPU approach we end up taking.

It was already straightforward to write generic visitors for Expr nodes, since those uniformly have a .children property for their dependencies. In contrast, the IR nodes were more ad hoc. To solve this, pull the generic implementation out of Expr into an abstract Node class; Expr nodes now inherit from it, and IR nodes do likewise.
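For orientation, a minimal sketch of the shape this takes (illustrative only; the real `Node` class carries more machinery, and `BinOp` here is just an example subclass):

```python
# Hedged sketch of the abstract Node idea described above; names like
# Node, children, _non_child, and reconstruct follow the PR description,
# but the details here are illustrative, not the real implementation.
from __future__ import annotations


class Node:
    """Base class for DAG nodes with a uniform .children property."""

    __slots__ = ("children",)
    _non_child: tuple[str, ...] = ()  # constructor args that are not children

    def reconstruct(self, children: list[Node]) -> Node:
        """Rebuild this node with new children, keeping other attributes."""
        args = (getattr(self, name) for name in self._non_child)
        return type(self)(*args, *children)


class BinOp(Node):
    """Example concrete node: a binary operation on two sub-expressions."""

    __slots__ = ("op",)
    _non_child = ("op",)

    def __init__(self, op: str, left: Node, right: Node) -> None:
        self.op = op
        self.children = (left, right)
```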

Redoing the IR nodes is a little painful because we want to make them hashable, so we have to provide a bunch of custom get_hashable implementations (the schema dict, for example, is not hashable).
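Building on the sketch above, an IR node with a dict-valued schema might look like this (again illustrative; the real `get_hashable` contract differs in detail):

```python
# Sketch of the get_hashable idea: the schema dict is not hashable, so
# an IR node hashes a tuple of its items instead.
class IR(Node):
    __slots__ = ("schema",)
    _non_child = ("schema",)

    def __init__(self, schema: dict, *children: Node) -> None:
        self.schema = schema
        self.children = children

    def get_hashable(self) -> tuple:
        # Convert the unhashable schema dict into a hashable tuple.
        return (type(self), tuple(self.schema.items()), self.children)

    def __hash__(self) -> int:
        return hash(self.get_hashable())
```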

With these generic facilities in place, we can now implement traversal and visitor infrastructure (a sketch follows the list below). Specifically, we provide:

  • a mechanism for pre-order traversal of an expression DAG, yielding each unique node exactly once. This is useful if one wants to know if an expression contains some particular node;
  • a mechanism for writing recursive visitors and then wrapping a caching scheme around the outside. This is useful for rewrites.
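A hedged sketch of both facilities, with names mirroring the description above (the real implementations differ in detail):

```python
# Sketch only: unique pre-order traversal plus a cache-wrapped visitor.
def traversal(node: Node):
    """Pre-order traversal of a DAG, yielding each unique node once."""
    seen = {node}
    stack = [node]
    while stack:
        n = stack.pop()
        yield n
        for child in n.children:
            if child not in seen:
                seen.add(child)
                stack.append(child)


class CachingVisitor:
    """Wrap a recursive visitor function with a per-traversal cache."""

    def __init__(self, fn):
        self.fn = fn  # fn(node, self) -> result; recurses via self(child)
        self.cache: dict = {}

    def __call__(self, node):
        try:
            return self.cache[node]
        except KeyError:
            return self.cache.setdefault(node, self.fn(node, self))


# Example: does an expression contain any BinOp node?
# contains_binop = any(isinstance(n, BinOp) for n in traversal(expr))
```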

Some example usages are shown in tests.

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

@wence- wence- requested a review from a team as a code owner October 8, 2024 17:12
@wence- wence- requested review from bdice and charlesbluca October 8, 2024 17:12
@github-actions github-actions bot added Python Affects Python cuDF API. cudf.polars Issues specific to cudf.polars labels Oct 8, 2024
@wence- wence- added improvement Improvement / enhancement to an existing function breaking Breaking change labels Oct 8, 2024
@rjzamora rjzamora (Member) left a comment

Thanks for working on this @wence- ! I'll probably end up taking several passes over this. Leaving a few comments after my first pass.

@wence- wence- force-pushed the wence/fea/polars-uniform-nodes branch from 48e99d0 to 9ea84a6 Compare October 9, 2024 11:22
@wence- wence- force-pushed the wence/fea/polars-uniform-nodes branch from 9ea84a6 to 87970fc Compare October 10, 2024 09:54
@GregoryKimball GregoryKimball requested a review from madsbk October 10, 2024 13:33
@Matt711 Matt711 (Contributor) left a comment

Just a few small things I noticed. I haven't really been a part of the multi-GPU discussions. Can you explain at a high level how the cudf.polars IR graph will be translated to a dask task graph? I think that's the approach @rjzamora mentioned offline.

Edit: I'm asking for my own understanding, and because it might help us identify other helper functions like reuse_if_unchanged that could help with the effort.

@wence- wence- (Contributor, Author) commented Oct 11, 2024

Just a few small things I noticed. I haven't really been a part of the multi-GPU discussions. Can you explain at a high level how the cudf.polars IR graph will be translated to a dask task graph? I think that's the approach @rjzamora mentioned offline.

Not completely straightforwardly. But broadly: write a visitor that emits the correct task graph for computing a node in the plan, given that its children have already been computed (i.e. we already have a task graph for them).
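As a much-simplified, hypothetical sketch of that idea, using a dask-style dict task graph (`do_evaluate` stands in for however a node computes its result; a real lowering would emit one task per output partition, not one per node):

```python
# Build a dask-style dict task graph from an IR plan, bottom-up.
def lower_to_tasks(node: Node, rec: "CachingVisitor") -> dict:
    graph: dict = {}
    for child in node.children:
        graph.update(rec(child))  # children are lowered (and cached) first
    key = (type(node).__name__.lower(), id(node))
    child_keys = [(type(c).__name__.lower(), id(c)) for c in node.children]
    # `node.do_evaluate` is a hypothetical stand-in for however the node
    # computes its result from its already-computed children.
    graph[key] = (node.do_evaluate, *child_keys)
    return graph


# task_graph = CachingVisitor(lower_to_tasks)(plan)
```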

Edit: I'm asking for my own understanding, and because it might help us identify other helper functions like reuse_if_unchanged that could help with the effort.

I think we should wait on that until we write things, and then see whether we can generalise the implementation.

Right now I know I need reuse-if-unchanged, because it's a useful utility for transforming an IR graph into another one of the same type with some new nodes.
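For reference, a sketch of what such a helper can look like, in terms of the `Node` sketch above:

```python
# Rewrite helper: rebuild a node only if any child actually changed,
# otherwise return the original node unchanged (preserving sharing).
def reuse_if_unchanged(node: Node, rec: "CachingVisitor") -> Node:
    new_children = [rec(child) for child in node.children]
    if all(new is old for new, old in zip(new_children, node.children)):
        return node
    return node.reconstruct(new_children)
```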

@wence- wence- closed this Oct 11, 2024
@wence- wence- reopened this Oct 11, 2024
@wence- wence- (Contributor, Author) commented Oct 11, 2024

I will rebase on top of #17014 on Monday.

@rjzamora rjzamora (Member) commented

Just a few small things I noticed. I haven't really been a part of the multi-GPU discussions. Can you explain at a high level how the cudf.polars IR graph will be translated to a dask task graph? I think that's the approach @rjzamora mentioned offline.

Not completely straightforwardly. But broadly: write a visitor that emits the correct task graph for computing a node in the plan, given that its children have already been computed (i.e. we already have a task graph for them).

Yes. We will eventually want to implement a visitor that can traverse the final IR/Node graph to generate a task graph that can be scheduled with distributed/dask-cuda.

Further Notes: In practice, it will probably make sense to start by rewriting the initial IR graph into a "partition-aware" IR graph before we generate the task graph. In other words: We probably want to define and propagate leaf-node partitioning, and define boundaries between node sequences that do or do not require inter-partition communication. Ideally, this partition-aware representation would not have anything Dask-specific in it.

@Matt711 Matt711 (Contributor) commented Oct 11, 2024

Not completely straightforwardly. But broadly: write a visitor that emits the correct task graph for computing a node in the plan, given that its children have already been computed (i.e. we already have a task graph for them).

Yes. We will eventually want to implement a visitor that can traverse the final IR/Node graph to generate a task graph that can be scheduled with distributed/dask-cuda.

Thanks, that's a nice overview.

Further Notes: In practice, it will probably make sense to start by rewriting the initial IR graph into a "partition-aware" IR graph before we generate the task graph. In other words: We probably want to define and propagate leaf-node partitioning,

So this "partition-aware" graph will need to carry metadata about the partitions? And as the graph is executed, this metadata will be carried upward.

and define boundaries between node sequences that do or do not require inter-partition communication. Ideally, this partition-aware representation would not have anything Dask-specific in it.

I think this is because some operations can execute on a single partition and others can't. What does "inter-partition communication" look like? Is this where shuffling the data comes into play?

@rjzamora rjzamora (Member) commented

So this "partition-aware" graph will need to carry metadata about the partitions? And as the graph is executed, this metadata will be carried upward.

There are probably several ways to approach this, so we don't need to commit to anything specific right now. We will probably want to be able to ask a node how many partitions it will produce (i.e. how many output tasks it will define). One way to do this is to attach an npartitions-like property to all IR nodes. Most operations will simply inherit the partitioning of their child(ren), but leaf nodes and reductions will depend on the data/algorithm.
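A hedged sketch of that idea (nothing here is settled API; `paths` is a hypothetical attribute standing in for whatever a leaf node knows about its input):

```python
# Ask an IR node how many partitions (output tasks) it will produce.
def npartitions(node: Node) -> int:
    if not node.children:
        # Leaf node (e.g. a scan): partitioning depends on the data,
        # say one partition per input file (hypothetical `paths`).
        return len(getattr(node, "paths", [None]))
    # Default: inherit from the first child. Reductions, joins, etc.
    # would override this with algorithm-specific logic.
    return npartitions(node.children[0])
```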

I think this is because some operations can execute on a single partition and others can't. What does "inter-partition communication" look like? Is this where shuffling the data comes into play?

Yes. If I do something like add two columns together to assign a new column, then we don't need any communication between partition-0 and partition-1. However, if we want to perform a groupby aggregation, we usually do need to reduce/shuffle data between distinct partitions. When we use dask for execution, we define the inter-partition communication as edges in the task graph.
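To illustrate with dask-style dict graphs (all function names are hypothetical placeholders, and a real shuffle would involve many more tasks than this single combine):

```python
def add_columns(df): ...          # elementwise: needs one partition only
def partial_groupby(df): ...      # per-partition partial aggregation
def combine_groups(chunks): ...   # combines partial results

# Elementwise op: each output task touches exactly one input partition.
elementwise = {
    ("add", 0): (add_columns, ("df", 0)),
    ("add", 1): (add_columns, ("df", 1)),
}

# Groupby aggregation: the combine task depends on *both* partitions,
# i.e. an inter-partition edge in the task graph.
groupby = {
    ("chunk", 0): (partial_groupby, ("df", 0)),
    ("chunk", 1): (partial_groupby, ("df", 1)),
    ("combine", 0): (combine_groups, [("chunk", 0), ("chunk", 1)]),
}
```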

@wence- wence- force-pushed the wence/fea/polars-uniform-nodes branch from 87970fc to 74a98f0 Compare October 14, 2024 09:49
@wence- wence- (Contributor, Author) commented Oct 14, 2024

Rebased and fixed conflicts after the expression move.

@wence- wence- force-pushed the wence/fea/polars-uniform-nodes branch from a7d050c to 9a8d59e Compare October 14, 2024 11:34
@wence- wence- requested a review from a team as a code owner October 14, 2024 11:34
@wence- wence- force-pushed the wence/fea/polars-uniform-nodes branch from 9a8d59e to 56e1634 Compare October 14, 2024 11:37
@wence- wence- force-pushed the wence/fea/polars-uniform-nodes branch from 56e1634 to 73019c8 Compare October 14, 2024 11:56
@Matt711 Matt711 (Contributor) left a comment

Thanks @wence- for adding examples to overview.md. I had some non-blocking questions.

@rjzamora rjzamora (Member) left a comment

Thanks for working on this @wence- !

I have a few minor thoughts/questions, but I generally approve of the change.

@vyasr vyasr (Contributor) left a comment

I'm not exactly sure where this is going next since none of the traversal logic seems to actually be used yet, but I'm sure I'll see it soon :)

@wence- wence- force-pushed the wence/fea/polars-uniform-nodes branch from 6e6d68b to f68d914 Compare October 17, 2024 16:05
This simplifies things a bit and means we don't need to type the
children property everywhere else.
@wence- wence- force-pushed the wence/fea/polars-uniform-nodes branch from f68d914 to 6260ff9 Compare October 17, 2024 16:11
@wence- wence- requested a review from vyasr October 18, 2024 08:01
@vyasr vyasr (Contributor) left a comment

Looks great. A couple of threads are still open and it would be good to resolve them one way or another, but I don't need to review again. Thanks!

This simplifies the implementation and removes the need for
type-narrowing. The special method `__eq__` handles objects of any
type.
@wence- wence- (Contributor, Author) commented Oct 21, 2024

/merge

@rapids-bot rapids-bot bot merged commit 637e320 into rapidsai:branch-24.12 Oct 22, 2024
102 checks passed
@wence- wence- deleted the wence/fea/polars-uniform-nodes branch October 22, 2024 10:25
rapids-bot bot pushed a commit that referenced this pull request Oct 30, 2024
…t filters (#17141)

Previously, we always applied parquet filters by post-filtering. This negates much of the potential gain from having filters available at read time, namely discarding row groups. To fix this, implement conversion to pylibcudf expressions using the new visitor system of #17016.

We must distinguish two types of expressions: ones that we can evaluate via `cudf::compute_column`, and the more restricted set of expressions that the parquet reader understands. This is handled by having a state that tracks the usage. The former style will be useful when we implement inequality joins.

While here, extend the support in pylibcudf expressions to handle all supported literal types, and expose `compute_column` so we can test the correctness of the broader (non-parquet) implementation.

Authors:
  - Lawrence Mitchell (https://github.com/wence-)

Approvers:
  - Vyas Ramasubramani (https://github.com/vyasr)

URL: #17141