
Simplify memory-resource handling in Dask integration #172

Merged
rapids-bot[bot] merged 2 commits into rapidsai:branch-25.06 from rjzamora:use-existing-mr-dask
Mar 26, 2025

Conversation

@rjzamora (Member) commented Mar 26, 2025

This is a follow-up to #150

Closes #157

The goal of this PR is to simplify memory-resource creation by avoiding it in rapidsmp altogether. Since the user can just use LocalCUDACluster to deploy a Dask cluster, they can also use existing options/utilities to create a memory pool on each worker. When rapidsmp.integrations.dask.bootstrap_dask_cluster is called, each worker only needs to wrap the current memory resource in a StatisticsResourceAdaptor.

This is technically "breaking", because it removes the pool_size argument from bootstrap_dask_cluster. However, we are only using that option in rapidsai/cudf#18335 (which is still experimental - and can be easily changed).
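
For context, here is a hedged sketch of the simplified flow from the user's side; the exact `bootstrap_dask_cluster` call signature is an assumption, not taken from this PR:

```python
# Sketch of the simplified flow; call signature assumed for illustration.
from dask_cuda import LocalCUDACluster
from distributed import Client

from rapidsmp.integrations.dask import bootstrap_dask_cluster

# dask-cuda deploys the cluster and owns all memory-resource setup.
cluster = LocalCUDACluster()
client = Client(cluster)

# No pool_size argument anymore: each worker simply wraps its current
# RMM memory resource in a StatisticsResourceAdaptor.
bootstrap_dask_cluster(client)
```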

@rjzamora rjzamora added the "breaking" and "improvement" labels Mar 26, 2025
@rjzamora rjzamora self-assigned this Mar 26, 2025
@rjzamora rjzamora requested a review from a team as a code owner March 26, 2025 19:07
@rjzamora (Member Author)

Note: The current memory resource doesn't need to be a pool for spilling to work.

@TomAugspurger (Contributor)

I think this closes #157.

Does this require that the dask-cuda worker be created with any special arguments? I think the answer is no: rmm.mr.get_current_device_resource will always return something, regardless of whether the user created the LocalCUDACluster with something like rmm_pool_size.
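
A minimal sketch of the per-worker wrapping being discussed, assuming `bootstrap_dask_cluster` does essentially this with RMM's public API:

```python
import rmm

# Always returns a resource: the default CudaMemoryResource, or whatever
# dask-cuda installed (e.g. a pool when rmm_pool_size was passed).
current_mr = rmm.mr.get_current_device_resource()

# Wrap it so allocation statistics can be tracked (e.g. for spilling);
# note the wrapped resource does not need to be a pool.
stats_mr = rmm.mr.StatisticsResourceAdaptor(current_mr)
rmm.mr.set_current_device_resource(stats_mr)
```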

@rjzamora rjzamora changed the title remove pool creation, and use existing mr on each process Simplify memory-resource handling in Dask integration Mar 26, 2025
@rjzamora (Member Author)

> Does this require that the dask-cuda worker be created with any special arguments? I think the answer is no: rmm.mr.get_current_device_resource will always return something, regardless of whether the user created the LocalCUDACluster with something like rmm_pool_size.

Right - it doesn't require the user to pass an rmm_pool_size argument, but they should if they want good performance.
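
For illustration, an assumed example of the recommended setup; rmm_pool_size is a real dask-cuda option, but the size shown here is arbitrary:

```python
from dask_cuda import LocalCUDACluster
from distributed import Client

# Pre-allocating a per-GPU pool avoids repeated cudaMalloc/cudaFree calls
# on the hot path, which is where the performance benefit comes from.
cluster = LocalCUDACluster(rmm_pool_size="20GiB")
client = Client(cluster)
```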

@TomAugspurger (Contributor)

In that case, it might be good to add that note to our quickstart example: https://github.com/rapidsai/rapids-multi-gpu/blob/branch-25.06/docs/source/quickstart.md#dask-cudf-example.

@rjzamora (Member Author)

> In that case, it might be good to add that note to our quickstart example

Good suggestion. I tweaked the code and added a brief comment.

@pentschev (Member) left a comment

Makes sense, thanks @rjzamora .

@rjzamora (Member Author)

/merge

@rapids-bot[bot] merged commit b1ca99b into rapidsai:branch-25.06 Mar 26, 2025
25 checks passed
@rjzamora rjzamora deleted the use-existing-mr-dask branch March 26, 2025 22:11

Labels

breaking: Introduces a breaking change
improvement: Improves an existing functionality


Development

Successfully merging this pull request may close these issues.

Let rapidsmp use the RMM pools from a dask_cuda.LocalCUDACluster

3 participants
