Simplify memory-resource handling in Dask integration#172
Simplify memory-resource handling in Dask integration#172rapids-bot[bot] merged 2 commits intorapidsai:branch-25.06from
Conversation
There was a problem hiding this comment.
Note: The current memory resource doesn't need to be a pool for spilling to work.
|
I think this closes #157. Does this require that the dask-cuda worker be created with any special arguments? I think the answer is no: |
Right - It doesn't require the user to pass in an |
|
In that case, it might be good to add that note to our quickstart example: https://github.com/rapidsai/rapids-multi-gpu/blob/branch-25.06/docs/source/quickstart.md#dask-cudf-example. |
Good suggestion. I tweaked the code and added a brief comment. |
|
/merge |
This is a follow-up to #150
Closes #157
The goal of this PR is to simplify memory-resource creation by avoiding it in
rapidsmpaltogether. Since the user can just useLocalCUDAClusterto deploy a Dask cluster, they can also use existing options/utilities to create a memory pool on each worker. Whenrapidsmp.integrations.dask.bootstrap_dask_clusteris called, each worker only needs to wrap the current memory resource in aStatisticsResourceAdaptor.This is technically "breaking", because it removes the
pool_sizeargument frombootstrap_dask_cluster. However, we are only using that option in rapidsai/cudf#18335 (which is still experimental - and can be easily changed).