Add deadlock warnings to Stream.add_callback and Stream.async_done docstrings #321

kkraus14 · 2025-07-16T14:35:22Z

This PR adds important deadlock warnings to the docstrings of Stream.add_callback and Stream.async_done methods.

Changes Made

Added warning about potential deadlock when using libraries that call CUDA functions without releasing the GIL
This can occur when callback functions attempt to acquire the GIL while another thread is holding it and making CUDA calls
Recommends using libraries that properly release the GIL around CUDA operations

Context

Based on the third bullet point from #317 (comment), these methods need clearer documentation about potential deadlock scenarios when integrating with other CUDA libraries that don't properly handle GIL management.

The warnings help developers understand the threading implications and make informed decisions about their code architecture to avoid deadlocks.

This pull request was generated from Cursor

…cstrings - Add warning about potential deadlock when using libraries that call CUDA functions without releasing the GIL - This can occur when callback functions attempt to acquire the GIL while another thread is holding it and making CUDA calls - Recommends using libraries that properly release the GIL around CUDA operations

copy-pr-bot · 2025-07-16T14:35:25Z

Auto-sync is disabled for ready for review pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

- Clarify that deadlock is due to lock ordering issue between GIL and CUDA driver lock - Remove reference to 'another thread attempting to make CUDA calls' as this is not required - Focus on the core issue: callback acquiring GIL while CUDA driver lock is held

gmarkall · 2025-07-18T14:00:04Z

/ok to test

- Add deadlock warnings to Stream.add_callback and Stream.async_done docstrings (NVIDIA#321) - `MemoryPointer`: ensure `CUdeviceptr` used with NVIDIA binding (NVIDIA#328) - Fix indexing GPUs with CUdevice object (NVIDIA#319) - Fix bindings: consistency of contexts, streams, and events, similar to NVIDIA#295 (NVIDIA#296) - Fix nvrtc resolution when CUDA_HOME env is set (NVIDIA#314)

- Add deadlock warnings to Stream.add_callback and Stream.async_done docstrings (#321) - `MemoryPointer`: ensure `CUdeviceptr` used with NVIDIA binding (#328) - Fix indexing GPUs with CUdevice object (#319) - Fix bindings: consistency of contexts, streams, and events, similar to #295 (#296) - Fix nvrtc resolution when CUDA_HOME env is set (#314)

…cstrings (NVIDIA#321) * Add deadlock warnings to Stream.add_callback and Stream.async_done docstrings - Add warning about potential deadlock when using libraries that call CUDA functions without releasing the GIL - This can occur when callback functions attempt to acquire the GIL while another thread is holding it and making CUDA calls - Recommends using libraries that properly release the GIL around CUDA operations * Update docstring warnings to clarify lock ordering issue - Clarify that deadlock is due to lock ordering issue between GIL and CUDA driver lock - Remove reference to 'another thread attempting to make CUDA calls' as this is not required - Focus on the core issue: callback acquiring GIL while CUDA driver lock is held

- Add deadlock warnings to Stream.add_callback and Stream.async_done docstrings (NVIDIA#321) - `MemoryPointer`: ensure `CUdeviceptr` used with NVIDIA binding (NVIDIA#328) - Fix indexing GPUs with CUdevice object (NVIDIA#319) - Fix bindings: consistency of contexts, streams, and events, similar to NVIDIA#295 (NVIDIA#296) - Fix nvrtc resolution when CUDA_HOME env is set (NVIDIA#314)

kkraus14 added 2 commits July 16, 2025 10:40

fix docstrings

fb9db38

gmarkall added the 3 - Ready for Review Ready for review by team label Jul 18, 2025

gmarkall approved these changes Jul 18, 2025

View reviewed changes

gmarkall added 5 - Ready to merge Testing and reviews complete, ready to merge and removed 3 - Ready for Review Ready for review by team labels Jul 18, 2025

gmarkall merged commit 43184d7 into NVIDIA:main Jul 18, 2025
249 of 273 checks passed

gmarkall mentioned this pull request Jul 18, 2025

Bump version to 0.17.0 #331

Merged

kkraus14 mentioned this pull request Aug 25, 2025

[FEA] DeviceBuffer and other classes that hold smart pointers with destructors that make CUDA API calls should release the GIL on destruction rapidsai/rmm#2026

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add deadlock warnings to Stream.add_callback and Stream.async_done docstrings #321

Add deadlock warnings to Stream.add_callback and Stream.async_done docstrings #321

Uh oh!

kkraus14 commented Jul 16, 2025

Uh oh!

copy-pr-bot bot commented Jul 16, 2025

Uh oh!

gmarkall commented Jul 18, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add deadlock warnings to Stream.add_callback and Stream.async_done docstrings #321

Add deadlock warnings to Stream.add_callback and Stream.async_done docstrings #321

Uh oh!

Conversation

kkraus14 commented Jul 16, 2025

Changes Made

Context

Uh oh!

copy-pr-bot bot commented Jul 16, 2025

Uh oh!

gmarkall commented Jul 18, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants