@LucasWilkinson
Collaborator

Fix DBO using full-cudagraphs for mixed-prefill-decode batches (it should not have been doing this)

Signed-off-by: Lucas Wilkinson <[email protected]>
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Code Review

This pull request addresses a hang in the Dual Batch Overlap (DBO) feature when processing mixed prefill-decode batches. It changes the condition for replaying a cached CUDA graph so that replay happens only when the runtime mode is CUDAGraphMode.FULL, which prevents a full CUDA graph from being replayed on an incompatible mixed batch. The fix is targeted and appears correct. I have not identified any further issues of high or critical severity in this change.
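
To make the condition described above concrete, here is a minimal sketch of the gating logic, not the actual vLLM code. The `CUDAGraphMode` enum below is a local stand-in defined only for this example, and `should_replay_cached_graph`, `cached_graphs`, and the token counts are hypothetical names and values. It assumes that a mixed prefill-decode batch arrives with a runtime mode other than FULL.

```python
from enum import Enum


class CUDAGraphMode(Enum):
    # Local stand-in for illustration only; not imported from vLLM.
    NONE = 0
    PIECEWISE = 1
    FULL = 2


def should_replay_cached_graph(num_tokens, cached_graphs, runtime_mode):
    """Return True only if a graph is cached for this batch size AND the
    runtime mode is FULL. Checking the cache alone would replay a full
    graph for batches that were dispatched in a different mode."""
    return num_tokens in cached_graphs and runtime_mode is CUDAGraphMode.FULL


# Hypothetical cache keyed by token count.
cached_graphs = {256: "full-graph-for-256-tokens"}

# Batch dispatched in FULL mode with a cached size: replay.
print(should_replay_cached_graph(256, cached_graphs, CUDAGraphMode.FULL))
# Same size but a non-FULL mode (assumed here for a mixed batch): no replay.
print(should_replay_cached_graph(256, cached_graphs, CUDAGraphMode.PIECEWISE))
# FULL mode but nothing cached for this size: no replay.
print(should_replay_cached_graph(128, cached_graphs, CUDAGraphMode.FULL))
```

The point of the sketch is that a cache hit on the batch size is not sufficient on its own; the runtime mode has to agree before a full graph is replayed.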

@DarkLight1337
Member

Is this related to #25607?

@tlrmchlsmth tlrmchlsmth added the "ready" label (ONLY add when PR is ready to merge/full CI is needed) Sep 25, 2025
@tlrmchlsmth tlrmchlsmth enabled auto-merge (squash) September 25, 2025 14:21
@tlrmchlsmth tlrmchlsmth merged commit 13cc7f5 into vllm-project:main Sep 25, 2025
55 checks passed
yewentao256 pushed a commit that referenced this pull request Oct 3, 2025
Signed-off-by: Lucas Wilkinson <[email protected]>
Signed-off-by: yewentao256 <[email protected]>
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 10, 2025
Signed-off-by: Lucas Wilkinson <[email protected]>
Signed-off-by: xuebwang-amd <[email protected]>
choprahetarth pushed a commit to Tandemn-Labs/vllm that referenced this pull request Oct 11, 2025
lywa1998 pushed a commit to lywa1998/vllm that referenced this pull request Oct 20, 2025
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 24, 2025
Signed-off-by: Lucas Wilkinson <[email protected]>
Signed-off-by: xuebwang-amd <[email protected]>
rtourgeman pushed a commit to rtourgeman/vllm that referenced this pull request Nov 10, 2025

Labels

ready (ONLY add when PR is ready to merge/full CI is needed), v1

4 participants