[cudagraphs] Refactor cudagraph capture loop #32946

LucasWilkinson merged 2 commits into vllm-project:main from
Conversation
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Code Review
The pull request refactors the CUDA graph capture loop by centralizing the logic for deciding which graphs to capture into the CudagraphDispatcher. This significantly cleans up the capture_model method in gpu_model_runner.py. New test cases were added to verify that the dispatcher's get_capture_descs method behaves as expected. However, a critical issue was identified in how uniform_decode is determined during CUDA graph capture, which could lead to incorrect graph configurations.
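To make the review's point concrete, here is a minimal, hypothetical sketch of the idea it describes: the dispatcher, rather than the model runner, enumerates the batch descriptors to capture. The BatchDescriptor fields and the internals of get_capture_descs below are illustrative assumptions, not the actual vLLM implementation; in particular, the uniform_decode rule shown is a simplified stand-in for the logic the review flags as subtle.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class BatchDescriptor:
    num_tokens: int
    uniform_decode: bool  # True when every request decodes the same number of tokens

class CudagraphDispatcher:
    """Hypothetical simplified dispatcher: owns the list of capture sizes."""

    def __init__(self, capture_sizes, max_query_len=1):
        self.capture_sizes = sorted(capture_sizes)
        self.max_query_len = max_query_len

    def get_capture_descs(self):
        # Centralized in one place: enumerate every (size, mode) the
        # runner should capture, instead of rebuilding this list ad hoc.
        descs = []
        for n in self.capture_sizes:
            # Illustrative rule only: treat the batch as uniform decode
            # when it divides evenly into max_query_len-token requests.
            uniform = n % self.max_query_len == 0
            descs.append(BatchDescriptor(num_tokens=n, uniform_decode=uniform))
        return descs
```

With a single source of truth, the capture loop and the runtime dispatch key can no longer drift apart, which is the duplication the reviewers call out below.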
ProExpertProg left a comment
Nice, didn't realize we had logic for different keys in two places
  # We skip EPLB here since we don't want to record dummy metrics
- for num_tokens, activate_lora in compilation_cases:
+ for batch_desc in batch_descriptors:
+     num_tokens = batch_desc.num_tokens
I feel like we're moving closer and closer to passing BatchDescriptor to dummy run directly...
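For reference, the shape of the refactored loop the diff above arrives at could be sketched as follows. This is a hypothetical simplification: capture_model and dummy_run here are stand-ins for the real gpu_model_runner.py methods, and the runner simply consumes whatever descriptors the dispatcher hands it.

```python
def capture_model(dispatcher, dummy_run):
    """Iterate dispatcher-provided descriptors and capture a graph per size.

    `dispatcher` must expose get_capture_descs(); `dummy_run` is a
    callable taking num_tokens (both assumptions for this sketch).
    """
    captured = []
    for batch_desc in dispatcher.get_capture_descs():
        num_tokens = batch_desc.num_tokens
        dummy_run(num_tokens)  # warm up / capture at this batch size
        captured.append(batch_desc)
    return captured
```

The comment's suggestion follows naturally from this shape: since the loop already holds a full BatchDescriptor, passing it to the dummy run directly (instead of unpacking num_tokens) would be a small further step.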
Signed-off-by: 陈建华 <1647430658@qq.com>
Refactor the cudagraph capture loop to pave the way for different PIECEWISE and FULL capture sizes and for dynamic spec-decode sizes.