[Bugfix][CI] Move resolving cudagraph_mode before initializing attn_metadata_builder #27427

fhl2000 · 2025-10-23T17:04:22Z

Purpose

Previously, resolving cudagraph_mode happened after attn_metadata_builder initialisation, which means attn_metadata_builder is not aware of the potential changed cudagraph_mode. (could lead to querying invalid cudagraph_capturing_sizes list if the resolved mode is NONE).

This should also fix the CI for #26016

Test Plan

To see if the CI is fixed in #26016 after merging this.

Test Result

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

Signed-off-by: fhl2000 <[email protected]>

gemini-code-assist

Code Review

This pull request correctly addresses a bug where cudagraph_mode was resolved after the initialization of attn_metadata_builder, which could lead to incorrect behavior. The fix refactors the initialization logic in GPUModelRunner to ensure that cudagraph_mode is determined based on attention backend capabilities before the metadata builders are instantiated. This is achieved by first collecting all attention backend classes, determining the minimum supported CUDA graph level, and only then initializing the attention groups. The changes are logical, well-implemented, and effectively resolve the issue. The associated refactoring improves the code's clarity and correctness. The changes look solid.

Signed-off-by: fhl2000 <[email protected]>

mergify · 2025-10-23T17:34:51Z

Documentation preview: https://vllm--27427.org.readthedocs.build/en/27427/

…etadata_builder (vllm-project#27427) Signed-off-by: fhl2000 <[email protected]>

…etadata_builder (vllm-project#27427) Signed-off-by: fhl2000 <[email protected]> Signed-off-by: 0xrushi <[email protected]>

fhl2000 added 3 commits October 23, 2025 16:13

move resolving cudagraph_mode before metadata_builder init

d168bac

Signed-off-by: fhl2000 <[email protected]>

fix

38f73df

Signed-off-by: fhl2000 <[email protected]>

add comments

16ac4c1

Signed-off-by: fhl2000 <[email protected]>

mergify bot added the v1 label Oct 23, 2025

gemini-code-assist bot reviewed Oct 23, 2025

View reviewed changes

fix doc

a15bcf4

Signed-off-by: fhl2000 <[email protected]>

mergify bot added the documentation Improvements or additions to documentation label Oct 23, 2025

ProExpertProg approved these changes Oct 23, 2025

View reviewed changes

ProExpertProg enabled auto-merge (squash) October 23, 2025 18:27

github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Oct 23, 2025

vllm-bot merged commit 85fee74 into vllm-project:main Oct 24, 2025
45 of 48 checks passed

fhl2000 deleted the fix_cudagraph_mode_CI branch October 24, 2025 06:52

atalhens pushed a commit to atalhens/vllm that referenced this pull request Oct 24, 2025

[Bugfix][CI] Move resolving cudagraph_mode before initializing attn_m…

624344e

…etadata_builder (vllm-project#27427) Signed-off-by: fhl2000 <[email protected]>

kingsmad pushed a commit to kingsmad/vllm that referenced this pull request Oct 25, 2025

[Bugfix][CI] Move resolving cudagraph_mode before initializing attn_m…

ba72e97

…etadata_builder (vllm-project#27427) Signed-off-by: fhl2000 <[email protected]>

rohin-garg pushed a commit to rohin-garg/vllm that referenced this pull request Oct 25, 2025

[Bugfix][CI] Move resolving cudagraph_mode before initializing attn_m…

3e9982c

…etadata_builder (vllm-project#27427) Signed-off-by: fhl2000 <[email protected]>

0xrushi pushed a commit to 0xrushi/vllm that referenced this pull request Oct 26, 2025

[Bugfix][CI] Move resolving cudagraph_mode before initializing attn_m…

c471066

…etadata_builder (vllm-project#27427) Signed-off-by: fhl2000 <[email protected]> Signed-off-by: 0xrushi <[email protected]>

0xrushi pushed a commit to 0xrushi/vllm that referenced this pull request Oct 26, 2025

[Bugfix][CI] Move resolving cudagraph_mode before initializing attn_m…

9488f00

…etadata_builder (vllm-project#27427) Signed-off-by: fhl2000 <[email protected]> Signed-off-by: 0xrushi <[email protected]>

fhl2000 mentioned this pull request Nov 2, 2025

[Perf] Enable full CUDA graphs for spec decoding with FlashInfer #26937

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Bugfix][CI] Move resolving cudagraph_mode before initializing attn_metadata_builder #27427

[Bugfix][CI] Move resolving cudagraph_mode before initializing attn_metadata_builder #27427

Uh oh!

fhl2000 commented Oct 23, 2025 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

mergify bot commented Oct 23, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

[Bugfix][CI] Move resolving cudagraph_mode before initializing attn_metadata_builder #27427

[Bugfix][CI] Move resolving cudagraph_mode before initializing attn_metadata_builder #27427

Uh oh!

Conversation

fhl2000 commented Oct 23, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

mergify bot commented Oct 23, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

fhl2000 commented Oct 23, 2025 •

edited by github-actions bot

Loading