[torch.compile][Minor Fix] Gate cudagraph_unsafe tag for torch>=2.9#25304
BoyuanFeng wants to merge 6 commits into vllm-project:main
Conversation
Signed-off-by: Boyuan Feng <boyuan@meta.com>
cc @xuechendi @bigPYJ1151 Thanks for #25298! This PR should be a proper fix. The issue is that the tag is only available in PyTorch 2.9+.
Code Review
This pull request correctly gates the usage of torch._C.Tag.cudagraph_unsafe based on the PyTorch version, ensuring compatibility with versions 2.9 and newer. The implementation uses is_torch_equal_or_newer which is a clean approach. However, the version-checking logic is duplicated in both vllm/attention/layer.py and tests/compile/silly_attention.py. My review focuses on refactoring this duplication into a centralized utility to improve code maintainability and prevent potential future inconsistencies.
vllm/attention/layer.py
Outdated
```python
if is_torch_equal_or_newer("2.9.0.dev"):
    tag_cudagraph_unsafe = (torch._C.Tag.cudagraph_unsafe, )
else:
    tag_cudagraph_unsafe = ()  # type: ignore[assignment]
```
This logic for determining tag_cudagraph_unsafe based on the PyTorch version is also present in tests/compile/silly_attention.py. To improve maintainability and avoid potential inconsistencies in the future, it would be better to define this in a single, shared location (e.g., vllm/utils) and import it here.
Moved to `vllm/utils/__init__.py`.
Don't keep it in `vllm/attention/layer.py`, since other ops (e.g., MoE) may use it in the future.
Don't keep it in `vllm/compilation`, to avoid a circular import.
Signed-off-by: Boyuan Feng <boyuan@meta.com>
```python
if is_torch_equal_or_newer("2.9.0.dev"):
```
Could you also add a check for the attribute? We are working on XPU (Intel GPU) and HPU (Intel Gaudi); we use torch-XPU or torch-HPU, which I'm afraid will not have the cudagraph-related attributes, or may assert.
Yes, added `current_platform.is_cuda_alike()`.
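The combined gate can be sketched as below. Both helpers are hypothetical stand-ins (on real vllm these would be `is_torch_equal_or_newer` and `current_platform.is_cuda_alike()`) so the snippet runs anywhere; the point is that `torch._C.Tag.cudagraph_unsafe` is only dereferenced when both conditions hold, so torch-XPU / torch-HPU builds never touch a missing attribute.

```python
# Hypothetical stand-ins for vllm's helpers, so this snippet is
# self-contained and does not require torch to be installed.
def is_torch_equal_or_newer(target: str) -> bool:
    return False  # pretend we are on an older torch build


def is_cuda_alike() -> bool:
    return False  # e.g. an XPU or HPU build


# Only reference the tag when the platform is CUDA-like AND torch is
# new enough; otherwise fall back to an empty tuple of tags.
if is_cuda_alike() and is_torch_equal_or_newer("2.9.0.dev"):
    import torch
    tag_cudagraph_unsafe = (torch._C.Tag.cudagraph_unsafe,)
else:
    tag_cudagraph_unsafe = ()  # type: ignore[assignment]
```

Because both checks short-circuit to `False` here, the `else` branch is taken and the tag attribute is never accessed.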
Signed-off-by: Boyuan Feng <boyuan@meta.com>
This pull request has merge conflicts that must be resolved before it can be merged.
Signed-off-by: Boyuan Feng <boyuan@meta.com>
Closing in favor of #26116.
`torch._C.Tag.cudagraph_unsafe` is only used for PyTorch >= 2.9.