[BugFix] fix graph partition signature #27139

Merged
ProExpertProg merged 3 commits into vllm-project:main from BoyuanFeng:bf/partition-signature on Oct 18, 2025

Conversation

@BoyuanFeng (Collaborator) commented Oct 18, 2025

A PyTorch 2.9 monkey patch to fix test_attn_quant. See pytorch/pytorch#165815.

Signed-off-by: Boyuan Feng <boyuan@meta.com>

@gemini-code-assist bot (Contributor) left a comment

Code Review

This pull request introduces a monkey patch for get_graph_partition_signature in PyTorch 2.9 to address a bug related to graph partitioning, which fixes the test_attn_quant test. The changes in the test file reflect an update to the logging infrastructure. The core change in vllm/env_override.py seems correct in its intent to backport a fix. However, there is a misleading comment for the new monkey patch that should be corrected to improve maintainability.
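
For readers unfamiliar with the pattern, a version-gated monkey patch can be sketched roughly as below. This is an illustration only: `fake_lib`, `get_signature`, `patched_get_signature`, and `apply_patch` are hypothetical stand-ins, not names from vllm/env_override.py or PyTorch, and the deduplication logic is a placeholder rather than the actual fix from pytorch/pytorch#165815.

```python
from types import SimpleNamespace

# Stand-in for a third-party module (assumption: NOT the real PyTorch module).
fake_lib = SimpleNamespace(
    __version__="2.9.0",
    get_signature=lambda inputs: list(inputs),  # pretend this is the buggy upstream function
)


def parse_major_minor(version: str) -> tuple[int, int]:
    """Return (major, minor) from a version string such as '2.9.0+cu128'."""
    major, minor = version.split("+")[0].split(".")[:2]
    return int(major), int(minor)


def patched_get_signature(inputs):
    """Hypothetical replacement: drop duplicates while keeping order."""
    return list(dict.fromkeys(inputs))


def apply_patch(lib) -> bool:
    """Swap in the replacement only on the affected release; report whether it was applied."""
    if parse_major_minor(lib.__version__) != (2, 9):
        return False
    lib.get_signature = patched_get_signature
    return True


patched = apply_patch(fake_lib)
```

Gating on the exact release keeps the override from shadowing the upstream function once a fixed version is installed, which is the usual reason such patches carry a version check.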

)

with caplog_mp_spawn(logging.DEBUG) as log_holder:
with caplog_vllm.at_level(logging.DEBUG):

@BoyuanFeng (Collaborator, Author) commented:

This test does not use mp. caplog_mp_spawn would also fail for dynamo graph partition.

@ProExpertProg (Collaborator) commented:

I wonder if that's just on your machine - for me even tp=1 tests used MP. I'll enable CI and we can see. But I think we should probably just do both, and we can investigate later

Signed-off-by: Boyuan Feng <boyuan@meta.com>
@ProExpertProg added the "ready" label (ONLY add when PR is ready to merge/full CI is needed) on Oct 18, 2025
Signed-off-by: Boyuan Feng <boyuan@meta.com>

@BoyuanFeng (Collaborator, Author) commented:

@ProExpertProg yes we should use caplog_mp_spawn. I was using VLLM_ENABLE_V1_MULTIPROCESSING=0 for debugging. I guess that's the reason.
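
The difference between the two setups can be illustrated with a small sketch using only the standard library. It does not reproduce vLLM's `caplog_vllm` or `caplog_mp_spawn` fixtures; `ListHandler` and the `demo` logger are hypothetical names. The point is only that a handler attached in the test process sees records emitted in that process, but not records emitted in a separate process.

```python
import logging
import subprocess
import sys


class ListHandler(logging.Handler):
    """Collects formatted log messages in a list (a simplified in-process capture)."""

    def __init__(self):
        super().__init__(level=logging.DEBUG)
        self.messages = []

    def emit(self, record):
        self.messages.append(record.getMessage())


logger = logging.getLogger("demo")
logger.setLevel(logging.DEBUG)
logger.propagate = False
handler = ListHandler()
logger.addHandler(handler)

# Emitted in the same process: the handler captures it.
logger.debug("emitted in-process")

# Emitted in a child process: the parent's handler never sees it.
child_code = (
    "import logging; "
    "logging.basicConfig(level=logging.DEBUG); "
    "logging.getLogger('demo').debug('emitted in child')"
)
result = subprocess.run(
    [sys.executable, "-c", child_code],
    capture_output=True,
    text=True,
    check=True,
)
```

After this runs, `handler.messages` holds only the in-process message, while the child's message is available only through the child's own output (`result.stderr` here).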

@ProExpertProg ProExpertProg merged commit e133d6d into vllm-project:main Oct 18, 2025
45 checks passed
Labels

ready (ONLY add when PR is ready to merge/full CI is needed)

Development

Successfully merging this pull request may close these issues.

[Bug]: Another issue with Inductor partition codegen for attn+nvfp4 quant fusion
