
[Bugfix]: resolve torch.compile cache conflict between mm_encoder_tp_modes #32842

Merged
DarkLight1337 merged 1 commit into vllm-project:main from HirokenOvo:fix/mm_encoder_torch_compile_hash on Jan 24, 2026

Conversation


@HirokenOvo HirokenOvo commented Jan 22, 2026

Purpose

PR #23207 introduced torch.compile support for the ViT component of Qwen2.5-VL. This PR fixes an issue where enabling torch.compile for the vision encoder (--compilation-config '{"compile_mm_encoder": true}') caused crashes when switching between --mm-encoder-tp-mode "weights" and --mm-encoder-tp-mode "data".

The Problem:

vLLM uses VllmConfig.compute_hash() to identify unique configurations for caching compiled graphs. However, mm_encoder_tp_mode was missing from this hash calculation. As a result, running the model with weights mode generated a cache that data mode would try to reuse (or vice versa). Since these modes result in different tensor shapes/strides for the ViT, this caused an AssertionError in the generated Inductor kernels.
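The size mismatch reported in the error log follows directly from the TP mode: with a tensor-parallel size of 2, "data" mode shards the vision patches across ranks while "weights" mode gives every rank the full patch sequence, so Inductor's baked-in size guards from one mode's cache fail under the other. A hedged, illustrative sketch (not vLLM code; `vit_input_rows` is a hypothetical helper, and the numbers are taken from the error log below):

```python
def vit_input_rows(num_patches: int, tp_size: int, mode: str) -> int:
    """Per-rank row count of the ViT input under the two mm-encoder TP modes."""
    if mode == "data":
        # Data-parallel encoder: patches are sharded across TP ranks.
        return num_patches // tp_size
    # Weight-parallel encoder: every rank sees the full patch sequence.
    return num_patches

print(vit_input_rows(3456, 2, "weights"))  # 3456 rows baked into one cache
print(vit_input_rows(3456, 2, "data"))     # 1728 rows, tripping that cache's guard
```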

The Solution:

  1. Updated vllm/config/multimodal.py: Added mm_encoder_tp_mode to the factors used in MultiModalConfig.compute_hash().
  2. Updated vllm/config/vllm.py: Modified VllmConfig.compute_hash() to explicitly include the multimodal_config hash if and only if compile_mm_encoder is enabled. This ensures correct cache isolation without affecting the hash for non-compiled runs.
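The two changes above can be sketched as follows. This is a simplified illustration with hypothetical, stripped-down classes; vLLM's real MultiModalConfig and VllmConfig hash many more factors:

```python
import hashlib
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class MultiModalConfig:
    mm_encoder_tp_mode: str = "weights"  # "weights" or "data"

    def compute_hash(self) -> str:
        # After the fix: mm_encoder_tp_mode is one of the hashed factors,
        # so "weights" and "data" runs get distinct compile caches.
        factors = [self.mm_encoder_tp_mode]
        return hashlib.md5(str(factors).encode()).hexdigest()


@dataclass
class CompilationConfig:
    compile_mm_encoder: bool = False


@dataclass
class VllmConfig:
    compilation_config: CompilationConfig = field(default_factory=CompilationConfig)
    multimodal_config: Optional[MultiModalConfig] = None

    def compute_hash(self) -> str:
        factors = []
        # Include the multimodal hash only when the encoder is compiled,
        # so non-compiled runs keep their previous cache keys.
        if (self.multimodal_config is not None
                and self.compilation_config.compile_mm_encoder):
            factors.append(self.multimodal_config.compute_hash())
        return hashlib.md5(str(factors).encode()).hexdigest()
```

With compile_mm_encoder enabled, the two TP modes now hash differently and each gets its own compiled-graph cache; with it disabled, the hash is unchanged either way.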

Related Error Log

Running these two commands in sequence (so the second run reuses the first run's compiled-graph cache) reproduces the crash:

vllm serve /data/models/qwen2_5vl-3B/ --compilation-config '{"compile_mm_encoder": true}' --tensor-parallel-size 2 --mm-encoder-tp-mode "data" --max-model-len 8192 --gpu-memory-utilization 0.5
vllm serve /data/models/qwen2_5vl-3B/ --compilation-config '{"compile_mm_encoder": true}' --tensor-parallel-size 2 --mm-encoder-tp-mode "weights" --max-model-len 8192 --gpu-memory-utilization 0.5

(Worker_TP0 pid=2909827) ERROR 01-22 16:56:04 [multiproc_executor.py:839]   File "/tmp/torchinductor_root/he/chehoqaxuhke4t2nqjka646fx2in2rwogdaqfv5djiz374ppmcpq.py", line 550, in call
(Worker_TP0 pid=2909827) ERROR 01-22 16:56:04 [multiproc_executor.py:839]     assert_size_stride(arg4_1, (3456, 1152), (1152, 1))
(Worker_TP0 pid=2909827) ERROR 01-22 16:56:04 [multiproc_executor.py:839] AssertionError: expected size 1728==3456, stride 1152==1152 at dim=0

Test Plan

Test Result


@DarkLight1337 DarkLight1337 left a comment

Thanks for fixing

@DarkLight1337 DarkLight1337 enabled auto-merge (squash) January 24, 2026 12:51
@github-actions github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Jan 24, 2026

DarkLight1337 commented Jan 24, 2026

cc @ywang96 @Isotr0py

Signed-off-by: Hongjian Zhang <zhanghongjian@xiaohongshu.com>
Signed-off-by: Xingran Wang <wangxingran123456@outlook.com>
Co-authored-by: Xingran Wang <wangxingran123456@outlook.com>
auto-merge was automatically disabled January 24, 2026 12:52

Head branch was pushed to by a user without write access

@HirokenOvo HirokenOvo force-pushed the fix/mm_encoder_torch_compile_hash branch from f059fee to 4ff69ba on January 24, 2026 12:52
@DarkLight1337 DarkLight1337 enabled auto-merge (squash) January 24, 2026 12:57
@DarkLight1337 DarkLight1337 merged commit 1209b78 into vllm-project:main Jan 24, 2026
48 checks passed
@HirokenOvo HirokenOvo deleted the fix/mm_encoder_torch_compile_hash branch January 24, 2026 15:02
ms1design pushed a commit to ms1design/vllm that referenced this pull request Jan 24, 2026
…modes (vllm-project#32842)

Signed-off-by: Hongjian Zhang <zhanghongjian@xiaohongshu.com>
Signed-off-by: Xingran Wang <wangxingran123456@outlook.com>
Co-authored-by: Xingran Wang <wangxingran123456@outlook.com>
Signed-off-by: Mieszko Syty <mieszko@ms1design.pl>
cwazai pushed a commit to cwazai/vllm that referenced this pull request Jan 25, 2026
…modes (vllm-project#32842)

Signed-off-by: Hongjian Zhang <zhanghongjian@xiaohongshu.com>
Signed-off-by: Xingran Wang <wangxingran123456@outlook.com>
Co-authored-by: Xingran Wang <wangxingran123456@outlook.com>
Signed-off-by: 陈建华 <1647430658@qq.com>
ItzDEXX pushed a commit to ItzDEXX/vllm that referenced this pull request Feb 19, 2026
…modes (vllm-project#32842)

Signed-off-by: Hongjian Zhang <zhanghongjian@xiaohongshu.com>
Signed-off-by: Xingran Wang <wangxingran123456@outlook.com>
Co-authored-by: Xingran Wang <wangxingran123456@outlook.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

bug: Something isn't working
ready: ONLY add when PR is ready to merge/full CI is needed
