[Bug] Fix AttributeError: 'Qwen3VLMoeConfig' object has no attribute 'intermediate_size' by yewentao256 · Pull Request #30567 · vllm-project/vllm

yewentao256 · 2025-12-12T18:14:07Z

Purpose

export MODEL="Qwen/Qwen3-VL-235B-A22B-Thinking-FP8"
vllm serve $MODEL -tp 4 --port 9256 --enable-expert-parallel

(Worker_TP3_EP3 pid=1427826) ERROR 12-12 09:15:20 [multiproc_executor.py:822]     compiled_fn = compiler_fn(gm, example_inputs)
(Worker_TP3_EP3 pid=1427826) ERROR 12-12 09:15:20 [multiproc_executor.py:822]                   ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(Worker_TP3_EP3 pid=1427826) ERROR 12-12 09:15:20 [multiproc_executor.py:822]   File "/home/wentao/.venv/lib/python3.12/site-packages/torch/_dynamo/repro/after_dynamo.py", line 156, in __call__
(Worker_TP3_EP3 pid=1427826) ERROR 12-12 09:15:20 [multiproc_executor.py:822]     compiled_gm = compiler_fn(gm, example_inputs)
(Worker_TP3_EP3 pid=1427826) ERROR 12-12 09:15:20 [multiproc_executor.py:822]                   ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(Worker_TP3_EP3 pid=1427826) ERROR 12-12 09:15:20 [multiproc_executor.py:822]   File "/home/wentao/.venv/lib/python3.12/site-packages/torch/__init__.py", line 2437, in __call__
(Worker_TP3_EP3 pid=1427826) ERROR 12-12 09:15:20 [multiproc_executor.py:822]     return self.compiler_fn(model_, inputs_, **self.kwargs)
(Worker_TP3_EP3 pid=1427826) ERROR 12-12 09:15:20 [multiproc_executor.py:822]            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(Worker_TP3_EP3 pid=1427826) ERROR 12-12 09:15:20 [multiproc_executor.py:822]   File "/home/wentao/vllm-source/vllm/compilation/backends.py", line 704, in __call__
(Worker_TP3_EP3 pid=1427826) ERROR 12-12 09:15:20 [multiproc_executor.py:822]     self.configure_post_pass()
(Worker_TP3_EP3 pid=1427826) ERROR 12-12 09:15:20 [multiproc_executor.py:822]   File "/home/wentao/vllm-source/vllm/compilation/backends.py", line 552, in configure_post_pass
(Worker_TP3_EP3 pid=1427826) ERROR 12-12 09:15:20 [multiproc_executor.py:822]     self.pass_manager.configure(self.vllm_config)
(Worker_TP3_EP3 pid=1427826) ERROR 12-12 09:15:20 [multiproc_executor.py:822]   File "/home/wentao/vllm-source/vllm/compilation/pass_manager.py", line 118, in configure
(Worker_TP3_EP3 pid=1427826) ERROR 12-12 09:15:20 [multiproc_executor.py:822]     self.passes += [RMSNormQuantFusionPass(config)]
(Worker_TP3_EP3 pid=1427826) ERROR 12-12 09:15:20 [multiproc_executor.py:822]                     ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(Worker_TP3_EP3 pid=1427826) ERROR 12-12 09:15:20 [multiproc_executor.py:822]   File "/home/wentao/vllm-source/vllm/compilation/inductor_pass.py", line 134, in fn_new
(Worker_TP3_EP3 pid=1427826) ERROR 12-12 09:15:20 [multiproc_executor.py:822]     result = fn(*args, **kwargs)
(Worker_TP3_EP3 pid=1427826) ERROR 12-12 09:15:20 [multiproc_executor.py:822]              ^^^^^^^^^^^^^^^^^^^
(Worker_TP3_EP3 pid=1427826) ERROR 12-12 09:15:20 [multiproc_executor.py:822]   File "/home/wentao/vllm-source/vllm/compilation/fusion.py", line 495, in __init__
(Worker_TP3_EP3 pid=1427826) ERROR 12-12 09:15:20 [multiproc_executor.py:822]     FusedAddRMSNormGroupQuantPattern(
(Worker_TP3_EP3 pid=1427826) ERROR 12-12 09:15:20 [multiproc_executor.py:822]   File "/home/wentao/vllm-source/vllm/compilation/fusion.py", line 270, in __init__
(Worker_TP3_EP3 pid=1427826) ERROR 12-12 09:15:20 [multiproc_executor.py:822]     super().__init__(epsilon, key)
(Worker_TP3_EP3 pid=1427826) ERROR 12-12 09:15:20 [multiproc_executor.py:822]   File "/home/wentao/vllm-source/vllm/compilation/fusion.py", line 130, in __init__
(Worker_TP3_EP3 pid=1427826) ERROR 12-12 09:15:20 [multiproc_executor.py:822]     config.model_config.hf_config.intermediate_size,
(Worker_TP3_EP3 pid=1427826) ERROR 12-12 09:15:20 [multiproc_executor.py:822]     ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(Worker_TP3_EP3 pid=1427826) ERROR 12-12 09:15:20 [multiproc_executor.py:822]   File "/home/wentao/.venv/lib/python3.12/site-packages/transformers/configuration_utils.py", line 207, in __getattribute__
(Worker_TP3_EP3 pid=1427826) ERROR 12-12 09:15:20 [multiproc_executor.py:822]     return super().__getattribute__(key)
(Worker_TP3_EP3 pid=1427826) ERROR 12-12 09:15:20 [multiproc_executor.py:822]            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(Worker_TP3_EP3 pid=1427826) ERROR 12-12 09:15:20 [multiproc_executor.py:822] torch._dynamo.exc.BackendCompilerFailed: backend='<vllm.compilation.backends.VllmBackend object at 0x7ede706738c0>' raised:
(Worker_TP3_EP3 pid=1427826) ERROR 12-12 09:15:20 [multiproc_executor.py:822] AttributeError: 'Qwen3VLMoeConfig' object has no attribute 'intermediate_size'

This PR fixes the issue. With the change applied, the same command starts the server successfully:

(APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /load, Methods: GET
(APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /v1/models, Methods: GET
(APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /version, Methods: GET
(APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /v1/responses, Methods: POST
(APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /v1/responses/{response_id}, Methods: GET
(APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /v1/responses/{response_id}/cancel, Methods: POST
(APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /v1/messages, Methods: POST
(APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /v1/chat/completions, Methods: POST
(APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /v1/completions, Methods: POST
(APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /v1/audio/transcriptions, Methods: POST
(APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /v1/audio/translations, Methods: POST
(APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /ping, Methods: GET
(APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /ping, Methods: POST
(APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /invocations, Methods: POST
(APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /classify, Methods: POST
(APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /v1/embeddings, Methods: POST
(APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /score, Methods: POST
(APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /v1/score, Methods: POST
(APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /rerank, Methods: POST
(APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /v1/rerank, Methods: POST
(APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /v2/rerank, Methods: POST
(APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /pooling, Methods: POST
(APIServer pid=1577619) INFO:     Started server process [1577619]
(APIServer pid=1577619) INFO:     Waiting for application startup.
(APIServer pid=1577619) INFO:     Application startup complete.

chatgpt-codex-connector · 2025-12-12T18:14:15Z

Codex usage limits have been reached for code reviews. Please check with the admins of this repo to increase the limits by adding credits.

gemini-code-assist

Code Review

This pull request fixes an AttributeError for Qwen3VLMoeConfig by safely accessing intermediate_size and hidden_size from the model configuration. The change correctly looks for these attributes in text_config for multimodal models. I've added a suggestion to make the code more robust by handling cases where model_config itself might be None, which can occur in certain testing environments.
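The lookup described in this review can be sketched roughly as follows. This is a minimal illustration, not the actual diff in `vllm/compilation/fusion.py`: the helper name `resolve_config_attr`, the stand-in `SimpleNamespace` configs, and the numeric sizes are all hypothetical. It assumes only what the review states, namely that multimodal configs keep `intermediate_size` and `hidden_size` under `text_config`, and that `model_config` may be `None`.

```python
from types import SimpleNamespace


def resolve_config_attr(model_config, name, default=None):
    """Hypothetical helper: read `name` from hf_config, falling back to
    hf_config.text_config, and tolerate a missing model_config."""
    if model_config is None:
        return default
    hf_config = getattr(model_config, "hf_config", None)
    if hf_config is None:
        return default
    # Text-only models expose the attribute at the top level.
    if hasattr(hf_config, name):
        return getattr(hf_config, name)
    # Multimodal models (per the review) nest it under text_config.
    text_config = getattr(hf_config, "text_config", None)
    return getattr(text_config, name, default)


# Stand-in configs with made-up sizes, used only to exercise the helper.
text_only = SimpleNamespace(
    hf_config=SimpleNamespace(intermediate_size=1024, hidden_size=256)
)
multimodal = SimpleNamespace(
    hf_config=SimpleNamespace(
        text_config=SimpleNamespace(intermediate_size=2048, hidden_size=512)
    )
)

text_only_size = resolve_config_attr(text_only, "intermediate_size")
multimodal_size = resolve_config_attr(multimodal, "intermediate_size")
missing_size = resolve_config_attr(None, "intermediate_size")
```

Returning a default instead of raising lets the caller decide whether a missing size should disable the fusion pattern or be treated as an error.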

vllm/compilation/fusion.py

cjackal · 2025-12-12T18:46:33Z

I think #30244 also fixes the same VLM kernel fusion issue.

DarkLight1337

Is this PR still needed now that #30244 has been merged?

mergify · 2025-12-15T07:22:45Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @yewentao256.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

yewentao256

@cjackal @DarkLight1337 Thanks for letting me know. This PR is no longer needed.

yewentao256 · 2025-12-15T15:18:04Z

Closing in favor of #30244.

Fix AttributeError: 'Qwen3VLMoeConfig' object has no attribute 'inter…

19ad147

…mediate_size' Signed-off-by: yewentao256 <zhyanwentao@126.com>

yewentao256 requested review from ProExpertProg, youkaichao and zou3519 as code owners December 12, 2025 18:14

mergify bot added the qwen label (Related to Qwen models) Dec 12, 2025

gemini-code-assist bot reviewed Dec 12, 2025

yewentao256 added the ready label (ONLY add when PR is ready to merge/full CI is needed) Dec 12, 2025

DarkLight1337 reviewed Dec 15, 2025

mergify bot added the needs-rebase label Dec 15, 2025

yewentao256 commented Dec 15, 2025

yewentao256 closed this Dec 15, 2025

jeejeelee mentioned this pull request Dec 16, 2025

[Bug]: AttributeError: 'Qwen3VLConfig' object has no attribute 'intermediate_size' #30721

Open

1 task

yewentao256 wants to merge 1 commit into main from wentao-fix-qwen3vl-launch-bug

3 participants
