[Bug] Fix AttributeError: 'Qwen3VLMoeConfig' object has no attribute 'intermediate_size'#30567
[Bug] Fix AttributeError: 'Qwen3VLMoeConfig' object has no attribute 'intermediate_size'#30567yewentao256 wants to merge 1 commit intomainfrom
Conversation
…mediate_size' Signed-off-by: yewentao256 <zhyanwentao@126.com>
|
Codex usage limits have been reached for code reviews. Please check with the admins of this repo to increase the limits by adding credits. |
There was a problem hiding this comment.
Code Review
This pull request fixes an AttributeError for Qwen3VLMoeConfig by safely accessing intermediate_size and hidden_size from the model configuration. The change correctly looks for these attributes in text_config for multimodal models. I've added a suggestion to make the code more robust by handling cases where model_config itself might be None, which can occur in certain testing environments.
|
I think #30244 also fixes the same VLM kernel fusion issue. |
DarkLight1337
left a comment
There was a problem hiding this comment.
Is this PR still needed now that #30244 has been merged?
|
This pull request has merge conflicts that must be resolved before it can be |
yewentao256
left a comment
There was a problem hiding this comment.
@cjackal @DarkLight1337 Thanks for letting me know, it is not needed now
|
Close in favor of #30244 |
Purpose
export MODEL="Qwen/Qwen3-VL-235B-A22B-Thinking-FP8"
vllm serve $MODEL -tp 4 --port 9256 --enable-expert-parallel
This PR fixes the issue
(APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /load, Methods: GET (APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /v1/models, Methods: GET (APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /version, Methods: GET (APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /v1/responses, Methods: POST (APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /v1/responses/{response_id}, Methods: GET (APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /v1/responses/{response_id}/cancel, Methods: POST (APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /v1/messages, Methods: POST (APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /v1/chat/completions, Methods: POST (APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /v1/completions, Methods: POST (APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /v1/audio/transcriptions, Methods: POST (APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /v1/audio/translations, Methods: POST (APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /ping, Methods: GET (APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /ping, Methods: POST (APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /invocations, Methods: POST (APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /classify, Methods: POST (APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /v1/embeddings, Methods: POST (APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /score, Methods: POST (APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /v1/score, Methods: POST (APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /rerank, Methods: POST (APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /v1/rerank, Methods: POST (APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /v2/rerank, Methods: POST (APIServer pid=1577619) INFO 12-12 10:26:49 [launcher.py:46] Route: /pooling, Methods: POST (APIServer pid=1577619) INFO: Started server process [1577619] (APIServer pid=1577619) INFO: Waiting for application startup. (APIServer pid=1577619) INFO: Application startup complete.