Update vllm pin to 12.24 by leo-pony · Pull Request #5307 · vllm-project/vllm-ascend

leo-pony · 2025-12-24T02:09:52Z

What this PR does / why we need it?

Fix vllm break in the pr:

[Add MiMo-V2-Flash support] ([Model] Add MiMo-V2-Flash support vllm#30836)

Does this PR introduce any user-facing change?

How was this patch tested?

Co-authored-by: zxwang 1476209578@qq.com

vLLM version: release/v0.13.0
vLLM main: vllm-project/vllm@5fbfa8d

Signed-off-by: leo-pony <nengjunma@outlook.com>

Signed-off-by: zxwang <1476209578@qq.com>

github-actions · 2025-12-24T02:09:59Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:‌‌

A PR should do only one thing, smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests ‌to ensure it works and is not broken by other future PRs.
Write the commit message by fulfilling the PR description to help reviewer and future developers understand.

If CI fails, you can run linting and testing checks locally according Contributing and Testing.

gemini-code-assist

Code Review

This pull request updates the vLLM pin and introduces a v_head_size parameter to AscendQKVParallelLinear. However, the new parameter is not used in the layer's output size calculations, which is a critical bug. I've left a comment with details on how to fix it. The documentation change looks fine.

vllm_ascend/ops/linear.py

leo-pony · 2025-12-24T02:58:47Z

@shenchuxiaofugui Could you help take a look?
failed test case:
tests/e2e/multicard/test_qwen3_moe.py::test_qwen3_moe_w8a8_distributed_tp2_ep_dynamic_eplb FAILED

It's not a function issue. It's my local env issue.

shenchuxiaofugui · 2025-12-24T03:16:14Z

export HCCL_BUFFSIZE=1024
export DYNAMIC_EPLB="true"

vllm serve Qwen3-30B-A3B-m4
--served-model-name qwen3
--host 0.0.0.0
--port 20002
--tensor-parallel-size 2
--max_model_len 8192
--enable_expert_parallel
--quantization "ascend"
--enforce_eager
--additional-config '{"dynamic_eplb": true, "num_iterations_eplb_update": 100, "num_wait_worker_iterations": 50}'

### What this PR does / why we need it? Fix vllm break in the pr: 1. [Add MiMo-V2-Flash support] (vllm-project/vllm#30836) ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? Co-authored-by: zxwang [1476209578@qq.com](mailto:1476209578@qq.com) - vLLM version: release/v0.13.0 - vLLM main: vllm-project/vllm@5fbfa8d --------- Signed-off-by: leo-pony <nengjunma@outlook.com> Signed-off-by: zxwang <1476209578@qq.com> Co-authored-by: zxwang <1476209578@qq.com> Signed-off-by: zrj026 <zhangrunjiang026@gmail.com>

leo-pony and others added 2 commits December 24, 2025 10:04

pin vllm to bc0a5a0c089844b17cb93f3294348f411e523586

98e38c3

Signed-off-by: leo-pony <nengjunma@outlook.com>

fix

f980203

Signed-off-by: zxwang <1476209578@qq.com>

github-actions bot added documentation Improvements or additions to documentation ci/build module:ops labels Dec 24, 2025

leo-pony marked this pull request as draft December 24, 2025 02:10

gemini-code-assist bot reviewed Dec 24, 2025

View reviewed changes

vllm_ascend/ops/linear.py Show resolved Hide resolved

leo-pony marked this pull request as ready for review December 24, 2025 02:58

leo-pony added ready read for review ready-for-test start test by label for PR labels Dec 24, 2025

wangxiyuan approved these changes Dec 24, 2025

View reviewed changes

wangxiyuan merged commit 42c989a into vllm-project:main Dec 24, 2025
54 of 66 checks passed

leo-pony deleted the update_12_24 branch December 30, 2025 06:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update vllm pin to 12.24#5307

Update vllm pin to 12.24#5307
wangxiyuan merged 2 commits intovllm-project:mainfrom
leo-pony:update_12_24

leo-pony commented Dec 24, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Dec 24, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

leo-pony commented Dec 24, 2025 •

edited

Loading

Uh oh!

shenchuxiaofugui commented Dec 24, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

leo-pony commented Dec 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

github-actions bot commented Dec 24, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

leo-pony commented Dec 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

shenchuxiaofugui commented Dec 24, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

leo-pony commented Dec 24, 2025 •

edited

Loading

leo-pony commented Dec 24, 2025 •

edited

Loading