
[Misc] Upgrade vllm hash to 12_14 #5000

Merged
wangxiyuan merged 11 commits into vllm-project:main from Potabk:fix_1214_test
Dec 15, 2025

Conversation

@Potabk
Collaborator

@Potabk Potabk commented Dec 14, 2025

What this PR does / why we need it?

Does this PR introduce any user-facing change?

  1. fix [v1] Add PrefixLM support to FlexAttention backend vllm#27938
  2. fix [Model][6/N] Improve all pooling task | Support chunked prefill with ALL pooling vllm#27145
    pooling models now support chunked prefill and prefix caching
  3. fix [Model] Move multimodal_cpu_fields definition to field config vllm#30181
    define the CPU fields in the field config where they really belong
  4. fix [Core][MM] Add mechanism to configure multimodal fields which should stay on CPU vllm#28168
    define the CPU fields in the field config where they really belong
  5. fix kv_transfer: Rename the shared storage connectors vllm#30201
    some module renames
  6. fix [MoE][Refactor] Make select_experts a non-static method vllm#29067
    FusedMoE module refactor
  7. fix [MoE][Refactor] Remove most arguments to FusedMoEMethodBase.apply vllm#29066
    FusedMoE module refactor
  8. fix [Attention] Make seq_lens_cpu optional in CommonAttentionMetadata to enable true async spec-decode vllm#29624
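
Items 6 and 7 track the upstream FusedMoE refactor; a plugin such as vllm-ascend typically gates its code path on the installed vLLM version. A minimal sketch of such a guard, assuming a plain `major.minor.patch` version string (the helper names here are hypothetical, not the actual vllm-ascend API):

```python
def _parse_version(version: str) -> tuple[int, ...]:
    # Split a "major.minor.patch" string into comparable integers.
    return tuple(int(part) for part in version.split("."))


def uses_refactored_fused_moe(vllm_version: str) -> bool:
    # Assumption: vLLM v0.12.0 and later carry the refactored FusedMoE API
    # (select_experts as an instance method, a slimmer apply() signature),
    # per the PR's target of vLLM v0.12.0.
    return _parse_version(vllm_version) >= (0, 12, 0)


print(uses_refactored_fused_moe("0.12.0"))  # True
print(uses_refactored_fused_moe("0.11.2"))  # False
```

Tuple comparison keeps the sketch dependency-free; a real plugin would more likely compare `vllm.__version__` with `packaging.version.Version` to handle pre-release tags.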

How was this patch tested?

@github-actions
Contributor

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:

  • A PR should do only one thing; smaller PRs enable faster reviews.
  • Every PR should include unit tests and end-to-end tests to ensure it works and is not broken by future PRs.
  • Write the commit message by filling in the PR description to help reviewers and future developers understand.

If CI fails, you can run linting and testing checks locally according to Contributing and Testing.

Contributor

@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request primarily focuses on upgrading the vLLM hash and ensuring compatibility with the new version. The changes are well-structured, involving API adaptations, refactoring for backward compatibility (e.g., expert_map handling), and introducing version-conditional logic, particularly for vLLM v0.12.0. Overall, the changes appear correct and necessary for the upgrade. I've identified one minor issue in a test file where a model is duplicated, which should be addressed to avoid redundant test runs.

@github-actions
Contributor

This pull request has conflicts, please resolve those before we can evaluate the pull request.

@Potabk Potabk added the ready (read for review) and ready-for-test (start test by label for PR) labels Dec 15, 2025
@Potabk Potabk requested a review from wangxiyuan December 15, 2025 07:46
@Potabk
Collaborator Author

Potabk commented Dec 15, 2025

Follow-up: 1. qwen3-next refactor; 2. remove get_attn_backend from npu_model_runner

@Potabk
Collaborator Author

Potabk commented Dec 15, 2025

Signed-off-by: wangli <wangli858794774@gmail.com>
@wangxiyuan wangxiyuan merged commit 8d2998d into vllm-project:main Dec 15, 2025
20 of 23 checks passed
chenaoxuan pushed a commit to chenaoxuan/vllm-ascend that referenced this pull request Dec 20, 2025
### What this PR does / why we need it?

### Does this PR introduce _any_ user-facing change?
1. fix vllm-project/vllm#27938
2. fix vllm-project/vllm#27145
pooling models now support chunked prefill and prefix caching
3. fix vllm-project/vllm#30181
define the CPU fields in the field config where they really belong
4. fix vllm-project/vllm#28168
define the CPU fields in the field config where they really belong
5. fix vllm-project/vllm#30201
some module renames
6. fix vllm-project/vllm#29067
FusedMoE module refactor
7. fix vllm-project/vllm#29066
FusedMoE module refactor
8. fix vllm-project/vllm#29624
### How was this patch tested?

- vLLM version: v0.12.0
- vLLM main:
vllm-project/vllm@ad32e3e

---------

Signed-off-by: wangli <wangli858794774@gmail.com>
@Potabk Potabk deleted the fix_1214_test branch December 31, 2025 02:56
ZRJ026 pushed a commit to ZRJ026/vllm-ascend that referenced this pull request Feb 28, 2026
ZRJ026 pushed a commit to ZRJ026/vllm-ascend that referenced this pull request Mar 4, 2026

Labels

ci/build, documentation (Improvements or additions to documentation), module:core, module:ops, module:tests, ready (read for review), ready-for-test (start test by label for PR), vllm-break

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants