Update corresponding vllm commit ID to 12 29 by leo-pony · Pull Request #5475 · vllm-project/vllm-ascend

leo-pony · 2025-12-29T07:51:06Z

What this PR does / why we need it?

Fixes vllm break:

[BugFix] register quant scale tensors as buffer (vllm-project/vllm#31395)

Does this PR introduce any user-facing change?

How was this patch tested?

vLLM version: v0.13.0
vLLM main: vllm-project/vllm@5326c89

gemini-code-assist

Code Review

This pull request updates a commit hash in the documentation and adds a necessary context manager for setting the vLLM configuration when loading a model. My review identifies a critical issue where a similar context is missing for device initialization, which will cause a runtime error. I've provided a suggestion on how to fix this in the init_device method.
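The pattern the review describes can be sketched as follows: code that reads a "current configuration" only works while a context manager has set it, so every entry point that reaches such code needs its own `with` block. This is a minimal illustration under stated assumptions. `Config`, `set_current_config`, `get_current_config`, and `Worker` are hypothetical stand-ins written for this example. They are not vLLM's or vllm-ascend's actual API, and this is not the patch itself.

```python
from contextlib import contextmanager
from contextvars import ContextVar
from dataclasses import dataclass
from typing import Optional


@dataclass
class Config:
    """Hypothetical stand-in for an engine configuration object."""
    model: str


# Hypothetical slot holding the "current" configuration, unset by default.
_current_config: ContextVar[Optional[Config]] = ContextVar(
    "current_config", default=None
)


@contextmanager
def set_current_config(config: Config):
    """Make `config` the current configuration inside the with-block."""
    token = _current_config.set(config)
    try:
        yield
    finally:
        # Restore the previous value even if the body raises.
        _current_config.reset(token)


def get_current_config() -> Config:
    """Return the current configuration, or fail if none has been set."""
    config = _current_config.get()
    if config is None:
        raise RuntimeError("current config is not set")
    return config


class Worker:
    """Hypothetical worker with two entry points that need the config."""

    def __init__(self, config: Config):
        self.config = config

    def init_device(self) -> str:
        # Without this with-block, _build_runner would raise RuntimeError.
        with set_current_config(self.config):
            return self._build_runner()

    def load_model(self) -> str:
        with set_current_config(self.config):
            return f"loaded {get_current_config().model}"

    def _build_runner(self) -> str:
        return f"runner for {get_current_config().model}"
```

Wrapping each entry point separately, rather than setting the configuration once globally, keeps the value scoped to the call that needs it and restores the previous state afterwards.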

github-actions · 2025-12-29T09:02:48Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:

A PR should do only one thing; smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests to ensure it works and is not broken by future PRs.
Write the commit message by filling in the PR description to help reviewers and future developers understand the change.

If CI fails, you can run linting and testing checks locally according to Contributing and Testing.

Signed-off-by: leo-pony <nengjunma@outlook.com>

…to FIA_rebase

* 'main' of https://github.com/vllm-project/vllm-ascend: (88 commits)
[1/N] Refactor nightly test structure (vllm-project#5479)
Docs: Remove deprecated --task parameter for embedding models (vllm-project#5257)
Revert "moe_gating_top_k" (vllm-project#5512)
[Doc] Fix issue link for 0.12.0 (vllm-project#5500)
[CI]update triton ascend version (vllm-project#5392)
moe_gating_top_k (vllm-project#5271)
[refactor] refactor model runner capture model (vllm-project#5230)
Update corresponding vllm commit ID to 12 29 (vllm-project#5475)
[Kernel]update csrc cmakelist for open-source cann (vllm-project#5458)
[OP] add custom op aclnnMoeInitRoutingCustom (vllm-project#5251)
[Refactor][EAGLE] 1/N delete __init__ in mtp_proposer (vllm-project#5176)
[Refactor][Triton] Move reject sample triton kernels into ops/triton (vllm-project#5324)
[Feature] support eager mode in model runner v2 (vllm-project#5210)
[feature] fia support sliding windows (vllm-project#5239)
Optimize some rejectsampler functions to make npu op launch non-blocking (vllm-project#4587)
[Feature] Support to use fullgraph with eagle (vllm-project#5118)
[EPLB][refactor] Modification of the initialization logic for expert_map and log2phy（depend on pr5285） (vllm-project#5311)
[Refactor]6/N Extract common code of class AscendMLAImpl (vllm-project#5314)
[Refactor] cache cos/sin in mla & remove parameter model in builder. (vllm-project#5277)
update vllm pin to 12.27 (vllm-project#5412)
...

### What this PR does / why we need it?

- Fixes vllm break:
1. [BugFix] register quant scale tensors as buffer (vllm-project/vllm#31395)

### Does this PR introduce _any_ user-facing change?

### How was this patch tested?

- vLLM version: v0.13.0
- vLLM main: vllm-project/vllm@5326c89

---------

Signed-off-by: leo-pony <nengjunma@outlook.com>

### What this PR does / why we need it?

- Fixes vllm break:
1. [BugFix] register quant scale tensors as buffer (vllm-project/vllm#31395)

### Does this PR introduce _any_ user-facing change?

### How was this patch tested?

- vLLM version: v0.13.0
- vLLM main: vllm-project/vllm@5326c89

---------

Signed-off-by: leo-pony <nengjunma@outlook.com>
Signed-off-by: zrj026 <zhangrunjiang026@gmail.com>


gemini-code-assist bot reviewed Dec 29, 2025


Comment thread vllm_ascend/worker/worker.py

github-actions bot added the documentation and ci/build labels Dec 29, 2025

wangxiyuan approved these changes Dec 29, 2025


leo-pony changed the title ~~Update 12 29~~ Update corresponding vllm commit ID to 12 29 Dec 29, 2025

leo-pony added and removed the ready and ready-for-test labels Dec 29, 2025

leo-pony added 2 commits December 29, 2025 17:17

fix break by vllm pr 31395

0475e38

Signed-off-by: leo-pony <nengjunma@outlook.com>

pin vllm version to 12-29

ea63759

Signed-off-by: leo-pony <nengjunma@outlook.com>

leo-pony force-pushed the update_12_29 branch from 4204297 to ea63759 December 29, 2025 09:17

wangxiyuan merged commit 5e96f94 into vllm-project:main Dec 29, 2025
19 checks passed

leo-pony deleted the update_12_29 branch December 30, 2025 06:24

wangxiyuan merged 2 commits into vllm-project:main from leo-pony:update_12_29


2 participants
