[Main2Main] Upgrade vllm commit to 0108 by zhangxinyuehfad · Pull Request #5727 · vllm-project/vllm-ascend

zhangxinyuehfad · 2026-01-08T08:53:31Z

What this PR does / why we need it?

Upgrade vllm commit to 0108 (eac3b96)

Remove init_cached_hf_modules, following [Chore] Try remove init_cached_hf_modules vllm#31786.
Skip the spec_decode e2e test, which is broken by [Perf] Async Scheduling + Speculative Decoding + Structured Outputs vllm#29821.
Fix vllm.v1.attention.backends.utils usage, due to [Chore] Migrate V0 attention utils vllm#31891.
Skip test_qwen3_next_distributed_mp_full_decode_only_tp4, which is broken by [Attention][1/n] Remove usage of deprecated seq_lens_cpu and num_computed_tokens_cpu CommonAttentionMetadata properties vllm#31773 ([Bugfix] Keep all tensors to be on the same device vllm#31958 will fix this).
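
When an upstream module is migrated, as in the attention utils change above, one way to adapt is to resolve a name from whichever module path exists in the installed version. The following is a minimal sketch of that idea, not the code in this PR: `import_first_available` is a hypothetical helper, and the demonstration uses standard-library modules because the new vLLM module path is not stated here.

```python
import importlib
from typing import Any, Sequence


def import_first_available(attr: str, module_paths: Sequence[str]) -> Any:
    """Return `attr` from the first module in `module_paths` that provides it."""
    for path in module_paths:
        try:
            module = importlib.import_module(path)
        except ImportError:
            continue  # module is absent in this version; try the next candidate
        if hasattr(module, attr):
            return getattr(module, attr)
    raise ImportError(f"{attr!r} not found in any of: {', '.join(module_paths)}")


# Demonstration with standard-library modules only. In a real shim the
# candidates would be the old and new upstream module paths (assumed, not shown).
loads = import_first_available("loads", ["no_such_module_for_demo", "json"])
print(loads("[1, 2, 3]"))
```

Keeping such a lookup in a single compatibility module means each call site imports the resolved name from one place instead of repeating the conditional import.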

Does this PR introduce any user-facing change?

How was this patch tested?

vLLM version: v0.13.0
vLLM main: vllm-project/vllm@2f4e654

gemini-code-assist

Code Review

This pull request upgrades the vLLM commit hash and introduces compatibility shims for changes in the new version. The changes correctly identify areas that need adaptation, such as the import path for PAD_SLOT_ID and the usage of init_cached_hf_modules.

However, there are some critical and high-severity issues. The compatibility logic relies on an exact version match (vllm_version_is), which is brittle and will likely fail for versions other than the one specified. This needs to be replaced with more robust version range checks. Additionally, the conditional import logic is duplicated across multiple files, which impacts maintainability. I've left specific comments with suggestions to address these points by refactoring the version checking utility and centralizing compatibility imports.
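
As an illustration of the range-check idea, here is a minimal sketch of a version comparison that accepts a lower and an upper bound instead of one exact version. The names `parse_version` and `version_in_range` are hypothetical and are not part of vllm-ascend; the sketch ignores pre-release and local version suffixes, which a production check would need to handle.

```python
import re
from typing import Optional, Tuple


def parse_version(text: str) -> Tuple[int, ...]:
    """Extract the leading numeric release, e.g. 'v0.13.0rc1' -> (0, 13, 0)."""
    match = re.match(r"v?(\d+(?:\.\d+)*)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized version string: {text!r}")
    parts = [int(part) for part in match.group(1).split(".")]
    # Pad to at least three components so '0.13' compares equal to '0.13.0'.
    parts += [0] * (3 - len(parts))
    return tuple(parts)


def version_in_range(current: str,
                     minimum: Optional[str] = None,
                     maximum: Optional[str] = None) -> bool:
    """True when minimum <= current < maximum; either bound may be omitted."""
    cur = parse_version(current)
    if minimum is not None and cur < parse_version(minimum):
        return False
    if maximum is not None and cur >= parse_version(maximum):
        return False
    return True


# A shim guarded this way keeps working for later patch releases,
# unlike an exact-match check against a single version string.
print(version_in_range("0.13.1", minimum="0.13.0", maximum="0.14.0"))
```

The half-open interval (inclusive lower bound, exclusive upper bound) is a design choice in this sketch: it lets adjacent compatibility branches share a boundary version without overlapping.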

github-actions · 2026-01-08T09:08:15Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:

A PR should do only one thing; smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests to ensure it works and is not broken by other future PRs.
Write the commit message by filling in the PR description to help reviewers and future developers understand the change.

If CI fails, you can run linting and testing checks locally according to Contributing and Testing.

github-actions · 2026-01-09T08:05:04Z

This pull request has conflicts, please resolve those before we can evaluate the pull request.

Signed-off-by: hfadzxy <starmoon_zhang@163.com>

wjunLu · 2026-01-12T07:43:32Z

We don't need this PR anymore, please just keep 0112

vllm-ascend-ci added the ready (read for review) and ready-for-test (start test by label for PR) labels Jan 8, 2026

gemini-code-assist bot reviewed Jan 8, 2026


Comment thread tests/ut/worker/test_worker_v1.py

Comment thread vllm_ascend/attention/mla_v1.py Outdated

Comment thread vllm_ascend/worker/worker.py

Comment thread vllm_ascend/attention/mla_v1.py Outdated

zhangxinyuehfad force-pushed the main0108 branch from 978aeed to cef6be4 on January 8, 2026 08:58

github-actions bot added the documentation (Improvements or additions to documentation), ci/build, module:tests, and module:ops labels Jan 8, 2026

zhangxinyuehfad force-pushed the main0108 branch 2 times, most recently from 9ee9933 to 7fbcbb0 on January 9, 2026 03:43

github-actions bot added the merge-conflicts label Jan 9, 2026

zhangxinyuehfad added 2 commits January 9, 2026 17:41

[Main2Main] Upgrade vllm commit to 0107

06965d0

Signed-off-by: hfadzxy <starmoon_zhang@163.com>

[Main2Main] Upgrade vllm commit to 0108

6111062

Signed-off-by: hfadzxy <starmoon_zhang@163.com>

zhangxinyuehfad force-pushed the main0108 branch from 7fbcbb0 to 6111062 on January 9, 2026 09:42

github-actions bot removed the merge-conflicts label Jan 9, 2026

zhangxinyuehfad closed this Jan 12, 2026

zhangxinyuehfad deleted the main0108 branch March 19, 2026 02:08

zhangxinyuehfad wants to merge 2 commits into vllm-project:main from zhangxinyuehfad:main0108


3 participants
