Conversation
Code Review
This pull request refactors the multi-modal encoder attention mechanism by extracting the QKV reshaping logic into a new method, and updates the configuration logic in the NPU platform. My review focuses on improving code clarity and removing a redundant state mutation. I've suggested removing an unnecessary attribute assignment in the new reshape_qkv_to_3d method to eliminate side effects and improve maintainability. I've also pointed out an unconventional line break in platform.py that should be corrected for better readability.
    query = query.view(bsz * q_len, self.num_heads, self.head_size)
    key = key.view(bsz * kv_len, self.num_kv_heads, self.head_size)
    value = value.view(bsz * kv_len, self.num_kv_heads, self.head_size)
    self.num_queries_per_kv = self.num_heads // self.num_kv_heads
The self.num_queries_per_kv attribute is already initialized in the __init__ method of the Attention superclass. Re-assigning it here on every forward pass is redundant and introduces an unnecessary side effect. This can make the code harder to reason about and maintain. It's best practice to initialize such constant attributes once in the constructor and avoid mutating them in forward passes.
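To illustrate the suggestion, here is a minimal sketch (the class and method names below mirror the diff, but the surrounding structure is hypothetical, not the actual vLLM Ascend code): the ratio is derived once in the constructor, and the reshape helper stays a pure function of its inputs with no attribute writes.

```python
# Hypothetical sketch of the suggested fix: initialize num_queries_per_kv
# once in __init__ and keep reshape_qkv_to_3d free of side effects.
# query/key/value are assumed to be torch-like tensors exposing .view().

class AttentionSketch:
    def __init__(self, num_heads: int, num_kv_heads: int, head_size: int):
        self.num_heads = num_heads
        self.num_kv_heads = num_kv_heads
        self.head_size = head_size
        # Constant for the lifetime of the module; never reassigned in forward.
        self.num_queries_per_kv = num_heads // num_kv_heads

    def reshape_qkv_to_3d(self, query, key, value, bsz, q_len, kv_len):
        # Pure reshaping: reads attributes, mutates none of them.
        query = query.view(bsz * q_len, self.num_heads, self.head_size)
        key = key.view(bsz * kv_len, self.num_kv_heads, self.head_size)
        value = value.view(bsz * kv_len, self.num_kv_heads, self.head_size)
        return query, key, value
```

Because the forward path no longer writes to `self`, the method is easier to reason about and safe to call concurrently or under tracing/compilation.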
    data_parallel_size=vllm_config.parallel_config.
    data_parallel_size,
The line break in the data_parallel_size argument is unconventional and harms readability. It appears to be an accidental formatting issue. For better code clarity and maintainability, it's best to keep the attribute access on a single line.
Suggested change:

-    data_parallel_size=vllm_config.parallel_config.
-    data_parallel_size,
+    data_parallel_size=vllm_config.parallel_config.data_parallel_size,
👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:
If CI fails, you can run linting and testing checks locally according to Contributing and Testing.
This pull request has conflicts, please resolve those before we can evaluate the pull request.
Signed-off-by: zxwang <1476209578@qq.com>
Force-pushed from a41e87c to ef98a25
This pull request has conflicts, please resolve those before we can evaluate the pull request.
What this PR does / why we need it?
Fix the breakage introduced by vllm-project/vllm#30836 and update the vLLM version to 1223.
Does this PR introduce any user-facing change?
How was this patch tested?