[Bugfix] bugfix for moe_mlp in vllm-ascend 0.11.0-dev #4825
Clorist33 wants to merge 7 commits into vllm-project:v0.11.0-dev
Conversation
Signed-off-by: tanqingshan (A) <50050625@china.huawei.com>
Code Review
This pull request correctly fixes a bug in the moe_mlp module by ensuring the group_list argument is converted from a cumulative sum to per-group counts before being passed to torch_npu.npu_dequant_swiglu_quant. The fix is accurate and addresses the issue described. I have one suggestion to reduce code duplication, which will make the codebase more robust against future changes.
```python
group_diff = torch.diff(group_list, dim=0)
new_group = torch.cat([group_list[0].unsqueeze(0), group_diff],
                      dim=0)
```
This logic for converting a cumulative-sum tensor to counts duplicates the code at lines 136-138. The duplication poses a maintainability risk: a future change might be applied in one location but missed in the other, leading to subtle bugs.
To mitigate this, please apply the following suggestion, which makes the implementation more concise and consistent with the existing pattern in the file.
```diff
-group_diff = torch.diff(group_list, dim=0)
-new_group = torch.cat([group_list[0].unsqueeze(0), group_diff],
-                      dim=0)
+new_group = torch.cat([group_list[:1], torch.diff(group_list, dim=0)], dim=0)
```
References
- Avoid code duplication (Don't Repeat Yourself - DRY principle). Duplicated code increases maintenance overhead and the risk of introducing inconsistencies and bugs, as changes must be manually synchronized across all instances.
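As a side note for reviewers, the conversion discussed above can be verified in isolation. The sketch below uses illustrative tensor values (not taken from the PR) to show that the one-line form proposed in the suggestion turns a cumulative sum into per-group counts:

```python
import torch

# Illustrative cumulative group_list: running totals of tokens per expert.
group_list = torch.tensor([3, 5, 9])

# Prepend the first element, then take adjacent differences to recover counts.
counts = torch.cat([group_list[:1], torch.diff(group_list, dim=0)], dim=0)

print(counts)  # tensor([3, 2, 4])
```

The first group's count equals the first cumulative value, and each subsequent count is the difference between adjacent cumulative entries.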
👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:
If CI fails, you can run linting and testing checks locally according to Contributing and Testing.
Has this been merged into main? If yes, please link the related commit.
This pull request has conflicts, please resolve those before we can evaluate the pull request.
Not yet. The PR submitted to main was reviewed in the meeting yesterday, and the only feedback was to add descriptions; no other modifications were requested. The PR to be merged into main is still pending review: #4822.
Signed-off-by: tanqingshan (A) <50050625@china.huawei.com>
Force-pushed from a83de8c to b5d5652
Force-pushed from c418864 to 8256182
```diff
@@ -0,0 +1,8 @@
+-----BEGIN OPENSSH PRIVATE KEY-----
@@ -0,0 +1 @@
+ssh-ed25519 AAAAC3NzaC1lZDI1NTE5AAAAIAQ7BMETcbjbp0ujsehGD12YazJ0L1VmGIGPMgyU25eZ tanqingshandj@gmail.com
```
This pull request has conflicts, please resolve those before we can evaluate the pull request.
What this PR does / why we need it?
This PR fixes a bug in the moe_mlp module by correcting the arguments passed to the torch_npu.npu_dequant_swiglu_quant function. It properly converts group_list from a cumulative sum to counts for the group_index parameter.
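A minimal sketch of the conversion this PR performs, assuming (per the description above) that the group_index parameter expects per-group counts while the surrounding code carries group_list as a cumulative sum; tensor values are illustrative only:

```python
import torch

# Illustrative cumulative group_list: running total of tokens per expert.
cumulative = torch.tensor([4, 7, 12])

# Convert the cumulative sum to per-group counts.
counts = torch.cat([cumulative[:1], torch.diff(cumulative, dim=0)], dim=0)

# Sanity check: taking the cumulative sum of the counts recovers the original,
# so the two representations round-trip exactly.
assert torch.equal(torch.cumsum(counts, dim=0), cumulative)

print(counts)  # tensor([4, 3, 5])
```

Passing the counts form rather than the cumulative form is the essence of the fix.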
Does this PR introduce any user-facing change?
No