Conversation
This reverts commit 45c3c27.
👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:
If CI fails, you can run the linting and testing checks locally according to the Contributing and Testing guides.
Code Review
This pull request reverts the moe_gating_top_k custom operator. The changes consist primarily of removing the operator's implementation files. The key modification is in vllm_ascend/ops/fused_moe/experts_selector.py, where the removed custom operator is replaced with torch_npu.npu_moe_gating_top_k, and the surrounding logic is adjusted to match the native implementation. This appears to be a correct and beneficial update. The pull request is clean and serves its purpose of reverting the feature.
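For readers unfamiliar with what the gating step computes, the sketch below shows the top-k expert selection that a fused gating kernel such as torch_npu.npu_moe_gating_top_k performs: a softmax over the router logits, selection of the k largest weights, and renormalization. This is a minimal pure-Python illustration, not vLLM Ascend's actual code; the real op operates on tensors and takes additional parameters.

```python
import math

def moe_gating_top_k(router_logits, k):
    """Pick the top-k experts for one token from its router logits.

    Illustrative sketch only: softmax over the expert dimension,
    take the k highest weights, renormalize them to sum to 1.
    """
    # numerically stable softmax over experts
    m = max(router_logits)
    exps = [math.exp(x - m) for x in router_logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    # indices of the k largest probabilities, highest first
    topk_idx = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    topk_w = [probs[i] for i in topk_idx]
    # renormalize the selected weights so they sum to 1
    s = sum(topk_w)
    topk_w = [w / s for w in topk_w]
    return topk_idx, topk_w

idx, w = moe_gating_top_k([2.0, 0.5, 1.0, -1.0], k=2)
print(idx)  # experts with the two highest logits: [0, 2]
```

A fused kernel does all three steps in one launch, which is why the native op is preferred over a hand-rolled custom operator.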
Reverts vllm-project#5271 It breaks e2e test - vLLM version: v0.13.0 - vLLM main: vllm-project/vllm@45c1ca1 Signed-off-by: f00824209 <fuzhihong4@huawei.com>
…to FIA_rebase * 'main' of https://github.com/vllm-project/vllm-ascend: (88 commits)
- [1/N] Refactor nightly test structure (vllm-project#5479)
- Docs: Remove deprecated --task parameter for embedding models (vllm-project#5257)
- Revert "moe_gating_top_k" (vllm-project#5512)
- [Doc] Fix issue link for 0.12.0 (vllm-project#5500)
- [CI] update triton ascend version (vllm-project#5392)
- moe_gating_top_k (vllm-project#5271)
- [refactor] refactor model runner capture model (vllm-project#5230)
- Update corresponding vllm commit ID to 12 29 (vllm-project#5475)
- [Kernel] update csrc cmakelist for open-source cann (vllm-project#5458)
- [OP] add custom op aclnnMoeInitRoutingCustom (vllm-project#5251)
- [Refactor][EAGLE] 1/N delete __init__ in mtp_proposer (vllm-project#5176)
- [Refactor][Triton] Move reject sample triton kernels into ops/triton (vllm-project#5324)
- [Feature] support eager mode in model runner v2 (vllm-project#5210)
- [feature] fia support sliding windows (vllm-project#5239)
- Optimize some rejectsampler functions to make npu op launch non-blocking (vllm-project#4587)
- [Feature] Support to use fullgraph with eagle (vllm-project#5118)
- [EPLB][refactor] Modification of the initialization logic for expert_map and log2phy (depends on PR 5285) (vllm-project#5311)
- [Refactor] 6/N Extract common code of class AscendMLAImpl (vllm-project#5314)
- [Refactor] cache cos/sin in mla & remove parameter model in builder (vllm-project#5277)
- update vllm pin to 12.27 (vllm-project#5412)
- ...
Reverts #5271
It breaks the e2e test.