Conversation
Pull request overview
This PR patches the grouped top-k implementation for MoE (Mixture of Experts) operations in vLLM on HPU, addressing dtype conversion behavior based on whether grouped top-k is enabled.
- Adds conditional dtype conversion logic based on the `use_grouped_topk` flag
- Implements a patched `grouped_topk` function with batch invariance support and `e_score_correction_bias` handling
- Applies the patch to the vLLM library's fused_moe layer module
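The grouped top-k selection the patch targets follows the DeepSeek-style two-stage routing: experts are partitioned into groups, the top-scoring groups are chosen first, and top-k experts are then selected only within those groups. The sketch below is an illustrative, self-contained version of that idea (not the patched vLLM code); the function name and defaults are assumptions for demonstration.

```python
import torch


def grouped_topk_sketch(gating_output, topk, num_expert_group, topk_group,
                        e_score_correction_bias=None):
    # Illustrative sketch of grouped top-k routing, not the actual vLLM patch.
    scores = torch.softmax(gating_output, dim=-1)
    # Optional bias is applied only when choosing experts, as in
    # e_score_correction_bias-style routing.
    scores_for_choice = scores if e_score_correction_bias is None \
        else scores + e_score_correction_bias
    num_tokens, num_experts = scores.shape
    experts_per_group = num_experts // num_expert_group
    # Stage 1: score each group by its best expert and keep the top groups.
    group_scores = scores_for_choice.view(
        num_tokens, num_expert_group, experts_per_group).max(dim=-1).values
    group_idx = torch.topk(group_scores, k=topk_group, dim=-1).indices
    group_mask = torch.zeros_like(group_scores)
    group_mask.scatter_(1, group_idx, 1.0)
    # Stage 2: mask out experts from non-selected groups, then take top-k.
    score_mask = group_mask.unsqueeze(-1).expand(
        num_tokens, num_expert_group, experts_per_group).reshape(num_tokens, -1)
    masked_scores = scores_for_choice.masked_fill(score_mask == 0, float("-inf"))
    topk_weights, topk_ids = torch.topk(masked_scores, k=topk, dim=-1)
    if e_score_correction_bias is not None:
        # Weights come from the unbiased scores even when bias guided the choice.
        topk_weights = scores.gather(1, topk_ids)
    return topk_weights, topk_ids
```

With `num_expert_group=2` and `topk_group=1`, only experts from the single best group can ever be selected, which is the property the grouped path must preserve regardless of dtype handling.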
Reviewed changes
Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.
| File | Description |
|---|---|
| vllm_gaudi/ops/hpu_fused_moe.py | Adds conditional dtype conversion, implements patched_grouped_topk function, and applies the grouped_topk patch to vllm module |
| vllm_gaudi/ops/hpu_fp8.py | Adds conditional dtype conversion for FP8 operations based on use_grouped_topk flag |
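Applying the patch to vLLM's fused_moe layer module amounts to rebinding a module attribute at import time. The snippet below illustrates that mechanism with a stand-in namespace; `fake_module` and both function bodies are hypothetical, used only to show the patching pattern.

```python
import types

# "fake_module" stands in for the real vLLM fused_moe layer module;
# the original function here is a placeholder.
fake_module = types.SimpleNamespace(grouped_topk=lambda scores: "original")


def patched_grouped_topk(scores):
    # A replacement implementation would go here (e.g. one with batch
    # invariance support and e_score_correction_bias handling).
    return "patched"


# Rebinding the attribute makes all later lookups through the module
# resolve to the patched function.
fake_module.grouped_topk = patched_grouped_topk
```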
```python
if not layer.use_grouped_topk:
    topk_ids = topk_ids.to(torch.int64)
    topk_weights = topk_weights.to(x.dtype)
```
The dtype conversions for topk_ids and topk_weights are now duplicated - they appear both before line 67 (lines 63-64) and within this conditional block (lines 68-69). When use_grouped_topk is False, these conversions happen twice unnecessarily. Consider moving the earlier conversions (lines 63-64) into an else block, or removing the duplicate logic.
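The suggested fix is to perform the conversions in exactly one guarded place. A minimal sketch of that shape, with an assumed helper name and simplified arguments:

```python
import torch


def convert_outputs(x, topk_weights, topk_ids, use_grouped_topk):
    # Hypothetical condensed sketch of the review suggestion: run the dtype
    # conversions once, guarded by use_grouped_topk, instead of duplicating
    # them unconditionally earlier and again inside the conditional block.
    if not use_grouped_topk:
        topk_ids = topk_ids.to(torch.int64)
        topk_weights = topk_weights.to(x.dtype)
    return topk_weights, topk_ids
```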
```python
topk_weights = topk_weights.view(*x.shape[:-1], -1)
if not layer.use_grouped_topk:
    topk_ids = topk_ids.to(torch.int64)
    topk_weights = topk_weights.to(x.dtype)
```
The dtype conversions for topk_ids and topk_weights are duplicated - they appear both before line 163 (lines 159-160) and within this conditional block (lines 164-165). When use_grouped_topk is False, these conversions happen twice unnecessarily. Consider moving the earlier conversions (lines 159-160) into an else block, or removing the duplicate logic.
Signed-off-by: Xinyu Chen <xinyu1.chen@intel.com>
✅ CI Passed. All checks passed successfully against the following vllm commit:
No description provided.