Conversation
Pull request overview
This PR adds an optimized grouped top-k operation for the Gaudi platform. The operation performs expert selection for mixture-of-experts (MoE) models, with dedicated handling for different batch sizes and an optional score-correction bias.
- Adds a `has_optimized_grouped_topk()` method returning True to indicate platform support
- Implements a `grouped_topk()` method with scoring functions (softmax/sigmoid), group-based expert selection, and optional bias correction
- Includes adaptive algorithm selection based on a token-count threshold (1024)
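To make the selection logic above concrete, here is a minimal NumPy sketch of grouped top-k as described (scoring, optional bias for selection, group pruning, then per-expert top-k). This is an illustration under assumed conventions, not the actual Gaudi implementation; the function name and parameters mirror the description but are otherwise hypothetical, and it assumes experts are laid out contiguously by group.

```python
import numpy as np

def grouped_topk(scores, num_groups, topk_groups, topk,
                 bias=None, scoring="softmax"):
    """Illustrative grouped top-k expert selection (not the Gaudi kernel).

    scores: (num_tokens, num_experts) router logits, experts grouped
    contiguously into `num_groups` groups. Only the best `topk_groups`
    groups survive; the top-`topk` experts are then taken from them.
    """
    num_tokens, num_experts = scores.shape
    group_size = num_experts // num_groups

    # 1) Scoring function: softmax or sigmoid over the logits.
    if scoring == "softmax":
        e = np.exp(scores - scores.max(axis=-1, keepdims=True))
        probs = e / e.sum(axis=-1, keepdims=True)
    else:  # "sigmoid"
        probs = 1.0 / (1.0 + np.exp(-scores))

    # 2) Optional score-correction bias, applied for selection only;
    #    the returned weights stay unbiased.
    sel = probs + bias if bias is not None else probs

    # 3) Rank groups per token (group score = best expert in the group)
    #    and keep the top `topk_groups` groups.
    grouped = sel.reshape(num_tokens, num_groups, group_size)
    group_scores = grouped.max(axis=-1)
    top_groups = np.argsort(-group_scores, axis=-1)[:, :topk_groups]

    # 4) Mask experts in dropped groups, then take the top-k experts.
    keep = np.zeros((num_tokens, num_groups), dtype=bool)
    np.put_along_axis(keep, top_groups, True, axis=-1)
    masked = np.where(np.repeat(keep, group_size, axis=-1), sel, -np.inf)
    topk_ids = np.argsort(-masked, axis=-1)[:, :topk]
    topk_weights = np.take_along_axis(probs, topk_ids, axis=-1)
    return topk_weights, topk_ids
```

An optimized kernel would additionally dispatch between algorithm variants based on the token count (the 1024 threshold mentioned above), which this sketch omits.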
✅ CI Passed: all checks passed successfully.
Is it possible to monkey patch it? I think we can push for vllm-project/vllm#29575 afterwards, since that usually needs some discussion and alignment.
Signed-off-by: Xinyu Chen <xinyu1.chen@intel.com>
It is merged into #735.
Hourly fixes:
- CustomOp: grouped topk #647 (depends on vllm-project/vllm#29575)
- Fix HpuCommunicator.dispatch #732 (fix for upstream changes: https://github.com/vllm-project/vllm/pull/30014/files)

Signed-off-by: Iryna Boiko <iboiko@habana.ai>
Revert "…project#732 (vllm-project#735)". This reverts commit d6896de.
(vllm-project#735) Hourly fixes:
- CustomOp: grouped topk vllm-project#647 (depends on vllm-project/vllm#29575)
- Fix HpuCommunicator.dispatch vllm-project#732 (fix for upstream changes: https://github.com/vllm-project/vllm/pull/30014/files)

Signed-off-by: Iryna Boiko <iboiko@habana.ai>
depends on vllm-project/vllm#29575