[MiniMax-M2] Remove reduce_results kwarg from FusedMoE init by mounikamandava · Pull Request #1444 · vllm-project/vllm-gaudi

mounikamandava · 2026-05-12T19:55:52Z

Removes the reduce_results=False argument passed to FusedMoE in HpuMiniMaxM2MoE which is no longer accepted by upstream VLLM and causes worker startup to fail.

Upstream VLLM removed the reduce_results parameter from Fused MoE_init_ (vllm/model_executor/layers/fused_moe/layer.py). THe MoE output reduction is now decided internally based on TP/EP topology. The corresponding upstream model MiniMaxM2MoE (vllm/model_executor/models/minimax_m2.py) was updated accordingly, but the HPU port HpuMiniMaxM2MoE was not, so it still passes the now-unknown kwarg.

Fix :
Drop the reduce_results=False kwarg from the FusedMoE construction in HpuMiniMaxM2MoE. Behavior is unchanged because upstream now governs MoE output reduction internally based on TP/EP configuration.

Copilot

Pull request overview

This PR updates the Gaudi-specific MiniMax-M2 MoE implementation to stay compatible with upstream vLLM by removing a no-longer-supported reduce_results keyword argument when constructing FusedMoE, preventing worker startup failures.

Changes:

Remove the deprecated reduce_results=False kwarg from FusedMoE(...) initialization in HpuMiniMaxM2MoE.

github-actions · 2026-05-13T01:44:39Z

✅ CI Passed

All checks passed successfully against the following vllm commit:
54f548e9e58087f0155e4e164e416ad7efdfde6d

iboiko-habana

reduce_results was removed in vllm-project/vllm#35949. Thanks for fix

[MiniMax-M2] Remove reduce_results kwarg from FusedMoE init

4f9654e

Copilot AI review requested due to automatic review settings May 12, 2026 19:55

mounikamandava requested review from PatrykWo, adobrzyn, afierka-intel, iboiko-habana, jbyczkow, kamil-kaczor, ksmusz, mgawarkiewicz-intel, michalkuligowski and xuechendi as code owners May 12, 2026 19:55

Copilot started reviewing on behalf of mounikamandava May 12, 2026 19:56 View session

Copilot AI reviewed May 12, 2026

View reviewed changes

github-actions Bot mentioned this pull request May 12, 2026

🚦 Team Review Dashboard #701

Open

iboiko-habana approved these changes May 13, 2026

View reviewed changes

iboiko-habana merged commit cbc78c0 into vllm-project:main May 13, 2026
5 of 6 checks passed

skavulya mentioned this pull request May 16, 2026

Fix accuracy issue in minimax_m2 with TP > 1 #1451

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[MiniMax-M2] Remove reduce_results kwarg from FusedMoE init#1444

[MiniMax-M2] Remove reduce_results kwarg from FusedMoE init#1444
iboiko-habana merged 1 commit into
vllm-project:mainfrom
mounikamandava:fix-minimax-m2

mounikamandava commented May 12, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

github-actions Bot commented May 13, 2026

Uh oh!

iboiko-habana left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

mounikamandava commented May 12, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

github-actions Bot commented May 13, 2026

✅ CI Passed

Uh oh!

iboiko-habana left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants