
Port of: Fix stale gate ref overriding caller router_logits in dp_size==1 MoE fast path (#1469) #1492

Merged: mgawarkiewicz-intel merged 2 commits into vllm-project:releases/v0.21.0 from iboiko-habana:port1469 on May 26, 2026

Conversation

@iboiko-habana
Collaborator

No description provided.

…e==1 MoE fast path (vllm-project#1469)

Signed-off-by: Iryna Boiko <iboiko@habana.ai>
Copilot AI review requested due to automatic review settings May 25, 2026 08:26
Contributor

Copilot AI left a comment

Pull request overview

Ports an upstream fix to prevent the dp_size==1 MoE fast path from using a stale cached gate reference after INC conversion, which can override the caller-provided router_logits and trigger shape/dtype mismatches (notably under fp8).

Changes:

  • When synchronizing post-INC MoE state, refreshes runner._hpu_gate_ref to point to the post-INC block-level gate when runner.gate is cleared.
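The stale-reference problem described above can be illustrated with a minimal sketch. Only the names `runner.gate`, `_hpu_gate_ref`, and `block_gate` come from this PR; the `Gate` and `Runner` classes, the `sync_post_inc_state` function, and its `refresh_ref` flag are hypothetical stand-ins, not the actual vllm code.

```python
class Gate:
    """Hypothetical stand-in for a router gate; `tag` marks pre- vs post-INC."""

    def __init__(self, tag):
        self.tag = tag


class Runner:
    """Hypothetical stand-in for the MoE runner."""

    def __init__(self, gate):
        self.gate = gate
        # Cached reference that the fast path is assumed to consult.
        self._hpu_gate_ref = gate


def sync_post_inc_state(runner, block_gate, refresh_ref):
    """Assumed shape of the post-INC sync step (illustrative only)."""
    # The runner-level gate is cleared after conversion.
    runner.gate = None
    # The fix: repoint the cached reference at the post-INC block-level gate.
    if refresh_ref and block_gate is not None:
        object.__setattr__(runner, "_hpu_gate_ref", block_gate)


# Without the refresh, the cached reference still points at the old gate.
stale = Runner(Gate("pre-INC"))
sync_post_inc_state(stale, Gate("post-INC"), refresh_ref=False)

# With the refresh, the cached reference follows the converted gate.
fixed = Runner(Gate("pre-INC"))
sync_post_inc_state(fixed, Gate("post-INC"), refresh_ref=True)
```

In this sketch, clearing `runner.gate` alone does not touch `_hpu_gate_ref`, so any code path that reads the cached reference keeps seeing the pre-conversion object until it is refreshed explicitly.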

Comment on lines +4824 to +4825
if block_gate is not None:
    object.__setattr__(runner, "_hpu_gate_ref", block_gate)
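The PR does not say why the assignment goes through `object.__setattr__` rather than a plain attribute assignment. One plausible reason, stated here as an assumption, is that `torch.nn.Module.__setattr__` registers module-valued attributes as submodules, and `object.__setattr__` bypasses that registration. The sketch below imitates that behaviour with a hypothetical pure-Python class, `RegisteringBase`, so it runs without torch.

```python
class RegisteringBase:
    """Hypothetical class that mimics submodule registration on assignment."""

    def __init__(self):
        # Bypass our own __setattr__ while creating the registry.
        object.__setattr__(self, "_modules", {})

    def __setattr__(self, name, value):
        if isinstance(value, RegisteringBase):
            # Module-like values are recorded as children.
            self._modules[name] = value
        else:
            object.__setattr__(self, name, value)

    def __getattr__(self, name):
        # Called only when normal lookup fails; fall back to the registry.
        modules = object.__getattribute__(self, "_modules")
        if name in modules:
            return modules[name]
        raise AttributeError(name)


runner = RegisteringBase()
block_gate = RegisteringBase()

# Plain assignment registers the gate as a child of the runner.
runner.gate = block_gate

# object.__setattr__ stores a plain reference without registering it.
object.__setattr__(runner, "_hpu_gate_ref", block_gate)
```

Here `gate` ends up in the registry while `_hpu_gate_ref` is stored as an ordinary instance attribute that points at the same object.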
@github-actions

✅ CI Passed

All checks passed successfully against the following vllm commit:
ad7125a431e176d4161099480a66f0169609a690

Collaborator

@kamil-kaczor kamil-kaczor left a comment

I trust you although Copilot says it's not needed. GL

@mgawarkiewicz-intel mgawarkiewicz-intel merged commit 1dc158d into vllm-project:releases/v0.21.0 May 26, 2026
3 checks passed
4 participants