Skip to content

[FIX_FOR_VLLM_LATEST] Fix for hourly#843

Closed
adobrzyn wants to merge 26 commits intomainfrom
adobrzyn/fix_for833
Closed

[FIX_FOR_VLLM_LATEST] Fix for hourly#843
adobrzyn wants to merge 26 commits intomainfrom
adobrzyn/fix_for833

Conversation

@adobrzyn
Copy link
Copy Markdown
Collaborator

iboiko-habana and others added 7 commits January 16, 2026 17:57
…attention #32238

Signed-off-by: Iryna Boiko <iboiko@habana.ai>
Signed-off-by: Iryna Boiko <iboiko@habana.ai>
… you mean: 'input_size'

Signed-off-by: root <root@adobrzyn-31x3-g3-mpijob-worker-0.adobrzyn-31x3-g3-mpijob-worker.framework.svc.cluster.local>
@github-actions
Copy link
Copy Markdown

🚧 CI Blocked

The main CI workflow was not started for the following reason:

Your branch is behind the base branch. Please merge or rebase to get the latest changes.

Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR updates import paths to align with the latest version of vLLM, addressing breaking changes from upstream vLLM updates. The changes primarily reorganize imports for the FusedMoE router components and remove an unused parameter from a function call.

Changes:

  • Updated import paths for GroupedTopk and FusedMoERouter to reflect new module structure in vLLM
  • Removed input_scale parameter from apply_block_fp8_linear_hpu function call
  • Added FP4 BMM (Block Matrix Multiply) support check in the attention backend

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 1 comment.

Show a summary per file
File Description
vllm_gaudi/ops/hpu_fused_moe.py Updated imports for GroupedTopk and FusedMoERouter to new module paths
vllm_gaudi/ops/hpu_fp8.py Updated FusedMoERouter import path
vllm_gaudi/ops/hpu_compressed_tensors.py Updated FusedMoERouter import path
vllm_gaudi/extension/ops.py Removed input_scale parameter from function call
vllm_gaudi/attention/backends/hpu_attn.py Added FP4 BMM enablement check with dtype validation

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Comment thread vllm_gaudi/ops/hpu_fused_moe.py Outdated
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Signed-off-by: Agata Dobrzyniewicz <160237065+adobrzyn@users.noreply.github.com>
@github-actions
Copy link
Copy Markdown

🚧 CI Blocked

The main CI workflow was not started for the following reason:

Your branch is behind the base branch. Please merge or rebase to get the latest changes.

Copy link
Copy Markdown
Collaborator

@kamil-kaczor kamil-kaczor left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

root and others added 5 commits January 20, 2026 17:10
Signed-off-by: root <root@adobrzyn-9z1k-g3-mpijob-worker-0.adobrzyn-9z1k-g3-mpijob-worker.framework.svc.cluster.local>
Signed-off-by: Agata Dobrzyniewicz <160237065+adobrzyn@users.noreply.github.com>
Signed-off-by: root <root@adobrzyn-9z1k-g3-mpijob-worker-0.adobrzyn-9z1k-g3-mpijob-worker.framework.svc.cluster.local>
Signed-off-by: Iryna Boiko <iboiko@habana.ai>
@github-actions
Copy link
Copy Markdown

🚧 CI Blocked

The main CI workflow was not started for the following reason:

Your branch is behind the base branch. Please merge or rebase to get the latest changes.

Signed-off-by: Iryna Boiko <iboiko@habana.ai>
Signed-off-by: Iryna Boiko <iboiko@habana.ai>
Signed-off-by: Iryna Boiko <iboiko@habana.ai>
Signed-off-by: Iryna Boiko <iboiko@habana.ai>
@github-actions
Copy link
Copy Markdown

🚧 CI Blocked

The main CI workflow was not started for the following reason:

Your branch is behind the base branch. Please merge or rebase to get the latest changes.

Signed-off-by: Iryna Boiko <iboiko@habana.ai>
Signed-off-by: Iryna Boiko <iboiko@habana.ai>
Signed-off-by: Iryna Boiko <iboiko@habana.ai>
Signed-off-by: Iryna Boiko <iboiko@habana.ai>
@iboiko-habana
Copy link
Copy Markdown
Collaborator

everything is delivered in #876

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants