[FIX_FOR_VLLM_CUSTOM=4efd6ffde09477800294a8ed9cc752017812c3b1] Fix minimax_m2 import after mamba LINEAR refactor (+1 more) by pawel-olejniczak · Pull Request #1529 · vllm-project/vllm-gaudi

pawel-olejniczak · 2026-06-05T08:33:38Z

This PR consolidates 2 hourly-CI fixes against vllm@4efd6ffde09477800294a8ed9cc752017812c3b1 per the single-rolling-PR rule (invariant I9).

Bug 1: Fix minimax_m2 import after mamba LINEAR refactor

State machine id: mamba_linear_attn_import_missing
Commit: a992350

Root cause

Upstream vLLM moved MiniMaxText01RMSNormTP out of vllm.model_executor.layers.mamba.linear_attn into a dedicated module vllm.model_executor.layers.minimax_rms_norm. The HPU minimax_m2 model is eagerly imported by register_model(), so the stale import broke every CI test at import time.

Upstream PR

vllm-project/vllm#43556

Fix

Update the import of MiniMaxText01RMSNormTP to vllm.model_executor.layers.minimax_rms_norm.

Bug 2: Fix multi_model_api_server imports after serving-utils consolidation

State machine id: multi_model_entrypoints_logger_missing
Commit: 2c4e39f

Root cause

Upstream vLLM consolidated the online serving utilities, removing entrypoints/logger.py, entrypoints/openai/server_utils.py and entrypoints/utils.py. The HPU multi-model API server imported three symbols from those removed modules, breaking the unit-test import path.

Upstream PR

vllm-project/vllm#44479

Fix

Repoint three imports in multi_model_api_server.py to the consolidated locations: serve.utils.request_logger (RequestLogger), serve.utils.server_utils (get_uvicorn_log_config) and serve.utils.api_utils (cli_env_setup, process_lora_modules).

HPU verification

Pod: Gaudi g3
Full commit stack (origin/main..HEAD) re-verified against vllm@4efd6ffde09477800294a8ed9cc752017812c3b1: PASS

Related PRs

None

…nimax_m2 import after mamba LINEAR refactor Root cause: upstream vLLM #43556 relocated MiniMaxText01RMSNormTP out of vllm.model_executor.layers.mamba.linear_attn into the new vllm.model_executor.layers.minimax_rms_norm package. Upstream: vllm-project/vllm#43556 Fix: import MiniMaxText01RMSNormTP from minimax_rms_norm. Signed-off-by: Paweł Olejniczak <pawelx.olejniczak@intel.com>

…lti_model_api_server imports after serving-utils consolidation Root cause: upstream vLLM #44479 consolidated online serving utils, removing vllm/entrypoints/logger.py, vllm/entrypoints/openai/server_utils.py and vllm/entrypoints/utils.py. Upstream: vllm-project/vllm#44479 Fix: update three imports in multi_model_api_server.py to new canonical paths (RequestLogger -> serve.utils.request_logger; get_uvicorn_log_config -> serve.utils.server_utils; cli_env_setup/process_lora_modules -> serve.utils.api_utils). Signed-off-by: Paweł Olejniczak <pawelx.olejniczak@intel.com>

Copilot

Pull request overview

Note

Copilot was unable to run its full agentic suite in this review.

Updates import paths to align with refactored module locations in vLLM/vLLM-Gaudi, ensuring these components resolve from their new packages.

Changes:

Switched MiniMaxText01RMSNormTP import to vllm.model_executor.layers.minimax_rms_norm.
Updated OpenAI multi-model API server utility imports (RequestLogger, get_uvicorn_log_config, cli_env_setup, process_lora_modules) to vllm.entrypoints.serve.utils.*.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated no comments.

File	Description
vllm_gaudi/models/minimax_m2.py	Adjusts RMSNorm import to the new MiniMax-specific module path.
vllm_gaudi/entrypoints/openai/multi_model_api_server.py	Updates server-related imports to new `serve.utils` locations.

pawel-olejniczak · 2026-06-05T09:10:43Z

Closing as duplicate. These two commits were consolidated into the existing rolling hourly-CI fix PR #1525 (per the single-rolling-PR rule). Only one open PR with the FIX_FOR_VLLM_CUSTOM prefix is allowed at a time.

pawel-olejniczak added 2 commits June 5, 2026 10:43

Copilot AI review requested due to automatic review settings June 5, 2026 08:33

pawel-olejniczak requested review from PatrykWo, adobrzyn, afierka-intel, iboiko-habana, jbyczkow, ksmusz, mgawarkiewicz-intel, michalkuligowski and xuechendi as code owners June 5, 2026 08:33

pawel-olejniczak requested a deployment to pre-merge-approval June 5, 2026 08:33 — with GitHub Actions Waiting

Copilot AI reviewed Jun 5, 2026

View reviewed changes

pawel-olejniczak closed this Jun 5, 2026

pawel-olejniczak deleted the fix/batch-cycle_202606 branch June 5, 2026 09:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FIX_FOR_VLLM_CUSTOM=4efd6ffde09477800294a8ed9cc752017812c3b1] Fix minimax_m2 import after mamba LINEAR refactor (+1 more)#1529

[FIX_FOR_VLLM_CUSTOM=4efd6ffde09477800294a8ed9cc752017812c3b1] Fix minimax_m2 import after mamba LINEAR refactor (+1 more)#1529
pawel-olejniczak wants to merge 2 commits into
vllm-project:mainfrom
pawel-olejniczak:fix/batch-cycle_202606

pawel-olejniczak commented Jun 5, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

pawel-olejniczak commented Jun 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

pawel-olejniczak commented Jun 5, 2026

Bug 1: Fix minimax_m2 import after mamba LINEAR refactor

Root cause

Upstream PR

Fix

Bug 2: Fix multi_model_api_server imports after serving-utils consolidation

Root cause

Upstream PR

Fix

HPU verification

Related PRs

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

pawel-olejniczak commented Jun 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants