Skip to content

[FIX_FOR_VLLM_CUSTOM=4efd6ffde09477800294a8ed9cc752017812c3b1] Fix minimax_m2 import after mamba LINEAR refactor (+1 more)#1529

Closed
pawel-olejniczak wants to merge 2 commits into
vllm-project:mainfrom
pawel-olejniczak:fix/batch-cycle_202606
Closed

[FIX_FOR_VLLM_CUSTOM=4efd6ffde09477800294a8ed9cc752017812c3b1] Fix minimax_m2 import after mamba LINEAR refactor (+1 more)#1529
pawel-olejniczak wants to merge 2 commits into
vllm-project:mainfrom
pawel-olejniczak:fix/batch-cycle_202606

Conversation

@pawel-olejniczak

Copy link
Copy Markdown
Collaborator

This PR consolidates 2 hourly-CI fixes against vllm@4efd6ffde09477800294a8ed9cc752017812c3b1 per the single-rolling-PR rule (invariant I9).

Bug 1: Fix minimax_m2 import after mamba LINEAR refactor

  • State machine id: mamba_linear_attn_import_missing
  • Commit: a992350

Root cause

Upstream vLLM moved MiniMaxText01RMSNormTP out of vllm.model_executor.layers.mamba.linear_attn into a dedicated module vllm.model_executor.layers.minimax_rms_norm. The HPU minimax_m2 model is eagerly imported by register_model(), so the stale import broke every CI test at import time.

Upstream PR

vllm-project/vllm#43556

Fix

Update the import of MiniMaxText01RMSNormTP to vllm.model_executor.layers.minimax_rms_norm.

Bug 2: Fix multi_model_api_server imports after serving-utils consolidation

  • State machine id: multi_model_entrypoints_logger_missing
  • Commit: 2c4e39f

Root cause

Upstream vLLM consolidated the online serving utilities, removing entrypoints/logger.py, entrypoints/openai/server_utils.py and entrypoints/utils.py. The HPU multi-model API server imported three symbols from those removed modules, breaking the unit-test import path.

Upstream PR

vllm-project/vllm#44479

Fix

Repoint three imports in multi_model_api_server.py to the consolidated locations: serve.utils.request_logger (RequestLogger), serve.utils.server_utils (get_uvicorn_log_config) and serve.utils.api_utils (cli_env_setup, process_lora_modules).

HPU verification

  • Pod: Gaudi g3
  • Full commit stack (origin/main..HEAD) re-verified against vllm@4efd6ffde09477800294a8ed9cc752017812c3b1: PASS

Related PRs

None

…nimax_m2 import after mamba LINEAR refactor

Root cause: upstream vLLM #43556 relocated MiniMaxText01RMSNormTP out of
vllm.model_executor.layers.mamba.linear_attn into the new
vllm.model_executor.layers.minimax_rms_norm package.
Upstream: vllm-project/vllm#43556
Fix: import MiniMaxText01RMSNormTP from minimax_rms_norm.

Signed-off-by: Paweł Olejniczak <pawelx.olejniczak@intel.com>
…lti_model_api_server imports after serving-utils consolidation

Root cause: upstream vLLM #44479 consolidated online serving utils, removing vllm/entrypoints/logger.py, vllm/entrypoints/openai/server_utils.py and vllm/entrypoints/utils.py.

Upstream: vllm-project/vllm#44479

Fix: update three imports in multi_model_api_server.py to new canonical paths (RequestLogger -> serve.utils.request_logger; get_uvicorn_log_config -> serve.utils.server_utils; cli_env_setup/process_lora_modules -> serve.utils.api_utils).

Signed-off-by: Paweł Olejniczak <pawelx.olejniczak@intel.com>

Copilot AI left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Note

Copilot was unable to run its full agentic suite in this review.

Updates import paths to align with refactored module locations in vLLM/vLLM-Gaudi, ensuring these components resolve from their new packages.

Changes:

  • Switched MiniMaxText01RMSNormTP import to vllm.model_executor.layers.minimax_rms_norm.
  • Updated OpenAI multi-model API server utility imports (RequestLogger, get_uvicorn_log_config, cli_env_setup, process_lora_modules) to vllm.entrypoints.serve.utils.*.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated no comments.

File Description
vllm_gaudi/models/minimax_m2.py Adjusts RMSNorm import to the new MiniMax-specific module path.
vllm_gaudi/entrypoints/openai/multi_model_api_server.py Updates server-related imports to new serve.utils locations.

@pawel-olejniczak

Copy link
Copy Markdown
Collaborator Author

Closing as duplicate. These two commits were consolidated into the existing rolling hourly-CI fix PR #1525 (per the single-rolling-PR rule). Only one open PR with the FIX_FOR_VLLM_CUSTOM prefix is allowed at a time.

@pawel-olejniczak pawel-olejniczak deleted the fix/batch-cycle_202606 branch June 5, 2026 09:11
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants