[NPU] bugfix: resolve modelslim load weights bug#19472

Merged
sglang-npu-bot merged 1 commit into sgl-project:main from Hexq0210:bugfix_modelslim
Feb 28, 2026
Conversation

@Hexq0210
Contributor

@Hexq0210 Hexq0210 commented Feb 27, 2026

Motivation

With the Qwen3.5 model, the modelslim quantization path has a weight-loading bug that prevents the service from starting.
Before fix:
image
After fix:
image

Modifications

python/sglang/srt/layers/quantization/modelslim/modelslim.py
The language_model field was already added to the prefix in qwen3_vl.py (PR #18926), so modelslim.py no longer needs to apply its own replace operation for language_model. This PR removes that replace operation.
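
To illustrate why a redundant prefix rewrite can break weight loading, here is a minimal sketch. It is not the actual code from modelslim.py or qwen3_vl.py: the dictionary `QUANT_DESCRIPTION`, the helper names, the layer names, and the exact form of the replace are all hypothetical, chosen only to show the general failure mode in which a prefix that already contains `language_model` gets rewritten a second time and no longer matches any key.

```python
from typing import Optional

# Hypothetical quantization description, keyed by full weight name.
QUANT_DESCRIPTION = {
    "model.language_model.layers.0.mlp.gate_proj.weight": "W8A8",
}


def lookup_with_replace(prefix: str) -> Optional[str]:
    """Hypothetical old behavior: always insert `language_model` into the prefix."""
    rewritten = prefix.replace("model.", "model.language_model.", 1)
    return QUANT_DESCRIPTION.get(rewritten + ".weight")


def lookup_direct(prefix: str) -> Optional[str]:
    """Hypothetical fixed behavior: trust the prefix supplied by the model file."""
    return QUANT_DESCRIPTION.get(prefix + ".weight")


# Prefix as it might look before the model file added `language_model` (assumption).
old_prefix = "model.layers.0.mlp.gate_proj"
# Prefix as it might look once the model file includes `language_model` (assumption).
new_prefix = "model.language_model.layers.0.mlp.gate_proj"

found_old = lookup_with_replace(old_prefix)     # rewrite produces a matching key
found_broken = lookup_with_replace(new_prefix)  # rewrite duplicates the segment
found_fixed = lookup_direct(new_prefix)         # no rewrite, key matches again
```

In this sketch the second call builds `model.language_model.language_model...`, which is absent from the description, so the lookup returns `None`. Dropping the rewrite restores the match, which mirrors the reasoning given above for removing the replace operation.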

Accuracy Tests

NA

Benchmarking and Profiling

Checklist

Review Process

  1. Ping Merge Oncalls to start the PR flow. See the PR Merge Process.
  2. Get approvals from CODEOWNERS and other reviewers.
  3. Trigger CI tests with comments or contact authorized users to do so.
    • /tag-run-ci-label, /rerun-failed-ci, /tag-and-rerun-ci
  4. After green CI and required approvals, ask Merge Oncalls to merge.

@gemini-code-assist
Contributor

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

@Hexq0210 Hexq0210 changed the title bugfix: resolve modelslim load weights bug [Ascend] bugfix: resolve modelslim load weights bug Feb 27, 2026
@iforgetmyname iforgetmyname changed the title [Ascend] bugfix: resolve modelslim load weights bug [NPU] bugfix: resolve modelslim load weights bug Feb 27, 2026
@sglang-npu-bot
Collaborator

/tag-and-rerun-ci

@Hexq0210
Contributor Author

/rerun-failed-ci

@sglang-npu-bot
Collaborator

/rerun-failed-ci

@sglang-npu-bot
Collaborator

/rerun-failed-ci

@sglang-npu-bot
Collaborator

/rerun-failed-ci

@sglang-npu-bot sglang-npu-bot merged commit 4ebe9e1 into sgl-project:main Feb 28, 2026
371 of 419 checks passed
magicYang1573 pushed a commit to magicYang1573/sglang that referenced this pull request Mar 9, 2026
Wangzheee pushed a commit to Wangzheee/sglang that referenced this pull request Mar 21, 2026
3 participants