Skip to content

[NPU] fix VoxtralRealtimeTextModel miss in 5.3.0 transformers version for model Mistral-Small-3.1-2506#21674

Closed
ZeyuanChen2000 wants to merge 1 commit into
sgl-project:mainfrom
ZeyuanChen2000:mistral-small-3.1-24B
Closed

[NPU] fix VoxtralRealtimeTextModel miss in 5.3.0 transformers version for model Mistral-Small-3.1-2506#21674
ZeyuanChen2000 wants to merge 1 commit into
sgl-project:mainfrom
ZeyuanChen2000:mistral-small-3.1-24B

Conversation

@ZeyuanChen2000
Copy link
Copy Markdown
Contributor

Motivation

In the 5.3.0 transformers version, running model case of test_npu_mistral_small_3_1_24b_instruct_2503.py will report error Could not find VoxtralRealtimeTextModel when call function _config_cls_name_to_arch_name_mapping()
image

Modifications

In 5.3.0 transformers, _model_mapping becomes _LazyAutoMapping object, use raw_config_mapping and raw_model_mapping to reconstruct the mapping.

Accuracy Tests

Speed Tests and Profiling

Checklist

Review and Merge Process

  1. Ping Merge Oncalls to start the process. See the PR Merge Process.
  2. Get approvals from CODEOWNERS and other reviewers.
  3. Trigger CI tests with comments or contact authorized users to do so.
    • Common commands include /tag-and-rerun-ci, /tag-run-ci-label, /rerun-failed-ci
  4. After green CI and required approvals, ask Merge Oncalls or people with Write permission to merge the PR.

@github-actions github-actions Bot added the npu label Mar 30, 2026
Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request updates the model architecture mapping in llava.py to handle newer transformers versions by accessing internal mapping attributes and adds specific execution arguments to the Mistral model tests on NPU. Feedback suggests documenting the reliance on private library attributes and refactoring the mapping logic into a dictionary comprehension for improved clarity.

Comment thread python/sglang/srt/models/llava.py
@ZeyuanChen2000 ZeyuanChen2000 force-pushed the mistral-small-3.1-24B branch from 5a1c20e to 55f9795 Compare March 31, 2026 02:36
@sglang-npu-bot
Copy link
Copy Markdown
Collaborator

/tag-and-rerun-ci

@sglang-npu-bot
Copy link
Copy Markdown
Collaborator

/rerun-failed-ci

1 similar comment
@sglang-npu-bot
Copy link
Copy Markdown
Collaborator

/rerun-failed-ci

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants