[BUGFIX] Fix Pixtral consolidated format vision weight loading #39916
Conversation
Code Review
This pull request adds support for Mistral native (consolidated) weight formats to the Pixtral model and introduces a new test case for consolidated loading. Feedback indicates that the added test uses a text-only model, which fails to exercise the vision encoder weight loading logic. Additionally, the weight loading implementation may fail to match keys containing a '.weight' suffix, and the remapping logic for native parameter names is inefficiently located and may lead to dropped weights.
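One of the points raised, the '.weight' suffix issue, is easy to illustrate. Below is a minimal, standalone sketch (not the PR's actual matching code) showing why suffix-based matching on the bare layer name misses real checkpoint keys, which always carry a trailing ".weight":

```python
# Illustrative sketch of the '.weight' suffix pitfall mentioned in the review;
# the key string is a made-up example, not taken from the PR.
key = "vision_encoder.layers.0.attention.wq.weight"

# Fails: the checkpoint key ends with ".weight", not ".wq".
matches_endswith = key.endswith(".wq")   # False

# Works: a substring match tolerates the trailing ".weight".
matches_substring = ".wq." in key        # True

print(matches_endswith, matches_substring)
```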
Signed-off-by: Julien Denize <julien.denize@mistral.ai>
Signed-off-by: juliendenize <julien.denize@mistral.ai>
Force-pushed from e1ed624 to b22a56b
| (".qkv_proj", ".v_proj", "v"), | ||
| (".gate_up_proj", ".gate_proj", 0), | ||
| (".gate_up_proj", ".up_proj", 1), | ||
| # Mistral native (consolidated) format |
The wo and w2 parameters are handled via _vision_encoder_name_remap rather than through _vision_encoder_stacked_params. Since they're not sharded across TP ranks like qkv/w1/w3, they don't appear in the stacked params list. Is there a reason they couldn't be added to the stacked params list with their shard_id, or is the remap approach more robust to variations in how these keys appear across different checkpoint formats?
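For context, a minimal sketch of the plain-rename alternative being discussed is shown below. The helper name `_vision_encoder_name_remap` comes from the comment above, but its contents and the wrapper function are illustrative assumptions, not the PR's literal code:

```python
# Illustrative remap for the non-sharded wo/w2 weights (assumed contents,
# not the PR's exact mapping).
_vision_encoder_name_remap = {
    ".wo.": ".o_proj.",     # attention output projection
    ".w2.": ".down_proj.",  # MLP down projection
}

def remap_native_name(name: str) -> str:
    """Rewrite a Mistral-native checkpoint key to the HF-style module name."""
    for native, hf in _vision_encoder_name_remap.items():
        if native in name:
            return name.replace(native, hf)
    return name

# Example: a consolidated-format key maps onto the merged module's name.
print(remap_native_name("vision_encoder.layers.0.attention.wo.weight"))
# -> vision_encoder.layers.0.attention.o_proj.weight
```

Because these weights are not split across TP ranks, a simple rename followed by the default weight loader is sufficient, whereas the stacked-params path exists to route shards of merged layers.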
Hey, thanks for the merge! To answer your question @NickLucche: it does, by making sure the output is not garbled when running on a smaller GPU than Pixtral needs. I think I could lower the GPU size even further, but AFAIK the CI should always use 16 GB, right? So now we should catch it whenever a regression happens!
Purpose
#36963 replaced the Pixtral vision encoder's nn.Linear layers (wq/wk/wv/wo/w1/w2/w3) with QKVParallelLinear and MergedColumnParallelLinear (qkv_proj/o_proj/gate_up_proj/down_proj) to support LoRA. However, the weight-loading stacked_params mapping only covered HF-style names (q_proj, k_proj, etc.), not the Mistral native names (wq, wk, etc.), so vision encoder weights were silently dropped when loading consolidated-format checkpoints.
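A minimal sketch of what a stacked-params mapping covering both name families could look like; the tuple layout (merged name, checkpoint name, shard id) follows vLLM's usual convention, but the exact entries below are illustrative rather than the PR's literal code:

```python
# Sketch: route both HF-style and Mistral-native checkpoint names into the
# merged qkv_proj / gate_up_proj parameters (illustrative, not the PR's code).
stacked_params_mapping = [
    # HF-style names
    (".qkv_proj", ".q_proj", "q"),
    (".qkv_proj", ".k_proj", "k"),
    (".qkv_proj", ".v_proj", "v"),
    (".gate_up_proj", ".gate_proj", 0),
    (".gate_up_proj", ".up_proj", 1),
    # Mistral native (consolidated) names
    (".qkv_proj", ".wq", "q"),
    (".qkv_proj", ".wk", "k"),
    (".qkv_proj", ".wv", "v"),
    (".gate_up_proj", ".w1", 0),
    (".gate_up_proj", ".w3", 1),
]
```

Without the second group, consolidated-format keys never match an entry, fall through the loader, and are dropped instead of being loaded into the merged modules.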
Test Plan
Added a Ministral test that runs on small GPUs instead of relying on Pixtral.
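For illustration, a hedged sketch of what such a consolidated-format smoke test could look like; the model id, prompt, and assertion are placeholders, not the PR's actual test:

```python
# Sketch of a consolidated-format loading smoke test (placeholder model id
# and prompt; the mistral tokenizer/config/load modes select native checkpoints).
from vllm import LLM, SamplingParams


def test_consolidated_format_loading():
    llm = LLM(
        model="mistralai/Ministral-8B-Instruct-2410",  # placeholder model id
        tokenizer_mode="mistral",
        config_format="mistral",
        load_format="mistral",
        max_model_len=2048,
    )
    outputs = llm.generate(["Hello"], SamplingParams(max_tokens=8))
    # Loading succeeded and generation produced non-empty text.
    assert outputs and outputs[0].outputs[0].text
```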
Test Result
Passing.
Essential Elements of an Effective PR Description Checklist
`supported_models.md` and `examples` for a new model.