[Bugfix][Model] Fix PixtralForConditionalGeneration LoRA by jeejeelee · Pull Request #36963 · vllm-project/vllm

jeejeelee · 2026-03-13T07:54:04Z

Purpose

Fix #34591

Test Plan

Testing with the real LoRA adapter

Test Result

gemini-code-assist

Code Review

This pull request refactors the Pixtral vision encoder to utilize vLLM's parallel layers, which is a significant and well-executed change to enable tensor parallelism, quantization, and LoRA support. The code is mostly clean and consistent. I have one suggestion to improve the robustness of the weight loading logic for future compatibility with features like quantization.
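To illustrate why the layer type can matter for LoRA, here is a toy sketch in plain Python. It is not vLLM code: `PlainLinear`, `ParallelLinear`, `LoRAWrapped`, and `apply_lora` are hypothetical names invented for this example, and the assumption that adapters are attached only to recognized layer types is a simplification of the "fails silently" behaviour described in the linked issue, not a description of the actual implementation.

```python
# Toy illustration only. None of these classes or functions are vLLM APIs.


class PlainLinear:
    """Stand-in for a generic linear layer (hypothetical)."""

    def __init__(self, weight):
        self.weight = weight  # list of rows, shape [out_features][in_features]

    def forward(self, x):
        # y = W x
        return [sum(w * v for w, v in zip(row, x)) for row in self.weight]


class ParallelLinear(PlainLinear):
    """Stand-in for a layer type the toy LoRA manager recognizes (hypothetical)."""


class LoRAWrapped:
    """Adds a low-rank update to a base layer: y = W x + scale * B (A x)."""

    def __init__(self, base, lora_a, lora_b, scale=1.0):
        self.base = base
        self.lora_a = lora_a  # shape [rank][in_features]
        self.lora_b = lora_b  # shape [out_features][rank]
        self.scale = scale

    def forward(self, x):
        base_out = self.base.forward(x)
        ax = [sum(a * v for a, v in zip(row, x)) for row in self.lora_a]
        delta = [sum(b * v for b, v in zip(row, ax)) for row in self.lora_b]
        return [y + self.scale * d for y, d in zip(base_out, delta)]


def apply_lora(modules, adapters):
    """Wrap only recognized layers; return the adapter names that were skipped.

    Assumption for this sketch: unrecognized layers are left untouched rather
    than raising, which is one way an adapter could appear to load yet have
    no effect on the output.
    """
    skipped = []
    for name, (lora_a, lora_b) in adapters.items():
        layer = modules.get(name)
        if isinstance(layer, ParallelLinear):
            modules[name] = LoRAWrapped(layer, lora_a, lora_b)
        else:
            skipped.append(name)
    return skipped
```

In this sketch, a `PlainLinear` registered under `"q_proj"` is reported as skipped and its output is unchanged, while the same weights held in a `ParallelLinear` are wrapped and the output includes the low-rank delta. Returning the skipped names is a design choice of the sketch that makes the mismatch visible instead of silent.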

mergify · 2026-03-19T20:02:51Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @jeejeelee.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

ywang96 · 2026-03-30T04:11:09Z

I don't think this will break the base model but FYI @patrickvonplaten

…xtral, MoE and Granite regressions (#1311)

## Summary

This PR fixes a set of regressions introduced by recent upstream changes and observed in vLLM-Gaudi hourly validation. The branch now includes:

- Pixtral HPUAttention projection path fix,
- MoE dispatch and method override alignment updates for fused MoE and compressed tensors,
- unit test updates to match the new MoE runner API usage,
- fix hybrid model page size alignment for Granite 4.0-H.

## Related upstream PRs that introduced the regressions

- vllm-project/vllm#37234
- vllm-project/vllm#35153
- vllm-project/vllm#36963
- vllm-project/vllm#38960
- vllm-project/vllm#35326
- vllm-project/vllm#37467

---------

Signed-off-by: Paweł Olejniczak <pawelx.olejniczak@intel.com>

Done

cf05283

jeejeelee requested a review from patrickvonplaten as a code owner March 13, 2026 07:54

mergify Bot added the bug label (Something isn't working) Mar 13, 2026

This was referenced Mar 13, 2026

[Bug]: Ministral-3 LoRA adapter fails silently #34591

Closed

Fix LoRA adapter silently failing on Pixtral/Ministral-3 models #34964

Closed

gemini-code-assist Bot reviewed Mar 13, 2026

Comment thread vllm/model_executor/models/pixtral.py

jeejeelee marked this pull request as draft March 13, 2026 07:59

OPT

673b186

jeejeelee marked this pull request as ready for review March 16, 2026 01:52

mergify Bot added the needs-rebase label Mar 19, 2026

Address conflict

03d1e1e

Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>

mergify Bot removed the needs-rebase label Mar 27, 2026

jeejeelee and others added 2 commits March 27, 2026 15:41

Merge branch 'main' into fix-pixtral-lora

019afb3

Merge branch 'main' into fix-pixtral-lora

36d3567

ywang96 added the ready label (ONLY add when PR is ready to merge/full CI is needed) Mar 30, 2026

ywang96 approved these changes Mar 30, 2026

Merge branch 'main' into fix-pixtral-lora

0aa457e

jeejeelee enabled auto-merge (squash) March 30, 2026 04:52

vllm-bot merged commit ac30a83 into main Mar 30, 2026
59 of 61 checks passed

vllm-bot deleted the fix-pixtral-lora branch March 30, 2026 06:59

pawel-olejniczak mentioned this pull request Apr 7, 2026

[FIX_FOR_VLLM_CUSTOM=dd9342e6bc92a52a4674a3e472318c241cb18fe1] Fix Pixtral, MoE and Granite regressions vllm-project/vllm-gaudi#1311

Merged

mtparet pushed a commit to blackfuel-ai/vllm that referenced this pull request Apr 9, 2026

[Bugfix][Model] Fix PixtralForConditionalGeneration LoRA (vllm-projec…

17868f0

…t#36963) Signed-off-by: Jee Jee Li <pandaleefree@gmail.com> Co-authored-by: Roger Wang <hey@rogerw.io>

juliendenize mentioned this pull request Apr 15, 2026

[BUGFIX] Fix Pixtral consolidated format vision weight loading #39916

Merged

5 tasks

vllm-bot merged 6 commits into main from fix-pixtral-lora
3 participants