[BugFix][XPU] fix lora ops bgmv_expand size not match by Liangliang-Ma · Pull Request #39989 · vllm-project/vllm

Liangliang-Ma · 2026-04-16T08:54:06Z

Issued found with tests/entrypoints/openai/speech_to_text/test_translation_validation.py::test_basic_audio_with_lora.
error msg:
(EngineCore pid=25878) ERROR 04-07 16:03:25 [core.py:1108] bgmv_expand(buffer, lora_b_stacked, y, sampler_indices, add_inputs=True) (EngineCore pid=25878) ERROR 04-07 16:03:25 [core.py:1108] File "/opt/venv/lib/python3.12/site-packages/vllm/lora/ops/xpu_ops/lora_ops.py", line 30, in bgmv_expand (EngineCore pid=25878) ERROR 04-07 16:03:25 [core.py:1108] torch.ops._xpu_C.bgmv_expand( (EngineCore pid=25878) ERROR 04-07 16:03:25 [core.py:1108] File "/opt/venv/lib/python3.12/site-packages/torch/ops.py", line 1209, in __call_ (EngineCore pid=25878) ERROR 04-07 16:03:25 [core.py:1108] return self._op(*args, **kwargs) (EngineCore pid=25878) ERROR 04-07 16:03:25 [core.py:1108] ^^^^^^^^^^^^^^^^^^^^^^^^^ (EngineCore pid=25878) ERROR 04-07 16:03:25 [core.py:1108] RuntimeError: lora_b_weights.size(-2) must match slice_size

This pr would fix the size issue.

Signed-off-by: Ma, Liangliang <liangliang.ma@intel.com>

gemini-code-assist

Code Review

This pull request updates the bgmv_expand function in the XPU LoRA operations to handle dimension mismatches between LoRA weights and output tensors. The implementation now correctly handles cases where the weight output dimension is smaller than the output tensor by using bgmv_expand_slice, and truncates weights when the dimension is larger, ensuring robust behavior for scenarios like padded logits. I have no feedback to provide as there were no review comments.

jikunshang

LGTM. cc @chaojun-zhang

…9989) Signed-off-by: Ma, Liangliang <liangliang.ma@intel.com> Co-authored-by: Kunshang Ji <kunshang.ji@intel.com>

…9989) Signed-off-by: Ma, Liangliang <liangliang.ma@intel.com> Co-authored-by: Kunshang Ji <kunshang.ji@intel.com> Signed-off-by: Avinash Singh <avinashsingh.rcoem@gmail.com>

…9989) Signed-off-by: Ma, Liangliang <liangliang.ma@intel.com> Co-authored-by: Kunshang Ji <kunshang.ji@intel.com> Signed-off-by: Adrian <info@zzit.ch>

…9989) Signed-off-by: Ma, Liangliang <liangliang.ma@intel.com> Co-authored-by: Kunshang Ji <kunshang.ji@intel.com>

fix xpu lora

692b56e

Signed-off-by: Ma, Liangliang <liangliang.ma@intel.com>

Liangliang-Ma requested a review from jeejeelee as a code owner April 16, 2026 08:54

Liangliang-Ma changed the title ~~[BugFix][XPU] fix lora ops bgmv_expand dim not match~~ [BugFix][XPU] fix lora ops bgmv_expand size not match Apr 16, 2026

mergify Bot added intel-gpu Related to Intel GPU bug Something isn't working labels Apr 16, 2026

gemini-code-assist Bot reviewed Apr 16, 2026

View reviewed changes

jeejeelee requested a review from jikunshang April 16, 2026 11:06

Merge branch 'main' into mll_fix_1077

35078b9

jikunshang reviewed Apr 17, 2026

View reviewed changes

jikunshang and others added 3 commits April 17, 2026 10:55

Merge branch 'main' into mll_fix_1077

b140f40

Merge branch 'main' into mll_fix_1077

22a2abe

Merge branch 'main' into mll_fix_1077

e00411d

jikunshang added the ready ONLY add when PR is ready to merge/full CI is needed label Apr 19, 2026

Merge branch 'main' into mll_fix_1077

7a87061

jikunshang approved these changes Apr 20, 2026

View reviewed changes

jikunshang merged commit 898beca into vllm-project:main Apr 20, 2026
53 checks passed

bnellnm pushed a commit to neuralmagic/vllm that referenced this pull request Apr 20, 2026

[BugFix][XPU] fix lora ops bgmv_expand size not match (vllm-project#3…

b30c216

…9989) Signed-off-by: Ma, Liangliang <liangliang.ma@intel.com> Co-authored-by: Kunshang Ji <kunshang.ji@intel.com>

mystous pushed a commit to mystous/vllm_hybrid that referenced this pull request May 10, 2026

[BugFix][XPU] fix lora ops bgmv_expand size not match (vllm-project#3…

b52cdee

…9989) Signed-off-by: Ma, Liangliang <liangliang.ma@intel.com> Co-authored-by: Kunshang Ji <kunshang.ji@intel.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[BugFix][XPU] fix lora ops bgmv_expand size not match#39989

[BugFix][XPU] fix lora ops bgmv_expand size not match#39989
jikunshang merged 6 commits intovllm-project:mainfrom
Liangliang-Ma:mll_fix_1077

Liangliang-Ma commented Apr 16, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

jikunshang left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

Liangliang-Ma commented Apr 16, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

jikunshang left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants