[Model Runner V2] Fix inputs_embeds=None bug for MM models by WoosukKwon · Pull Request #35917 · vllm-project/vllm

WoosukKwon · 2026-03-03T21:24:46Z

This PR fixes a bug that VLMs see blank images when CUDA graph is enabled (while the model works fine with eager mode). The bug is actually caused by incorrect compilation; We didn't provide inputs_embeds for the initial run.

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>

gemini-code-assist

Code Review

This pull request correctly fixes a bug for multi-modal (MM) models where inputs_embeds was None during dummy runs, such as for profiling or CUDA graph capture. The change ensures that get_mm_embeddings is always called for MM models, providing a consistent tensor input for inputs_embeds in both dummy and real runs. This prevents issues with CUDA graph compilation, which expects consistent input shapes and types. The change is sound and I have no further comments.

…ect#35917) Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>

[Model Runner V2] Fix inputs_embeds=None bug for MM models

ccd2d34

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>

WoosukKwon requested a review from njhill as a code owner March 3, 2026 21:24

mergify Bot added v1 bug Something isn't working labels Mar 3, 2026

gemini-code-assist Bot reviewed Mar 3, 2026

View reviewed changes

njhill approved these changes Mar 3, 2026

View reviewed changes

WoosukKwon merged commit 467886a into main Mar 3, 2026
11 of 12 checks passed

WoosukKwon deleted the woosuk/mrv2-fix-vlm branch March 3, 2026 21:47

Copilot AI pushed a commit to machov/vllm that referenced this pull request Mar 10, 2026

[Model Runner V2] Fix inputs_embeds=None bug for MM models (vllm-proj…

33033a5

…ect#35917) Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>

avinashsingh77 pushed a commit to avinashsingh77/vllm that referenced this pull request Mar 12, 2026

[Model Runner V2] Fix inputs_embeds=None bug for MM models (vllm-proj…

3047dd3

…ect#35917) Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>

wendyliu235 pushed a commit to wendyliu235/vllm-public that referenced this pull request Mar 18, 2026

[Model Runner V2] Fix inputs_embeds=None bug for MM models (vllm-proj…

4696e5d

…ect#35917) Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Model Runner V2] Fix inputs_embeds=None bug for MM models#35917

[Model Runner V2] Fix inputs_embeds=None bug for MM models#35917
WoosukKwon merged 1 commit intomainfrom
woosuk/mrv2-fix-vlm

WoosukKwon commented Mar 3, 2026 •

edited

Loading

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

WoosukKwon commented Mar 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

WoosukKwon commented Mar 3, 2026 •

edited

Loading