Skip to content

[Model Runner V2] Fix inputs_embeds=None bug for MM models#35917

Merged
WoosukKwon merged 1 commit intomainfrom
woosuk/mrv2-fix-vlm
Mar 3, 2026
Merged

[Model Runner V2] Fix inputs_embeds=None bug for MM models#35917
WoosukKwon merged 1 commit intomainfrom
woosuk/mrv2-fix-vlm

Conversation

@WoosukKwon
Copy link
Copy Markdown
Collaborator

@WoosukKwon WoosukKwon commented Mar 3, 2026

This PR fixes a bug that VLMs see blank images when CUDA graph is enabled (while the model works fine with eager mode). The bug is actually caused by incorrect compilation; We didn't provide inputs_embeds for the initial run.

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>
@WoosukKwon WoosukKwon requested a review from njhill as a code owner March 3, 2026 21:24
@mergify mergify Bot added v1 bug Something isn't working labels Mar 3, 2026
Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request correctly fixes a bug for multi-modal (MM) models where inputs_embeds was None during dummy runs, such as for profiling or CUDA graph capture. The change ensures that get_mm_embeddings is always called for MM models, providing a consistent tensor input for inputs_embeds in both dummy and real runs. This prevents issues with CUDA graph compilation, which expects consistent input shapes and types. The change is sound and I have no further comments.

@WoosukKwon WoosukKwon merged commit 467886a into main Mar 3, 2026
11 of 12 checks passed
@WoosukKwon WoosukKwon deleted the woosuk/mrv2-fix-vlm branch March 3, 2026 21:47
Copilot AI pushed a commit to machov/vllm that referenced this pull request Mar 10, 2026
avinashsingh77 pushed a commit to avinashsingh77/vllm that referenced this pull request Mar 12, 2026
wendyliu235 pushed a commit to wendyliu235/vllm-public that referenced this pull request Mar 18, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

bug Something isn't working v1

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants