Make Gemma and Gemma 2 accept `inputs_embeds` like Gemma 3 by hmellor · Pull Request #36787 · vllm-project/vllm

hmellor · 2026-03-11T13:04:41Z

Gemma3Model.forward accepts pre-scaled inputs_embeds which are scaled by Gemma3Model.embed_input_ids.

Before this PR GemmaModel and Gemma2Model did the scaling inside forward.

After this PR the scaling for the earlier Gemma variants happens in embed_input_ids. This is consistent with:

Gemma3
Gemma and Gemma 2 in Transformers after huggingface/transformers@0e7cb4e#diff-ac7f118ce5e3a16be1467acd5245dba49ed1cbb115082351ef21b256c84318ba (which should be released in v5.4.0)

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

DarkLight1337 · 2026-03-11T13:10:32Z

Btw there are some other models that do a similar thing as well. How should we handle them?

hmellor · 2026-03-11T13:21:18Z

This was only really needed for the basic correctness test because it uses HF to generate the embeds and then passes them to vLLM. So the change in behaviour on the HF side was a problem.

For the rest of vLLM where we use vLLM to generate the embeds this should be a non-issue.

gemini-code-assist

Code Review

This pull request refactors the embedding scaling for Gemma and Gemma 2 models to align with Gemma 3 and recent changes in the transformers library. The scaling logic is correctly moved from the forward method to embed_input_ids. The accompanying test changes are also appropriate, but they contain an incorrect version string for transformers which should be corrected for accuracy and maintainability.

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

…ect#36787) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Make Gemma 1/2 accept inputs_embeds like Gemma 3

d676303

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

hmellor mentioned this pull request Mar 11, 2026

Update to transformers v5 #30566

Merged

DarkLight1337 approved these changes Mar 11, 2026

View reviewed changes

DarkLight1337 enabled auto-merge (squash) March 11, 2026 13:23

DarkLight1337 added the ready ONLY add when PR is ready to merge/full CI is needed label Mar 11, 2026

Merge branch 'main' into fix-gemma

56de8d5

gemini-code-assist bot reviewed Mar 11, 2026

View reviewed changes

Comment thread tests/basic_correctness/test_basic_correctness.py

Fix other test where HF embeds are given to vLLM

8a96a61

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

hmellor requested a review from ywang96 as a code owner March 11, 2026 15:12

DarkLight1337 merged commit 65986db into vllm-project:main Mar 11, 2026
53 checks passed

hmellor deleted the fix-gemma branch March 12, 2026 09:28

wendyliu235 pushed a commit to wendyliu235/vllm-public that referenced this pull request Mar 18, 2026

Make Gemma and Gemma 2 accept inputs_embeds like Gemma 3 (vllm-proj…

91c16d0

…ect#36787) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

khairulkabir1661 pushed a commit to khairulkabir1661/vllm that referenced this pull request Mar 27, 2026

Make Gemma and Gemma 2 accept inputs_embeds like Gemma 3 (vllm-proj…

61ef2d0

…ect#36787) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

mtparet pushed a commit to blackfuel-ai/vllm that referenced this pull request Apr 9, 2026

Make Gemma and Gemma 2 accept inputs_embeds like Gemma 3 (vllm-proj…

089cd86

…ect#36787) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Make Gemma and Gemma 2 accept `inputs_embeds` like Gemma 3#36787

Make Gemma and Gemma 2 accept `inputs_embeds` like Gemma 3#36787
DarkLight1337 merged 3 commits intovllm-project:mainfrom
hmellor:fix-gemma

hmellor commented Mar 11, 2026 •

edited by github-actions bot

Loading

Uh oh!

DarkLight1337 commented Mar 11, 2026

Uh oh!

hmellor commented Mar 11, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

hmellor commented Mar 11, 2026 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

DarkLight1337 commented Mar 11, 2026

Uh oh!

hmellor commented Mar 11, 2026

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

hmellor commented Mar 11, 2026 •

edited by github-actions bot

Loading