
Conversation

@gante (Contributor) commented Feb 11, 2023

What does this PR do?

Fixes #21578 (RuntimeError when running batched inference for Salesforce/blip2-opt-2.7b) and addresses the concerns raised in #21575

Context

Support for .generate() from inputs_embeds with selected decoder-only models was added recently (#21405). This feature enables a .generate(inputs_embeds=inputs_embeds) call, i.e. without passing input_ids.

Under the hood, this call strategy implies that .generate() is in charge of creating a) input_ids for later use (to concatenate the generated tokens to) and b) the corresponding attention_mask. This automated creation was not working properly for batch_size > 1.
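For illustration, a minimal sketch of the kind of batched call this PR fixes; the GPT-2 checkpoint and the random embeddings are placeholders, not taken from the PR's tests:

```python
import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("gpt2")

# Dummy embeddings purely to illustrate the shapes: (batch_size, seq_len, hidden_size)
inputs_embeds = torch.randn(2, 5, model.config.hidden_size)

# Neither input_ids nor attention_mask is passed, so .generate() has to create both
# internally; this is the step that previously failed for batch_size > 1
generated = model.generate(inputs_embeds=inputs_embeds, max_new_tokens=10)
```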

Changes

The changes in the PR respect the following desiderata (which required moving a few things around):

  1. The attention_mask can be automatically inferred (as all ones) regardless of the shape of inputs_embeds;
  2. When inputs_embeds is passed and input_ids is not, the automatically created input_ids has a sequence length of 1. This is particularly relevant for BLIP, as we don't want input_ids to start with the embeddings' sequence length (see the sketch after this list).
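To make these two points concrete, here is a simplified sketch of the initialization behavior they describe; initialize_generate_inputs is a hypothetical helper for illustration, not the actual .generate() internals:

```python
import torch

def initialize_generate_inputs(inputs_embeds: torch.Tensor, bos_token_id: int):
    """Hypothetical helper mirroring the two desiderata (not the real transformers code)."""
    batch_size, seq_len, _ = inputs_embeds.shape
    # 1. attention_mask inferred as all ones, matching the embeddings' batch/sequence dims
    attention_mask = torch.ones((batch_size, seq_len), dtype=torch.long)
    # 2. the automatically created input_ids has sequence length 1 (not seq_len), so
    #    generated tokens get concatenated to a single start token instead of to a
    #    sequence as long as the embeddings (the BLIP concern above)
    input_ids = torch.full((batch_size, 1), bos_token_id, dtype=torch.long)
    return input_ids, attention_mask
```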

This PR also adds/enhances tests, to ensure we don't regress on this capability.

⚠️ If approved, I will make the corresponding TF changes before merging.

@gante requested a review from @sgugger on February 11, 2023 at 16:51
@gante (Contributor, Author) commented Feb 11, 2023

cc @dimitry12 this fixes the error you reported in #21575 (GPT2 was throwing the same error as GPTJ, and it gets fixed here)

cc @NielsRogge please don't forget to add batched tests on models with generation capabilities; generate + batching is surprisingly tricky 🙏

@HuggingFaceDocBuilderDev commented Feb 11, 2023

The documentation is not available anymore as the PR was closed or merged.

@gante mentioned this pull request Feb 13, 2023
@sgugger (Collaborator) left a comment

Thanks for your work on this!

@sgugger (Collaborator) commented Feb 13, 2023

But it does seem like it breaks a lot of tests 😅

@gante (Contributor, Author) commented Feb 13, 2023

(Merging -- the failing CI test is a known failure)

@gante merged commit fa4bdb0 into huggingface:main on Feb 13, 2023
@gante deleted the batched_default_input_ids_decoder_only branch on February 13, 2023 at 17:04
@gante (Contributor, Author) commented Feb 13, 2023

@dimitry12 lmk if you see the error when using GPT-J :)

@dimitry12 (Contributor) commented

> @dimitry12 lmk if you see the error when using GPT-J :)

@gante GPT-J generation from inputs_embeds alone, without dummy input_ids, now works without errors. Thank you!
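For reference, a minimal sketch of the call that now succeeds; the checkpoint name and shapes below are illustrative only:

```python
import torch
from transformers import AutoModelForCausalLM

# Any GPT-J checkpoint behaves the same; this one is just for illustration
model = AutoModelForCausalLM.from_pretrained("EleutherAI/gpt-j-6B")
inputs_embeds = torch.randn(2, 5, model.config.hidden_size)  # batched, no input_ids
out = model.generate(inputs_embeds=inputs_embeds, max_new_tokens=5)
```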

@ArthurZucker mentioned this pull request Feb 24, 2023