IDEFICS: support inputs embeds #34043
Conversation
```python
# generating the first new token or not, and we only want to use the embeddings for the first new token)
if not self.config.is_encoder_decoder and model_input_name == "inputs_embeds":
    model_kwargs["use_cache"] = True
    generation_config.use_cache = True
```
The subsequent cache-preparation step checks `generation_config`, so without this we miss the step that adds a `Cache` class. This fails only for IDEFICS because it doesn't prepare the cache in `forward` when `use_cache` is set, but instead assumes that a correct cache is already provided.
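To make the failure mode concrete, here is a minimal sketch of the cache-preparation gate this refers to; the helper name and structure are illustrative, not the exact private code in `transformers`:

```python
from transformers import DynamicCache

def prepare_cache_sketch(generation_config, model_kwargs):
    # Illustrative reduction of the cache-preparation step inside generate():
    # the gate reads generation_config.use_cache, not model_kwargs["use_cache"].
    if generation_config.use_cache and "past_key_values" not in model_kwargs:
        model_kwargs["past_key_values"] = DynamicCache()
    # If only model_kwargs["use_cache"] were flipped to True, the branch above
    # would be skipped. Most models then build a cache lazily in forward(),
    # but IDEFICS assumes a ready cache object and fails.
    return model_kwargs
```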
This change updates `generation_config`, which means we would have two variables containing the source of truth for this flag (`generation_config.use_cache` and `model_kwargs["use_cache"]`). More than one source of truth usually leads to bugs 🐛 We also know that we need `model_kwargs["use_cache"]` for the forward pass.
To avoid multiple sources of truth, I'd suggest one of the following:
- Let's use `model_kwargs["use_cache"]` in all places after this if/else, instead of also using `generation_config.use_cache`.
- (I have a preference for this one) Let's use `generation_config.use_cache` everywhere, removing `model_kwargs["use_cache"]` from most places in the `generate` function. Before calling the decoding methods, let's add `model_kwargs["use_cache"] = generation_config.use_cache`, since we need this for the forward pass and `generation_config` barely gets used in the decoding methods (sketched after this comment).
(I know this issue predates your change 🤗 But since we're touching it, let's do the most sustainable change)
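A minimal sketch of the preferred option; the function name and structure are illustrative, since the real `generate` is much larger:

```python
def use_cache_single_source_sketch(model, generation_config, model_kwargs, model_input_name):
    # generation_config.use_cache is the single source of truth throughout generate().
    if not model.config.is_encoder_decoder and model_input_name == "inputs_embeds":
        generation_config.use_cache = True  # the only flag we flip

    # ... all later checks in generate() read generation_config.use_cache ...

    # One-time sync just before dispatching to the decoding methods, because
    # forward() consumes use_cache from model_kwargs:
    model_kwargs["use_cache"] = generation_config.use_cache
    return model_kwargs
```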
ArthurZucker left a comment
It's always the same models causing problems 👁️ @leot13 👁️
I am trying to make sure that the IDEFICS models start using library standards; hopefully, after we enable generation tests, it will be easier to catch these bugs when adding a model 🤗
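For reference, here is a hedged sketch of the kind of equivalence check such a generation test performs, modeled loosely on the library's generation test mixin; the function name and token counts are illustrative:

```python
import torch

def check_generate_from_inputs_embeds(model, input_ids, attention_mask):
    model.eval()
    with torch.no_grad():
        # Reference path: greedy generation from token ids.
        ids_out = model.generate(
            input_ids=input_ids, attention_mask=attention_mask, max_new_tokens=5
        )
        # Path under test: greedy generation from the same inputs as embeddings.
        embeds = model.get_input_embeddings()(input_ids)
        embeds_out = model.generate(
            inputs_embeds=embeds, attention_mask=attention_mask, max_new_tokens=5
        )
    # Generating from embeddings returns only the new tokens, so compare
    # against the continuation of the input_ids output.
    assert torch.equal(ids_out[:, input_ids.shape[1]:], embeds_out)
```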
gante left a comment
LGTM, except for that nit in the generate body :)
Thank you for fixing 💪
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available for 30 days after the last update.
Commits:
- support embeds
- use cache from config
- style...
- fix tests after rebase
What does this PR do?
Fixes #34033 and enables generation tests for VLMs. Previously, these tests were all skipped because we had a hard check for the CausalLM mapping.
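As a usage sketch, generation from `inputs_embeds` with IDEFICS now works along these lines; the checkpoint name and processor call are assumptions, so adjust them to your setup:

```python
import torch
from transformers import AutoProcessor, IdeficsForVisionText2Text

checkpoint = "HuggingFaceM4/idefics-9b"  # assumed public checkpoint
model = IdeficsForVisionText2Text.from_pretrained(checkpoint, torch_dtype=torch.bfloat16)
processor = AutoProcessor.from_pretrained(checkpoint)

inputs = processor(["A short story about a cat:"], return_tensors="pt")

# Embed the prompt manually and pass embeddings instead of token ids —
# the path this PR enables for IDEFICS.
inputs_embeds = model.get_input_embeddings()(inputs["input_ids"])
generated = model.generate(
    inputs_embeds=inputs_embeds,
    attention_mask=inputs["attention_mask"],
    max_new_tokens=10,
)
print(processor.batch_decode(generated, skip_special_tokens=True))
```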