Conversation

@Andrechang
Contributor

What does this PR do?

Allows using decoder_inputs_embeds with model.generate in VisionEncoderDecoderModel.

Who can review?

Vision Model
@amyeroberts

@HuggingFaceDocBuilderDev

HuggingFaceDocBuilderDev commented Feb 16, 2023

The documentation is not available anymore as the PR was closed or merged.

@amyeroberts
Contributor

cc @gante

Comment on lines +2244 to +2247
# add next_tokens to inputs_embeds if using embeds to generate
if model_kwargs.get("decoder_inputs_embeds") is not None:
    next_tokens_embed = self.decoder.get_input_embeddings()(next_tokens) * self.decoder.model.decoder.embed_scale
    model_kwargs["decoder_inputs_embeds"] = torch.cat(
        [model_kwargs["decoder_inputs_embeds"], next_tokens_embed.unsqueeze(1)], dim=-2
    )
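The diff above grows decoder_inputs_embeds by one position per decoding step: it embeds the newly sampled token and concatenates it along the sequence dimension. A minimal pure-Python stand-in for that step (nested lists instead of torch tensors; the function name is invented for illustration, not part of transformers):

```python
def append_next_token_embed(decoder_inputs_embeds, next_token_embeds):
    """Append one new token embedding to each batch item's sequence.

    decoder_inputs_embeds: per batch item, a list of token embedding vectors
                           (stand-in for a [batch, seq_len, hidden] tensor)
    next_token_embeds:     one embedding vector per batch item (stand-in for
                           the unsqueeze(1) + torch.cat(..., dim=-2) above)
    """
    return [seq + [emb] for seq, emb in zip(decoder_inputs_embeds, next_token_embeds)]


# one batch item, two tokens already embedded, hidden size 3
embeds = [[[0.1, 0.2, 0.3], [0.4, 0.5, 0.6]]]
new_embed = [[0.7, 0.8, 0.9]]
grown = append_next_token_embed(embeds, new_embed)
print(len(grown[0]))  # → 3
```

In the real PR the same growth happens on GPU tensors inside the sampling loop, which is exactly why the reviewer objects below: it adds per-model logic to the shared body of generate.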
Contributor


I will not accept these changes to the body of generate :)

There are multiple reasons for this decision:
a) A similar proposal is being tracked here. It will only be added for consideration after it raises sufficient interest, as described in the link;
b) We want to avoid adding more logic to generate itself unless it is a widely requested feature or it can be added as part of the model itself (e.g. in prepare_inputs_for_generation) / in a self-contained class (like the LogitsProcessors);
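The extension point named in (b), prepare_inputs_for_generation, lets a model decide per step what generate should feed it. A toy sketch of the usual pattern (plain lists and dicts, invented names, not the real transformers signature): use the caller-supplied embeddings only on the first step, and fall back to the newest token id once a cache exists.

```python
def prepare_inputs_for_generation(input_ids, past_key_values=None,
                                  decoder_inputs_embeds=None, **kwargs):
    """Toy stand-in for the per-model hook the reviewer describes.

    First step (no cache yet): feed the caller-supplied embeddings.
    Later steps: feed only the id of the most recently generated token.
    """
    if decoder_inputs_embeds is not None and past_key_values is None:
        return {"decoder_inputs_embeds": decoder_inputs_embeds,
                "decoder_input_ids": None}
    if past_key_values is not None:
        input_ids = input_ids[-1:]  # only the newest token once a cache exists
    return {"decoder_inputs_embeds": None, "decoder_input_ids": input_ids}


first = prepare_inputs_for_generation([5], decoder_inputs_embeds=[[0.1, 0.2]])
later = prepare_inputs_for_generation([5, 7, 9], past_key_values="cache",
                                      decoder_inputs_embeds=[[0.1, 0.2]])
print(first["decoder_inputs_embeds"], later["decoder_input_ids"])
```

Because generate already calls this hook every step, the switch lives in the model and the shared decoding loop stays untouched.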

Contributor Author


Thank you for the review.

if kwargs.get("decoder_inputs_embeds") is not None:
decoder_inputs["input_ids"] = None
decoder_inputs_embeds = self.decoder.prepare_inputs_for_generation(kwargs.get("decoder_inputs_embeds"), past_key_values=past_key_values)
decoder_inputs_embeds = decoder_inputs_embeds["input_ids"]
Contributor


This does not follow the reference implementation; see here for an example.

@Andrechang Andrechang closed this Feb 17, 2023
@YiandLi

YiandLi commented Jun 18, 2023

What about encoder-decoder models like T5?

I tried to override only the prepare_inputs_for_generation method, guided by #6535, but it does not work.



class CustomT5ForConditionalGeneration(T5ForConditionalGeneration):

    def prepare_inputs_for_generation(self,
                                      input_ids,
                                      past_key_values=None,
                                      attention_mask=None,
                                      head_mask=None,
                                      decoder_head_mask=None,
                                      cross_attn_head_mask=None,
                                      use_cache=None,
                                      encoder_outputs=None,
                                      **kwargs):
        # pass everything by keyword so the arguments cannot be
        # misaligned against the parent signature
        res = super().prepare_inputs_for_generation(input_ids,
                                                    past_key_values=past_key_values,
                                                    attention_mask=attention_mask,
                                                    head_mask=head_mask,
                                                    decoder_head_mask=decoder_head_mask,
                                                    cross_attn_head_mask=cross_attn_head_mask,
                                                    use_cache=use_cache,
                                                    encoder_outputs=encoder_outputs,
                                                    **kwargs)
        # maybe another solution: https://github.com/huggingface/transformers/pull/21671

        # forward decoder embeddings and mask if the caller provided them
        if "decoder_inputs_embeds" in kwargs:
            res["decoder_inputs_embeds"] = kwargs["decoder_inputs_embeds"]
        if "decoder_attention_mask" in kwargs:
            res["decoder_attention_mask"] = kwargs["decoder_attention_mask"]

        # if `decoder_inputs_embeds` are passed, we only want to use them
        # in the 1st generation step
        if past_key_values is None:
            res.pop("decoder_input_ids", None)
        else:
            # only the last token id once the cache (`past`) is defined;
            # pop with a default to avoid a KeyError when no embeddings were passed
            res["decoder_input_ids"] = res["decoder_input_ids"][:, -1].unsqueeze(-1)
            res.pop("decoder_inputs_embeds", None)

        return res

@gante
Contributor

gante commented Jun 19, 2023

Hey @YiandLi 👋

My suggestion would be to open a separate issue for the support of a decoder_inputs_embeds input, like #6535, so the issue becomes clear and visible to everyone. Like in #6535, I'd be happy to a) share a temporary solution, or b) push a permanent solution if the issue acquires sufficient traction.

Normally, I would not provide support for custom tasks, as my bandwidth is very limited, but according to this closed PR you are not the first person asking the question :)
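The pattern the thread keeps circling can be summarized in a toy greedy loop (pure Python, every name invented; a real solution would live in prepare_inputs_for_generation, as suggested above): the first decoder step consumes the supplied embeddings, and every later step consumes only the newest token id while the cache stands in for past_key_values.

```python
def toy_decoder(step_input, cache):
    """Pretend decoder: stores its input in the cache and emits the
    token id (len(cache)) as the 'argmax' of its fake logits."""
    cache.append(step_input)
    return len(cache)


def toy_generate(decoder_inputs_embeds, max_new_tokens=3):
    """Greedy loop: embeddings on step 1, last token id afterwards."""
    cache = []           # stand-in for past_key_values
    tokens = []
    # step 1: feed the caller-supplied embeddings
    tokens.append(toy_decoder(decoder_inputs_embeds, cache))
    # later steps: feed only the most recently generated token id
    for _ in range(max_new_tokens - 1):
        tokens.append(toy_decoder(tokens[-1], cache))
    return tokens


print(toy_generate([[0.1, 0.2]]))  # → [1, 2, 3]
```

This is only the control flow; a real implementation also has to embed each new token (as the closed PR's diff did) or let the model's forward accept ids after the first step.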
