
Conversation

@ArthurZucker
Collaborator

What does this PR do?

This adds the same support that we have in PretrainedConfig, where additional kwargs are automatically set on the config.
This will allow users to re-use the GenerationConfig class for most use cases without having to add a model-specific class. I was trying to load the following generation_config and got half of my additional arguments deleted 😉
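
For illustration, a minimal sketch of the intended behavior, where leftover kwargs are kept as attributes instead of being dropped (the class below is a toy for this thread, not the actual GenerationConfig implementation):

# Toy sketch (assumption, not the real transformers code): unknown kwargs are
# stored as attributes, mirroring the behavior PretrainedConfig already has.
class ToyGenerationConfig:
    def __init__(self, **kwargs):
        # a couple of known generation arguments with defaults
        self.max_new_tokens = kwargs.pop("max_new_tokens", None)
        self.do_sample = kwargs.pop("do_sample", False)
        # whatever is left is kept rather than silently discarded
        for key, value in kwargs.items():
            setattr(self, key, value)


config = ToyGenerationConfig(do_sample=True, my_custom_penalty=0.3)
print(config.my_custom_penalty)  # 0.3 -- the additional argument survives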

@ArthurZucker ArthurZucker requested review from gante and sgugger and removed request for gante January 23, 2023 20:50
Contributor

@gante gante left a comment

We can probably remove self.generation_kwargs = kwargs.pop("generation_kwargs", {}) on L277; it was intended as a restricted version of these changes

(for context, the model config has the same lines here)
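
For the comparison, a rough sketch of the two patterns (simplified; not the exact code on L277 or in the model config):

# Restricted pattern: extra generation options are collected in a single dict.
class BucketConfig:
    def __init__(self, **kwargs):
        self.generation_kwargs = kwargs.pop("generation_kwargs", {})


# General pattern (what this PR moves toward): each leftover kwarg becomes an
# attribute of the config itself.
class AttributeConfig:
    def __init__(self, **kwargs):
        for key, value in kwargs.items():
            setattr(self, key, value)


print(BucketConfig(generation_kwargs={"foo": 1}).generation_kwargs)  # {'foo': 1}
print(AttributeConfig(foo=1).foo)  # 1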

@HuggingFaceDocBuilderDev

HuggingFaceDocBuilderDev commented Jan 23, 2023

The documentation is not available anymore as the PR was closed or merged.

Collaborator

@sgugger sgugger left a comment

Thanks for adding this!

@ArthurZucker
Collaborator Author

Also will have to add a test + this is apparently breaking a lot of things haha

@ArthurZucker
Collaborator Author

ArthurZucker commented Jan 24, 2023

Okay, after talking a bit with @gante and testing, this is not the best approach, so this PR will focus on other missing functionalities, mostly the addition of the dict_torch_dtype_to_str function, as a dtype could be passed to the generation config 😉
The problem is mostly that if we process all the additional kwargs, we pick up all of the arguments from the model's configuration.json, which mixes things up.
The simplest solution is either to store them in generate_kwargs or to re-write the configuration for the model. I thought this was cumbersome, but it is actually the most logical and cleanest way to do it.

EDIT: going to just add a condition: if the kwargs come from a config file, they are not added.
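
To make the EDIT concrete, a hedged sketch of the guard and of the dtype-to-string conversion described above (names and details are simplified assumptions, not the exact code in this PR):

import torch


class ToyGenerationConfig:
    def __init__(self, **kwargs):
        # True when the config was built from a model's config.json, which
        # carries many unrelated model arguments we do not want to absorb.
        self._from_model_config = kwargs.pop("_from_model_config", False)
        self.do_sample = kwargs.pop("do_sample", False)
        if not self._from_model_config:
            # only keep extra kwargs that were passed explicitly by the user
            for key, value in kwargs.items():
                setattr(self, key, value)

    def dict_torch_dtype_to_str(self, d):
        # serialize a torch dtype as a plain string, e.g. torch.float16 -> "float16",
        # so the dict can be written back to JSON
        if d.get("torch_dtype") is not None and not isinstance(d["torch_dtype"], str):
            d["torch_dtype"] = str(d["torch_dtype"]).split(".")[1]


cfg = ToyGenerationConfig(do_sample=True, custom_arg=1)
d = {"torch_dtype": torch.float16}
cfg.dict_torch_dtype_to_str(d)
print(d["torch_dtype"])  # "float16"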

@ArthurZucker
Collaborator Author

Now the only thing left is to add a pretty test with all the different edge cases I encountered.
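
For instance, a sketch of the kind of check such a test could make (the test name is hypothetical, and this assumes a transformers version that already includes this change):

from transformers import GenerationConfig


def test_extra_kwargs_are_kept():
    # an unknown kwarg should survive as an attribute instead of being dropped
    config = GenerationConfig(do_sample=True, my_custom_penalty=0.3)
    assert config.my_custom_penalty == 0.3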

# remove all the arguments that are in the config_dict

config = cls(**config_dict, **kwargs)
unused_kwargs = config.update(**kwargs)
Contributor

This line now only exists to obtain unused_kwargs, as the kwargs get written to the config in the line above, correct?

Collaborator Author

Well yes, for example when _from_model_config is set to True, it is still in the kwargs; I think I saw something like this.
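
As an aside, a minimal sketch of how an update method can hand back the kwargs it did not consume (illustrative only; simplified compared to the real config.update):

class ToyConfig:
    def __init__(self):
        self.do_sample = False
        self.max_new_tokens = None

    def update(self, **kwargs):
        # apply kwargs that match existing attributes; return the rest untouched
        unused = {}
        for key, value in kwargs.items():
            if hasattr(self, key):
                setattr(self, key, value)
            else:
                unused[key] = value
        return unused


config = ToyConfig()
print(config.update(do_sample=True, some_internal_flag=True))  # {'some_internal_flag': True}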

self.transformers_version = kwargs.pop("transformers_version", __version__)

# Additional attributes without default values
if not self._from_model_config:
Contributor

I'd add a comment here explaining why we need this if, otherwise we may be like "wtf?" in the future

Collaborator Author

Sure, thanks for the comment!

ArthurZucker and others added 2 commits January 24, 2023 17:22
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
@ArthurZucker ArthurZucker requested a review from sgugger January 24, 2023 17:10
@ArthurZucker ArthurZucker merged commit 94a7edd into huggingface:main Jan 24, 2023