Add unpack_inputs decorator for ctrl #16242
Conversation
The documentation is not available anymore as the PR was closed or merged.
gante left a comment
Thank you for the contribution 🔥 Have a look at my comment, as it is an important thing for us at Hugging Face.
Other than that, ready to merge 🚀
self,
input_ids=None,
past=None,
past_key_values=None,
This change is well-intentioned and technically correct, but I'm going to ask you to revert past_key_values to past. It changes the public interface of the model, which may disrupt downstream users :)
I see and understand. The only problem is that without past_key_values in the parameters, the tests will fail. Can they both (past and past_key_values) be in the parameter list?
And what about the other places where I changed it, like line 570, should that be reverted too?
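As a purely hypothetical sketch of the "keep both names" idea raised here (not what the merged PR ends up doing, which renames past to past_key_values outright), a model could accept both keywords and fold the deprecated one into the new one:

```python
# Hypothetical sketch only: accept both `past` and `past_key_values` in the
# signature and map the old name onto the new one with a deprecation warning.
# Not the approach the merged PR takes.
import warnings


def call(input_ids=None, past_key_values=None, past=None, **kwargs):
    if past is not None:
        warnings.warn(
            "The `past` argument is deprecated, use `past_key_values` instead.",
            FutureWarning,
        )
        # Only fall back to `past` when `past_key_values` was not given explicitly.
        past_key_values = past_key_values if past_key_values is not None else past
    return input_ids, past_key_values
```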
Oh yeah, my comment applies to all instances of past that got replaced with past_key_values. Does the problem persist after replacing them all?
Yes, the problem arose when I first replaced all the input_processing-related code.
It throws a ValueError from the input_processing method (line 436 in modeling_tf_utils.py), because past_key_values remains in kwargs_call. That is because past is not among the kwargs passed to the input_processing function (I did some debugging there).
From what I could reconstruct, past_key_values ends up in kwargs_call in the run_call_with_unpacked_inputs method because it is not initially in the signature of the functions, so my first idea for a fix was to put it there.
Then I went a bit overboard and replaced all the past variables ;)
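To make the failure mode concrete, here is a simplified, hypothetical sketch of the mechanism described above. The helper below is illustrative only, not the actual input_processing implementation in modeling_tf_utils.py: any keyword argument that the wrapped call signature does not declare ends up in a leftover kwargs_call-style dict and gets rejected.

```python
# Simplified illustration of the reported ValueError, under the assumption that
# leftover kwargs not present in the call signature are collected and rejected.
# This is NOT the real input_processing code, just a sketch of the mechanism.
import inspect


def process_inputs(func, **kwargs):
    known = set(inspect.signature(func).parameters)
    # Anything the signature does not declare is left over.
    kwargs_call = {k: v for k, v in kwargs.items() if k not in known}
    if kwargs_call:
        # The real method raises a similar error when unexpected kwargs remain.
        raise ValueError(f"Got unexpected keyword arguments: {list(kwargs_call)}")
    return {k: v for k, v in kwargs.items() if k in known}


def call(input_ids=None, past=None):  # signature still declares `past`
    return input_ids, past


# generate() now passes `past_key_values`, which the signature does not declare:
process_inputs(call, input_ids=[[1, 2]], past_key_values=None)  # raises ValueError
```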
Okay, I see what's going on! It's actually related to a recent refactor we are doing on the TF side -- bringing our TF generate() up to speed with FLAX/PyTorch. In that PR, in the prepare_inputs_for_generation() function, the output dictionary key was updated from past to past_key_values. The output of that function is then fed to the model, explaining the issue you see. This is a great example of the problem of changing interfaces.
The new planned interface for generate() does rely on past_key_values, not on past, although most models don't use it as an explicit keyword argument. Normally, some sort of deprecation warning should be added, but since this argument is mostly for internal use (through the public generate()), there should be no need. I will take responsibility if a few users complain :)
Thank you so much for having the patience to explain the rationale behind your change, it helped me understand the issue faster 💪
(Although now I'm curious -- how did the model not break before? 🤔 )
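As a simplified illustration of the mismatch described in this comment (hypothetical names, not the real transformers code): generation builds a dict of model inputs, and renaming one of its keys breaks any model whose call signature still expects the old name.

```python
# Minimal, hypothetical illustration of the key-rename mismatch; not the actual
# transformers implementation.
def prepare_inputs_for_generation(input_ids, cache=None):
    # After the refactor, the cache is returned under "past_key_values" ...
    return {"input_ids": input_ids, "past_key_values": cache}


def model_call(input_ids=None, past=None):  # ... but the model still expects `past`
    return input_ids, past


inputs = prepare_inputs_for_generation([[1, 2, 3]])
model_call(**inputs)  # TypeError: unexpected keyword argument 'past_key_values'
```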
Yeah, I also wondered why the tests didn't break before 🤷‍♂️
I've added a todo for me to ensure we add a new test :)
Going to run the slow tests locally, to confirm they pass
Can confirm that they pass, merging 👍
* add unpack_inputs decorator for ctrl
* replace "past" with "past_key_values"
Co-authored-by: Johannes Kolbe <johannes.kolbe@tech.better.team>
What does this PR do?
Add the unpack_inputs decorator for ctrl. It also replaces the past parameter in model input and output with past_key_values, as there was an irregularity in the naming that caused an error with the new input processing.
Fixes #16051
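For context, here is a rough sketch of what a decorator in the spirit of unpack_inputs does, assuming its role is to normalize positional and keyword inputs into explicit keyword arguments before call() runs. This is illustrative only, not the actual transformers implementation.

```python
# Hypothetical sketch of an unpack_inputs-style decorator: bind whatever the
# caller passed against the real signature and forward everything as explicit
# keyword arguments with defaults filled in. Not the real transformers code.
import functools
import inspect


def unpack_inputs_sketch(func):
    @functools.wraps(func)
    def wrapper(self, *args, **kwargs):
        bound = inspect.signature(func).bind(self, *args, **kwargs)
        bound.apply_defaults()
        return func(**bound.arguments)

    return wrapper


class TinyModel:
    @unpack_inputs_sketch
    def call(self, input_ids=None, past_key_values=None, training=False):
        return input_ids, past_key_values, training


print(TinyModel().call([[1, 2]], past_key_values=None))
# -> ([[1, 2]], None, False)
```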
Who can review?
@gante
Tests
I ran RUN_SLOW=1 py.test -vv tests/ctrl/test_modeling_tf_ctrl.py, but it only got to around 69% and failed the pre-trained model test, because the model is too big for my local machine to test.