
Conversation

@OmarManzoor
Contributor

What does this PR do?

This adds a warning message to notify users that `gamma` and `beta` parameters are renamed, both during initialisation and during loading.

Fixes #29554
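
For context, a minimal illustration of the behaviour being warned about (the module and parameter names below are made up for illustration): any parameter whose name contains `gamma` or `beta` gets its checkpoint key rewritten during loading, so it silently fails to match.

```python
import torch
from torch import nn


class Block(nn.Module):
    def __init__(self):
        super().__init__()
        # A perfectly ordinary parameter whose name happens to contain "gamma".
        self.gamma = nn.Parameter(torch.ones(4))


# When a transformers model containing this block is loaded with
# from_pretrained, state dict keys containing "gamma" are rewritten
# ("gamma" -> "weight", "beta" -> "bias"), so "block.gamma" is looked up
# as "block.weight" and never matches the module's parameter.
```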

Before submitting

  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? Please add a link
    to it if that's the case.
  • Did you write any new necessary tests?

Who can review?

@amyeroberts

@OmarManzoor changed the title Add warning message for `beta` and `gamma` parameters Add warning message for beta and gamma parameters Jun 27, 2024
Comment on lines 3989 to 3993
if "beta" in loaded_keys or "gamma" in loaded_keys:
logger.warning(
f"Parameter names `gamma` or `beta` for {cls.__name__} will be renamed within the model. "
f"Please use different names to suppress this warning."
)
Contributor

I don't think this is quite right: this assumes the weight is called "beta" in the state dict, but it could be called "layer.beta".

Contributor

@amyeroberts left a comment

Hi @OmarManzoor,

Thanks for addressing this! We want to make sure we catch any place where the renaming happens, so any place where `if gamma in key` and `if beta in key` are True (so `key` can be a longer string that contains `beta` or `gamma`). As you've added, this would be in `_load_pretrained_model` but also in `_load_state_dict_into_model`.
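
A minimal sketch of a per-key check along these lines (illustrative only, assuming `loaded_keys` is an iterable of flattened state dict key strings; not the exact code from the PR):

```python
# Sketch: warn if any key merely *contains* "gamma" or "beta",
# rather than only on exact matches.
if any("gamma" in key or "beta" in key for key in loaded_keys):
    logger.warning(
        f"A parameter name that contains `beta` or `gamma` will be renamed "
        f"internally in {cls.__name__}. Please use a different name to "
        "suppress this warning."
    )
```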

@OmarManzoor
Contributor Author

> Hi @OmarManzoor,
>
> Thanks for addressing this! We want to make sure we catch any place where the renaming happens, so any place where `if gamma in key` and `if beta in key` are True (so `key` can be a longer string that contains `beta` or `gamma`). As you've added, this would be in `_load_pretrained_model` but also in `_load_state_dict_into_model`.

Hi @amyeroberts
Thanks for the feedback. Should we remove it during initialization? I added it in `post_init` because during the main `__init__` we might not have the parameters declared.
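
For reference, a sketch of what such a check in `post_init` could look like (illustrative only; by the time `post_init` runs the submodules are registered, so `named_parameters()` can see them — the real `post_init` also does weight initialisation, which is omitted here):

```python
def post_init(self):
    # Sketch: scan registered parameter names after the module tree is built.
    for name, _ in self.named_parameters():
        if "gamma" in name or "beta" in name:
            logger.warning(
                f"Parameter name `{name}` of {self.__class__.__name__} contains "
                "`gamma` or `beta` and will be renamed when a state dict is loaded."
            )
            break
```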

Contributor

@amyeroberts left a comment

Hi @OmarManzoor, thanks for iterating on this!

Given the diff, I'm slightly confused: were there no warnings being triggered before? Judging from the tests and logging messages, it seems there were.

Comment on lines +1514 to +1515
self.assertEqual(model.dtype, torch.float16)

Contributor

More importantly, we should check that the parameter is renamed as well

Contributor Author

@OmarManzoor Jul 8, 2024

I tried this out and it seems that the parameter is not renamed at all. When we load the model using `from_pretrained`, the parameter is still present with the name `gamma_param`.

Contributor

It shouldn't rename the parameter in the model, but it will rename the key in the `state_dict`, I believe. Could you dive into the loading logic and verify what's happening?
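
Roughly, the loading path rewrites matching keys in the incoming state dict (simplified from `modeling_utils.py`; exact details vary by version):

```python
# Simplified: keys are renamed in the state dict, not in the model itself.
old_keys, new_keys = [], []
for key in state_dict.keys():
    new_key = None
    if "gamma" in key:
        new_key = key.replace("gamma", "weight")
    if "beta" in key:
        new_key = key.replace("beta", "bias")
    if new_key is not None:
        old_keys.append(key)
        new_keys.append(new_key)
for old_key, new_key in zip(old_keys, new_keys):
    state_dict[new_key] = state_dict.pop(old_key)
```

So a module attribute named `gamma_param` keeps its name in the model, but its checkpoint entry is looked up as `weight_param`, which is why the parameter appears to be missing rather than renamed.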

Contributor Author

I tried updating the tests. Could you kindly have a look?
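
A hedged sketch of what such a test could look like (`ModelWithGamma` and `gamma_param` are illustrative names, not necessarily the ones used in the PR):

```python
import tempfile

import torch
from torch import nn

from transformers import PretrainedConfig, PreTrainedModel
from transformers.testing_utils import CaptureLogger
from transformers.utils import logging


class ModelWithGamma(PreTrainedModel):
    config_class = PretrainedConfig

    def __init__(self, config):
        super().__init__(config)
        # Parameter whose name contains "gamma", triggering the rename path.
        self.gamma_param = nn.Parameter(torch.ones(4))


def test_gamma_rename_warning():
    model = ModelWithGamma(PretrainedConfig())
    logger = logging.get_logger("transformers.modeling_utils")
    with tempfile.TemporaryDirectory() as tmp_dir:
        model.save_pretrained(tmp_dir)
        with CaptureLogger(logger) as cl:
            ModelWithGamma.from_pretrained(tmp_dir)
    # The loader should have warned that keys containing `gamma` are renamed.
    assert "gamma" in cl.out
```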

@OmarManzoor
Contributor Author

> Given the diff, I'm slightly confused: were there no warnings being triggered before? Judging from the tests and logging messages, it seems there were.

I basically removed the warning code that I added in the `post_init` method. Should that be kept?

@amyeroberts
Contributor

@OmarManzoor Ah, OK. I think the diff was rendering funny on GitHub. Should be OK.

Contributor

@amyeroberts left a comment

Looks great - thanks for adding and iterating on this!

@amyeroberts merged commit 1499a55 into huggingface:main Jul 11, 2024
@OmarManzoor
Contributor Author

> Looks great - thanks for adding and iterating on this!

Thank you.

@whwangovo

Why have you added warnings only for the initialization process and not for renaming during loading as well? The model I'm using is timm's ConvNeXt (and timm is even a companion framework to transformers), which has a `gamma` parameter. When loading, it just tells me that `gamma` failed to load without telling me why. I think the user should be informed when the `state_dict` keys are renamed; otherwise it will cause unnecessary confusion.
