clean up vision/text config dict arguments by ydshieh · Pull Request #19954 · huggingface/transformers

ydshieh · 2022-10-28T15:51:57Z

What does this PR do?

Remove vision_config_dict and text_config_dict: just use vision_config and text_config.

Make code base cleaner
Avoid surprising behavior (see the comment)

HuggingFaceDocBuilderDev · 2022-10-28T16:05:46Z

The documentation is not available anymore as the PR was closed or merged.

ydshieh · 2022-10-28T16:11:21Z

Without this PR, we have somehow surprising/confusing results

from transformers import CLIPConfig, CLIPModel

config = CLIPConfig.from_pretrained("openai/clip-vit-base-patch16")
print(config.vision_config.patch_size)
print(config.vision_config_dict["patch_size"])

config.vision_config.patch_size = 32
config.save_pretrained("v2")

config_v2 = CLIPConfig.from_pretrained("v2")
# This is not `32` which is unexpected!
# In fact, it is `vision_config_dict` is being used during loading to set `vision_config`
print(config_v2.vision_config.patch_size)
# This is 32 - unexpected!
print(config_v2.vision_config_dict["patch_size"])

config.vision_config_dict["patch_size"] = 32
config.save_pretrained("v3")

config_v3 = CLIPConfig.from_pretrained("v3")
# This is 32 - unexpected!
print(config_v3.vision_config.patch_size)
# This is 32 - OK
print(config_v3.vision_config_dict["patch_size"])

ydshieh · 2022-10-28T16:28:45Z

-        super().__init__(text_config_dict=text_config_dict, vision_config_dict=vision_config_dict, **kwargs)
+        super().__init__(**kwargs)
+
+        # If `_config_dict` exist, we use them for the backward compatibility.


For backward compatibility

ydshieh · 2022-10-28T16:29:46Z

@sgugger If you are happy with the current change, I will apply the changes to some other models, and the testing files.
So far it is good even if I don't change to_dict. It has already

output["text_config"] = self.text_config.to_dict()
output["vision_config"] = self.vision_config.to_dict()

ydshieh · 2022-10-28T16:38:38Z

        **kwargs
    ):
-        super().__init__(text_config=text_config, vision_config=vision_config, **kwargs)
+        super().__init__(**kwargs)


We don't need to pass text/vision config to super, as we will set self.text_config and self.vision_config below

sgugger

LGTM, but pinging @patrickvonplaten and @patil-suraj here too as it may have implications in Diffusers.

NielsRogge · 2022-11-01T14:43:09Z

Awesome that you are working on fixing this!

Encountered the same issue with a new model I'm working on called CLIPSeg.

Also, could we update GroupViT as well? This is also a CLIP-like model.

* clean up * For backward compatibility * clean up * Same changes for more models Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

ydshieh commented Oct 28, 2022

View reviewed changes

ydshieh requested a review from sgugger October 28, 2022 16:29

ydshieh commented Oct 28, 2022

View reviewed changes

sgugger approved these changes Oct 28, 2022

View reviewed changes

ydshieh added 4 commits November 2, 2022 11:44

clean up

751dff4

For backward compatibility

a59b8b8

clean up

7448ebc

Same changes for more models

a983507

ydshieh force-pushed the cleanup_vision_text_config_dict branch from 3b33586 to a983507 Compare November 2, 2022 10:45

ydshieh merged commit 8827e1b into main Nov 2, 2022

ydshieh deleted the cleanup_vision_text_config_dict branch November 2, 2022 11:03

ydshieh mentioned this pull request Nov 3, 2022

Allow passing arguments to model testers for CLIP-like models #20044

Merged

ydshieh mentioned this pull request Mar 8, 2023

Avoid text_config_dict and vision_config_dict being saved for CLIP-like models #22035

Merged

ydshieh mentioned this pull request Dec 19, 2023

Avoid unnecessary warnings when loading CLIPConfig #28108

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

clean up vision/text config dict arguments#19954

clean up vision/text config dict arguments#19954
ydshieh merged 4 commits into
mainfrom
cleanup_vision_text_config_dict

ydshieh commented Oct 28, 2022 •

edited

Loading

Uh oh!

HuggingFaceDocBuilderDev commented Oct 28, 2022 •

edited

Loading

Uh oh!

ydshieh commented Oct 28, 2022 •

edited

Loading

Uh oh!

ydshieh Oct 28, 2022

Uh oh!

ydshieh commented Oct 28, 2022 •

edited

Loading

Uh oh!

ydshieh Oct 28, 2022

Uh oh!

sgugger left a comment

Uh oh!

NielsRogge commented Nov 1, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

ydshieh commented Oct 28, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Uh oh!

HuggingFaceDocBuilderDev commented Oct 28, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ydshieh commented Oct 28, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ydshieh Oct 28, 2022

Choose a reason for hiding this comment

Uh oh!

ydshieh commented Oct 28, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ydshieh Oct 28, 2022

Choose a reason for hiding this comment

Uh oh!

sgugger left a comment

Choose a reason for hiding this comment

Uh oh!

NielsRogge commented Nov 1, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

ydshieh commented Oct 28, 2022 •

edited

Loading

HuggingFaceDocBuilderDev commented Oct 28, 2022 •

edited

Loading

ydshieh commented Oct 28, 2022 •

edited

Loading

ydshieh commented Oct 28, 2022 •

edited

Loading