Use CLIP model config to set some kwargs for components #16609

ydshieh · 2022-04-05T14:51:54Z

What does this PR do?

In CLIPModel, set output_attentions and output_hidden_states from CLIPModel.config when these values are specified in the configuration and not passed as arguments.

(Currently, these operations are done separately in its vision and text components, which causes a WIP CLIP PT/TF equivalence test to fail: #16557.)

Details

Currently, CLIPModel relies on the configurations of its 2 components (vision_model and text_model) for logic such as

(here self is CLIPVisionTransformer or CLIPTextTransformer)

output_attentions = output_attentions if output_attentions is not None else self.config.output_attentions

If output_attentions/output_hidden_states are not passed to CLIPModel.forward at this line

transformers/src/transformers/models/clip/modeling_clip.py

Lines 966 to 967 in 9fd5e6b

    output_attentions: Optional[bool] = None,
    output_hidden_states: Optional[bool] = None,

but CLIPModel.config has these values set, then CLIPModel.config.output_attentions and CLIPModel.config.output_hidden_states have no effect. This case occurs here:

transformers/tests/test_modeling_tf_common.py

Lines 544 to 547 in 9fd5e6b

    # Output all for aggressive testing
    config.output_hidden_states = True
    if self.has_attentions:
        config.output_attentions = True

Therefore, the CLIP PT/TF equivalence test won't return hidden_states/attentions for the PT model.
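To illustrate the PT-side behavior described above, here is a minimal sketch that does not use transformers at all. `ToyConfig`, `ToyComponent`, and `ToyParentBefore` are hypothetical stand-ins (not real library classes) for a config, a vision/text component, and a parent model that forwards the raw argument unchanged.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ToyConfig:
    # Hypothetical stand-in for a model configuration object.
    output_attentions: bool = False


class ToyComponent:
    """Hypothetical stand-in for a vision or text component."""

    def __init__(self, config: ToyConfig):
        self.config = config

    def forward(self, output_attentions: Optional[bool] = None) -> bool:
        # Same fallback pattern as quoted in the description: use the
        # argument if given, otherwise this component's own config.
        output_attentions = (
            output_attentions if output_attentions is not None else self.config.output_attentions
        )
        return output_attentions


class ToyParentBefore:
    """Hypothetical parent that passes the raw argument straight through."""

    def __init__(self, config: ToyConfig, component_config: ToyConfig):
        self.config = config
        self.component = ToyComponent(component_config)

    def forward(self, output_attentions: Optional[bool] = None) -> bool:
        # None is forwarded as-is, so only the component's config is consulted
        # and self.config.output_attentions is never read.
        return self.component.forward(output_attentions=output_attentions)


# The parent config asks for attentions, the component config does not.
parent = ToyParentBefore(ToyConfig(output_attentions=True), ToyConfig())
attentions_returned = parent.forward()
explicit_returned = parent.forward(output_attentions=True)
```

In this sketch the parent-level setting only takes effect when the flag is passed explicitly as an argument, which mirrors the situation the test setup runs into.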

In TF,

transformers/src/transformers/modeling_tf_utils.py

Line 393 in b33ab4e

def input_processing(func, config, input_ids, **kwargs):

will use config to set the kwargs at the CLIPModel level. These kwargs are then passed to the 2 components, so the CLIP PT/TF equivalence test returns hidden_states/attentions for the TF model.
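The approach this PR takes, resolving the flags at the parent level and passing the resolved values down, can be sketched roughly as follows. This is a toy illustration under stated assumptions, not the actual diff: `ToyConfig`, `ToyComponent`, `ToyParentAfter`, and the `resolve` helper are hypothetical names introduced only for this example.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class ToyConfig:
    # Hypothetical stand-in for a model configuration object.
    output_attentions: bool = False
    output_hidden_states: bool = False


def resolve(value: Optional[bool], default: bool) -> bool:
    """Return the explicit argument if given, otherwise the config default."""
    return value if value is not None else default


class ToyComponent:
    """Hypothetical stand-in for a vision or text component."""

    def __init__(self, config: ToyConfig):
        self.config = config

    def forward(
        self,
        output_attentions: Optional[bool] = None,
        output_hidden_states: Optional[bool] = None,
    ) -> Tuple[bool, bool]:
        return (
            resolve(output_attentions, self.config.output_attentions),
            resolve(output_hidden_states, self.config.output_hidden_states),
        )


class ToyParentAfter:
    """Hypothetical parent that resolves the flags from its own config first."""

    def __init__(self, config: ToyConfig, component_config: ToyConfig):
        self.config = config
        self.component = ToyComponent(component_config)

    def forward(
        self,
        output_attentions: Optional[bool] = None,
        output_hidden_states: Optional[bool] = None,
    ) -> Tuple[bool, bool]:
        # Resolve against the parent config, then pass concrete values down,
        # so the component never falls back to its own config for these flags.
        output_attentions = resolve(output_attentions, self.config.output_attentions)
        output_hidden_states = resolve(output_hidden_states, self.config.output_hidden_states)
        return self.component.forward(
            output_attentions=output_attentions,
            output_hidden_states=output_hidden_states,
        )


parent = ToyParentAfter(ToyConfig(True, True), ToyConfig())
from_config = parent.forward()
overridden = parent.forward(output_attentions=False)
```

Explicit arguments still take precedence over the parent config in this sketch; the config is used only when an argument is left as None.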

HuggingFaceDocBuilderDev · 2022-04-05T15:05:36Z

The documentation is not available anymore as the PR was closed or merged.

ydshieh · 2022-04-05T17:21:43Z

cc @gante (just for information), since he has recently been working on unpack_inputs & input_processing in TF

sgugger

Looks okay to me but will defer to @patil-suraj on this :-)
Thanks for your PR!

patil-suraj

LGTM, thank you for fixing this!

update vision & text components' config from CLIP model

219def7

ydshieh added 2 commits April 5, 2022 17:18

Use CLIP model's config for some fields (if specified) instead of those of vision & text components.

bb8b764

remove previous block

88ea984

ydshieh marked this pull request as ready for review April 5, 2022 16:56

ydshieh requested review from patil-suraj and sgugger April 5, 2022 17:20

sgugger approved these changes Apr 5, 2022

patil-suraj approved these changes Apr 6, 2022

ydshieh changed the title ~~Update vision & text components' config from CLIP model~~ Use CLIP model config to set some kwargs for components Apr 6, 2022

ydshieh merged commit ae6a7a7 into huggingface:main Apr 6, 2022

ydshieh deleted the fix_clip_pt_tf_outputs branch April 6, 2022 10:15
