⚠️ [CLAP] Fix dtype of logit scales in init #25682
Conversation
    text_config = config.text_config
    audio_config = config.audio_config

    self.logit_scale_a = nn.Parameter(torch.tensor(np.log(config.logit_scale_init_value)))
This behaviour is a result of the np.log operation defaulting to float64.
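A minimal sketch of the issue (the init value 14.29 is illustrative here; the real default lives in `ClapConfig.logit_scale_init_value`): `np.log` returns a numpy float64 scalar, and `torch.tensor` preserves that dtype, so the parameter ends up float64 even though torch's default dtype is float32.

```python
import numpy as np
import torch
import torch.nn as nn

logit_scale_init_value = 14.29  # illustrative value, not necessarily the ClapConfig default

# np.log yields a numpy float64 scalar, and torch.tensor keeps that dtype
param = nn.Parameter(torch.tensor(np.log(logit_scale_init_value)))
print(param.dtype)  # torch.float64
```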
Given the original code, we might need to init in float64 and then cast to float if it makes a difference. No idea if the actual saved value is in float64!
The parameters are initialised in float64 but are stored in float32 in the state dict.
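Note that the float32 value stored in the checkpoint does not repair the dtype at load time, because `load_state_dict` copies values into the existing parameter storage. A self-contained sketch (the toy module is a hypothetical stand-in for `ClapModel`; only the dtype behaviour matters):

```python
import numpy as np
import torch
import torch.nn as nn

class ToyClap(nn.Module):  # hypothetical stand-in for ClapModel
    def __init__(self):
        super().__init__()
        # float64 init, as in the pre-fix code
        self.logit_scale_a = nn.Parameter(torch.tensor(np.log(14.29)))

model = ToyClap()
# float32 value, as stored in the saved state dict
checkpoint = {"logit_scale_a": torch.tensor(2.66, dtype=torch.float32)}
model.load_state_dict(checkpoint)
print(model.logit_scale_a.dtype)  # torch.float64 -- the init dtype is kept
```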
younesbelkada left a comment:
Thanks!
ArthurZucker left a comment:
As mentioned offline, this was never used in the original repo. It is a bit breaking, but it is a bug fix. Let's just add one.
The documentation is not available anymore as the PR was closed or merged.
Note that in the original repo, the model is always cast to float16 for all training / inference. Thus, they likely never used the model in its default dtype, and always relied on explicitly casting to float16.
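A short sketch of why that explicit cast hides the issue: `.half()` converts every floating-point parameter, including a float64 one, to float16 (the init value is again illustrative).

```python
import numpy as np
import torch
import torch.nn as nn

module = nn.Module()
module.logit_scale_a = nn.Parameter(torch.tensor(np.log(14.29)))  # float64 init
print(module.logit_scale_a.dtype)  # torch.float64

module.half()  # the explicit cast the original repo always applies
print(module.logit_scale_a.dtype)  # torch.float16
```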
[CLAP] Fix dtype of logit scales
What does this PR do?
The dtype of the CLAP logit scale parameters was always float64 by default (even if the rest of the model was initialised in float32). This PR fixes the logit scales, such that they respect the default dtype of the model.
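A hedged sketch of the kind of init that respects the model's default dtype (the exact change is in the PR diff; using `math.log` here is one way to read "respect the default dtype"): computing the log as a plain Python float lets `torch.tensor` fall back to torch's default dtype instead of inheriting float64 from numpy.

```python
import math
import torch
import torch.nn as nn

logit_scale_init_value = 14.29  # illustrative; the real default comes from ClapConfig

# math.log returns a Python float, so torch.tensor uses torch's default dtype
logit_scale_a = nn.Parameter(torch.tensor(math.log(logit_scale_init_value)))
logit_scale_t = nn.Parameter(torch.tensor(math.log(logit_scale_init_value)))
print(logit_scale_a.dtype)  # torch.float32 under the default settings
```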