Fix DAC conversion script #39793

ebezzam · 2025-07-30T14:48:55Z

What does this PR do

Fix DAC conversion:

Most notably, perform weight norm removal on GPU instead of on CPU (otherwise layers with weight norm give different outputs when the model is run on GPU)
Add missing feature extractor parameters
Correctly cast the sampling rate

Make the add/remove weight norm functions more consistent
Update the explanation of the high tolerances during testing. We now know they come from weight norm removal on CPU (instead of GPU) and from different implementations of Snake1d (the original version uses JIT). Nevertheless, we stick with the current models on the Hub, as the differences are minimal.

Reproducer to show weight norm difference when doing weight removal on a different device: https://gist.github.com/ebezzam/c83f186dcfeaab8cac040c960eb474cd

src/transformers/models/dac/modeling_dac.py

ebezzam · 2025-07-30T14:53:36Z

tests/models/dac/test_modeling_dac.py

+1. The Transformers model does not use weight norm, for speed-up. During model conversion, weight norm was removed on
+CPU (old script: https://github.com/huggingface/transformers/blob/8e077a3e452e8cab94ef62b37d68258bd3dcffed/src/transformers/models/dac/convert_dac_checkpoint.py#L230)
+This leads to slightly different weights (1e-8), and the error accumulates. Removing weight norm on GPU would produce
+equivalent weights (current conversion script).
+2. The original version uses the Snake1D activation with JIT: https://github.com/descriptinc/descript-audio-codec/blob/c7cfc5d2647e26471dc394f95846a0830e7bec34/dac/nn/layers.py#L18
+The Transformers version does not use JIT, so outputs are slightly different.


Updated (definite) reason for high tolerances
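To make the second point concrete, here is a rough sketch comparing an eager Snake activation with a `torch.jit.script` version of the same function. The formula follows the linked `dac/nn/layers.py`; the function names, tensor shapes, and the tolerance are assumptions for illustration, and any difference is expected to be at floating-point rounding level (it may be exactly zero on CPU).

```python
import torch


def snake_eager(x: torch.Tensor, alpha: torch.Tensor) -> torch.Tensor:
    """Snake activation: x + sin^2(alpha * x) / alpha, evaluated eagerly."""
    shape = x.shape
    x = x.reshape(shape[0], shape[1], -1)
    x = x + (alpha + 1e-9).reciprocal() * torch.sin(alpha * x).pow(2)
    return x.reshape(shape)


# Same function compiled with TorchScript, which may fuse or reorder operations.
snake_jit = torch.jit.script(snake_eager)

torch.manual_seed(0)
x = torch.randn(2, 8, 100)           # (batch, channels, time)
alpha = torch.rand(1, 8, 1) + 0.5    # one positive alpha per channel

out_eager = snake_eager(x, alpha)
out_jit = snake_jit(x, alpha)
max_abs_diff = (out_eager - out_jit).abs().max().item()
print(f"max abs difference eager vs JIT: {max_abs_diff:.3e}")

# Integration tests can absorb such differences with an explicit tolerance.
torch.testing.assert_close(out_jit, out_eager, rtol=1e-5, atol=1e-5)
```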

HuggingFaceDocBuilderDev · 2025-07-30T15:01:57Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

vasqu

We should not update the conversion if we don't change the Hub. Having a legacy path is not ideal and is confusing for the average user, since the Hub weights would differ from the script here.

There are two options imo:

Change to new conversion (no extra flags) and update hub weights
Only leave the description where the differences stem from

I'd prefer option 1, even if it's breaking, tbh. I'd wait on Eustache here.

src/transformers/models/dac/convert_dac_checkpoint.py

src/transformers/models/dac/modeling_dac.py

src/transformers/models/dac/convert_dac_checkpoint.py

github-actions · 2025-08-20T14:54:27Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: dac

ebezzam · 2025-08-20T14:59:41Z

thanks @vasqu!

🚨 @eustlb (when you're back): @vasqu and I spoke offline and agreed that it would be better to:

ask Descript to update model weights (with conversion done on GPU)
switch to Snake 1D with JIT

The main reason is that several models depend on DAC (XCodec, Dia, Higgs Boson, maybe more), and it would be better if they didn't depend on a model with minor output differences. Otherwise, adding or integrating new models will be trickier, since we may not be able to tell whether differences come from DAC or from the implementation of the new model.

ebezzam added 13 commits July 9, 2025 17:50

Fix DAC (slow) integration tests.

4cce31f

Fix DAC conversion.

716baa6

Merge branch 'main' into dac_fix

086f6b0

Address comments

9e51f6f

Merge branch 'main' into dac_fix

0addb3e

Sync with main, uncomment nn.utils.parametrizations.weight_norm.

e5f02a2

Update DAC integration tests with expected outputs.

178c4d8

Merge branch 'main' into dac_fix

60004f5

Added info about encoder/decoder error and longer decoder outputs.

da8243b

Parameterize tests.

36a24cb

Set expected values to GitHub runners.

7d27ea1

Merge branch 'main' into dac_fix

57a6924

Fix DAC conversion.

3f0f3d9


ebezzam requested a review from eustlb July 30, 2025 14:53

ebezzam mentioned this pull request Jul 30, 2025

Add xcodec2 model #37868


ebezzam added the Audio label Jul 30, 2025

ebezzam added 2 commits August 20, 2025 12:44

Revert to CPU conversion for consistency with Hub.

56be0dd

Merge branch 'main' into dac_fix

400d800

vasqu reviewed Aug 20, 2025


ebezzam added 2 commits August 20, 2025 16:38

Cleanup.

b8a054e

Merge branch 'dac_fix' of github.com:ebezzam/transformers into dac_fix

eedcf03
