[static cache] fix device map per layer in VLMs #38488

zucchini-nlp · 2025-05-30T07:37:10Z

What does this PR do?

As per title, addresses the issue from #38426 (comment)

After the recent refactor, we don't return language model as decoder but only the base model, which contain vision/vq/audio etc encoders. Since generation relies on get_decoder() and since decoder is supposed to be only the LM backbone, this PR returns the correct module as decoder

HuggingFaceDocBuilderDev · 2025-05-30T07:50:43Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

gante

LGTM, thank you for fixing 🤗

return lm as decoder

a1aacfb

zucchini-nlp requested a review from gante May 30, 2025 08:21

remi-or mentioned this pull request Jun 9, 2025

Small fixes amd #38700

Merged

gante approved these changes Jun 20, 2025

View reviewed changes

Merge branch 'main' into multi-gpu-vlms

5c9d764

zucchini-nlp merged commit ff95974 into huggingface:main Jun 20, 2025
15 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[static cache] fix device map per layer in VLMs #38488

[static cache] fix device map per layer in VLMs #38488

Uh oh!

zucchini-nlp commented May 30, 2025

Uh oh!

HuggingFaceDocBuilderDev commented May 30, 2025

Uh oh!

gante left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[static cache] fix device map per layer in VLMs #38488

[static cache] fix device map per layer in VLMs #38488

Uh oh!

Conversation

zucchini-nlp commented May 30, 2025

What does this PR do?

Uh oh!

HuggingFaceDocBuilderDev commented May 30, 2025

Uh oh!

gante left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants