
Conversation

@eustlb (Contributor) commented Oct 17, 2025

What does this PR do?

The Parakeet encoder modifies the sequence length, so the input attention_mask cannot be used directly to know which embedding positions correspond to padding.

Here we do it differently from the current external approach used for Mimi, which consists in retrieving the correct mask:

```python
attention_mask = mimi_model.get_audio_codes_mask(inputs.attention_mask)
```

That helper recomputes a mask that is already known inside the forward pass and can simply be returned along with the outputs instead.
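To make the intent concrete, here is a minimal toy sketch of the principle (not the actual Parakeet code: the real subsampling is convolution-based, and the `encode` helper, shapes, and stride below are made up for illustration):

```python
import torch

def encode(features: torch.Tensor, attention_mask: torch.Tensor, stride: int = 4):
    """Toy encoder: downsample the time axis and return the mask at the same resolution."""
    hidden_states = features[:, ::stride, :]   # stand-in for the real conv subsampling
    output_mask = attention_mask[:, ::stride]  # mask already known inside the forward
    return hidden_states, output_mask

batch, time, dim = 2, 16, 8
features = torch.randn(batch, time, dim)
attention_mask = torch.ones(batch, time, dtype=torch.long)
attention_mask[1, 10:] = 0  # second sample is right-padded

hidden_states, output_mask = encode(features, attention_mask)
print(output_mask)          # tensor([[1, 1, 1, 1], [1, 1, 1, 0]])
print(output_mask.sum(-1))  # tensor([4, 3]) -> per-sample lengths, if needed
```

Returning the mask from the forward, as the diff below does with `ParakeetEncoderModelOutput`, saves callers from rederiving it the way `get_audio_codes_mask` does for Mimi.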

@eustlb requested a review from ArthurZucker, October 17, 2025 15:23
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.


```diff
-        return BaseModelOutput(last_hidden_state=hidden_states)
+        return ParakeetEncoderModelOutput(
+            last_hidden_state=hidden_states, attention_mask=output_mask.int() if output_attention_mask else None
+        )
```
Contributor
can we return lengths directly instead or along with attention_mask

Contributor Author
I'd rather keep an explicit approach here for potential future usage of the same with left padding; you can retrieve lengths by doing attention_mask.sum(-1)
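For illustration (toy tensors, not Parakeet outputs), this is the point about left padding: lengths are always recoverable from the mask, but a bare lengths tensor cannot tell which side the padding is on:

```python
import torch

right_padded = torch.tensor([[1, 1, 1, 0, 0]])
left_padded = torch.tensor([[0, 0, 1, 1, 1]])

print(right_padded.sum(-1))  # tensor([3])
print(left_padded.sum(-1))   # tensor([3]) -> same lengths, but the padding sits elsewhere
```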

Contributor
sounds good.

@eustlb mentioned this pull request Oct 23, 2025
@ArthurZucker (Collaborator) left a comment

Okay sounds good!

@eustlb enabled auto-merge (squash) October 23, 2025 23:01
@github-actions

[For maintainers] Suggested jobs to run (before merge)

run-slow: parakeet

@eustlb merged commit e4b920b into huggingface:main Oct 23, 2025
17 checks passed
ngazagna-qc pushed a commit to ngazagna-qc/transformers that referenced this pull request Oct 24, 2025
i3hz pushed a commit to i3hz/transformers that referenced this pull request Oct 30, 2025