Conversation

@Rocketknight1 (Member)

cc @sanchit-gandhi - this PR just sets return_attention_mask=True on the preprocessors in the automatic_speech_recognition pipeline to avoid warnings caused by missing attention masks. It works okay in my testing, but please let me know if you think it could cause any problems!

Fixes #33498
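
For context, a minimal standalone sketch of what the flag does at the feature-extractor level (a hypothetical example assuming a Whisper checkpoint; the pipeline now sets this internally, so user code does not need to):

    import numpy as np
    from transformers import WhisperFeatureExtractor

    feature_extractor = WhisperFeatureExtractor.from_pretrained("openai/whisper-tiny")

    audio = np.zeros(16000, dtype=np.float32)  # 1 second of silence at 16 kHz

    inputs = feature_extractor(
        audio,
        sampling_rate=16000,
        return_tensors="pt",
        return_attention_mask=True,  # the flag this PR enables in the pipeline
    )
    print(inputs.keys())  # now includes "attention_mask" alongside "input_features"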

@Rocketknight1 force-pushed the return_attention_mask_in_asr branch from 8d0d13b to 3e7b53e on September 16, 2024 13:28
@Vaibhavs10 (Contributor)

cc: @ylacombe here

@ylacombe (Contributor)

Hey @Rocketknight1, thanks for opening the PR!
In theory, I don't see any problems with this fix. Have you been able to run all the slow ASR pipeline tests here?

@Rocketknight1 (Member, Author)

@ylacombe there are some slow tests that I can't get working on my local machine (even on main). However, all the tests that do run also pass with this PR!

@ylacombe (Contributor)

Let me know if you want me to run them!

@Rocketknight1 (Member, Author)

@ylacombe Sure!

@ylacombe (Contributor) left a comment

So, I've first opened a PR (#33545) to fix some of the slow tests that were failing because of how data was loaded.
Your PR doesn't add any failing tests compared to main, and the changes make sense, so I think it should be OK to merge!

@Rocketknight1 (Member, Author)

Okay, cool! cc @LysandreJik for core maintainer review.

@LysandreJik (Member) left a comment

Thanks for the PR @Rocketknight1!

@Rocketknight1 merged commit 8efc06e into main on September 18, 2024
@Rocketknight1 deleted the return_attention_mask_in_asr branch on September 18, 2024 at 14:57
@monica-sekoyan

Hi @Rocketknight1,
I think return_attention_mask should also be added to the chunk_iter function, so that the warning also goes away when we specify chunk_length_s.
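
To illustrate the suggestion, a hedged standalone sketch (this paraphrases the pipeline's chunking behaviour; the real chunk_iter helper in the transformers source may differ):

    import numpy as np
    from transformers import WhisperFeatureExtractor

    feature_extractor = WhisperFeatureExtractor.from_pretrained("openai/whisper-tiny")

    audio = np.zeros(16000 * 45, dtype=np.float32)  # 45 s of silence at 16 kHz
    chunk_len = 16000 * 30  # 30-second chunks, as with chunk_length_s=30

    # When the pipeline chunks audio, each chunk goes through the feature
    # extractor; the suggestion is to pass return_attention_mask=True there too.
    for start in range(0, len(audio), chunk_len):
        chunk = audio[start : start + chunk_len]
        processed = feature_extractor(
            chunk,
            sampling_rate=feature_extractor.sampling_rate,
            return_tensors="pt",
            return_attention_mask=True,  # proposed addition for the chunked path
        )
        print(processed["attention_mask"].shape)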

@Holmes-GU

> cc @sanchit-gandhi - this PR just sets return_attention_mask=True on the preprocessors in the automatic_speech_recognition pipeline to avoid warnings caused by missing attention masks. It works okay in my testing, but please let me know if you think it could cause any problems!
>
> Fixes #33498

Hi, where can I set return_attention_mask=True?

BernardZach pushed a commit to BernardZach/transformers that referenced this pull request on Dec 5, 2024
@AlessandroSpallina

I'm still seeing the error fixed here with the latest transformers==4.47.1:

    The attention mask is not set and cannot be inferred from input because pad token is same as eos token. As a consequence, you may observe unexpected behavior. Please pass your input's `attention_mask` to obtain reliable results.
    Whisper did not predict an ending timestamp, which can happen if audio is cut off in the middle of a word. Also make sure WhisperTimeStampLogitsProcessor was used during generation.

when running this code:

    import torch
    from transformers import pipeline

    # self.model, self.device, and processor come from the reporter's
    # surrounding class and are not defined in this snippet.
    pipe = pipeline(
        "automatic-speech-recognition",
        model=self.model,
        torch_dtype=torch.float16,
        chunk_length_s=30,
        batch_size=24,
        return_timestamps=True,
        device=self.device,
        tokenizer=processor.tokenizer,
        feature_extractor=processor.feature_extractor,
        model_kwargs={"use_flash_attention_2": True},
        generate_kwargs={
            "max_new_tokens": 128,
        },
    )
