Conversation

gante (Contributor) commented on May 20, 2025

What does this PR do?

Whisper has a very slow set of tests.

This PR applies low-hanging-fruit changes to get faster tests (slow tests: 7m47s -> 6m59s on my machine):

  • the main dataset is now loaded once per test suite (as opposed to once per test in the relevant tests)
  • generation tests happen on GPU when possible (a sketch of both changes follows this list)
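
To make the two bullets concrete, here is a minimal sketch of the kind of change described, assuming a class-level cache via setUpClass and the usual testing utilities; the class name, checkpoint, and dataset below are illustrative and not the PR's actual diff:

```python
import unittest

import torch
from datasets import load_dataset

from transformers import WhisperForConditionalGeneration, WhisperProcessor
from transformers.testing_utils import require_torch, torch_device


@require_torch
class WhisperIntegrationSketch(unittest.TestCase):  # hypothetical class, not the PR's
    @classmethod
    def setUpClass(cls):
        # Load the (slow-to-fetch) dataset once for the whole suite instead of once per test.
        cls._dataset = load_dataset(
            "hf-internal-testing/librispeech_asr_dummy", "clean", split="validation"
        )

    def test_tiny_en_generation_sketch(self):
        processor = WhisperProcessor.from_pretrained("openai/whisper-tiny.en")
        model = WhisperForConditionalGeneration.from_pretrained("openai/whisper-tiny.en")
        # Run generation on GPU when one is available (torch_device is "cuda" on GPU runners).
        model.to(torch_device)

        sample = self._dataset[0]["audio"]
        inputs = processor(
            sample["array"], sampling_rate=sample["sampling_rate"], return_tensors="pt"
        ).to(torch_device)

        with torch.no_grad():
            generated_ids = model.generate(inputs.input_features)
        transcription = processor.batch_decode(generated_ids, skip_special_tokens=True)[0]
        self.assertIsInstance(transcription, str)
```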

@gante gante requested a review from ydshieh May 20, 2025 15:32
Review thread on the following hunk:

@require_torchaudio
class WhisperModelIntegrationTests(unittest.TestCase):
    def setUp(self):
        self._unpatched_generation_mixin_generate = transformers.GenerationMixin.generate
gante (Contributor, Author) commented on May 20, 2025

this was only used in one test, and that test should use the base generate. This overcomplicates things.

idk why it was added in the first place 🤔

ydshieh (Collaborator) replied:

It is linked to #29312, but the change here is test-only. So it's good for me if it works.

gante (Contributor, Author) replied:

That is implicitly tested: the output checked in the test is different if num_beams is not respected :)

(just like in all other beam search integration tests: we check the output, which is sensitive enough to detect bad usage of flags)
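
To illustrate the kind of check being referenced, here is a minimal sketch (not a test from this PR; the checkpoint, audio sample, and expected string are assumptions):

```python
import torch
from datasets import load_dataset

from transformers import WhisperForConditionalGeneration, WhisperProcessor

processor = WhisperProcessor.from_pretrained("openai/whisper-tiny.en")
model = WhisperForConditionalGeneration.from_pretrained("openai/whisper-tiny.en")

sample = load_dataset("hf-internal-testing/librispeech_asr_dummy", "clean", split="validation")[0]["audio"]
inputs = processor(sample["array"], sampling_rate=sample["sampling_rate"], return_tensors="pt")

# Beam search with num_beams=4: if the flag were silently ignored, the decoded text would
# generally differ, so comparing against a hard-coded expected transcription implicitly
# verifies that the flag was respected.
with torch.no_grad():
    generated_ids = model.generate(inputs.input_features, num_beams=4, max_new_tokens=128)
transcription = processor.batch_decode(generated_ids, skip_special_tokens=True)[0]

EXPECTED = " Mr. Quilter is the apostle of the middle classes and we are glad to welcome his gospel."  # illustrative value
print(transcription == EXPECTED, repr(transcription))
```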

vasqu (Contributor) replied:

It's definitely way over the top for what it tried. So yea let's keep it simple.

HuggingFaceDocBuilderDev commented:

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@gante gante requested a review from vasqu May 20, 2025 16:19
Review thread on the following hunk:

        transformers.GenerationMixin.generate = self._unpatched_generation_mixin_generate

    @cached_property
    def default_processor(self):
gante (Contributor, Author) commented:

unused

Comment on lines -1589 to -1590:

        torch_device = "cpu"
        set_seed(0)
ydshieh (Collaborator) commented:

could you run with flakefinder to see if we need the seed or not?

gante (Contributor, Author) replied:

RUN_SLOW=1 py.test tests/models/whisper/test_modeling_whisper.py -k test_tiny_en_generation --flake-finder --flake-runs 100 yields no failures

(set_seed(0) comes from the original Whisper commit. However, AFAIK, Whisper has no random components -- in fact, many output-checking tests in this file don't set a seed.)

ydshieh (Collaborator) left a comment:

LGTM, but it would be good to check a few things.

tearDownClass is good practice to clean things up.
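
For context, a minimal sketch of the kind of class-level cleanup being suggested, assuming the dataset is cached as a class attribute (class and attribute names are illustrative):

```python
import unittest

from datasets import load_dataset


class WhisperIntegrationSketch(unittest.TestCase):  # hypothetical name
    @classmethod
    def setUpClass(cls):
        # Expensive object shared by all tests in the class.
        cls._dataset = load_dataset(
            "hf-internal-testing/librispeech_asr_dummy", "clean", split="validation"
        )

    @classmethod
    def tearDownClass(cls):
        # Drop the class-level reference so the dataset can be garbage collected once this
        # class's tests are done, instead of living until the Python process exits.
        del cls._dataset
```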

gante (Contributor, Author) commented on May 21, 2025

@ydshieh regarding dataset loading: I remembered that we had the same issue in the pipeline tests, so I've pushed a new commit that copies the pattern from the pipeline tests.

The pattern we added before doesn't have a teardown method -- do we need to add it everywhere the pattern is used?

ydshieh (Collaborator) commented on May 21, 2025

@gante Thank you for checking and for the further improvement.

Regarding dataset loading, it's probably minor (for now), so you can proceed to merge.

(Those loaded objects assigned to class attributes will remain until the Python process exits. At some point, it might become a problem if we have many test classes.)

vasqu (Contributor) left a comment:

LGTM, thanks. Was only concerned about the seeds but seems to be a non-issue.


@gante gante merged commit e4decee into huggingface:main May 21, 2025
14 checks passed