Fix flaky `test_beam_search_low_memory` #35611

ydshieh · 2025-01-10T15:25:49Z

What does this PR do?

test_beam_search_low_memory is very flaky for Emu3Vision2TextModelTest.

The 3 fixes in this PR are all necessary to have 0 failure in a suite of 100 runs.

call set_xxx_less_flaky methods
avoid generate image tokens
compare with _check_similar_generate_outputs

The final version run 500 times without failure.

tests/generation/test_utils.py

ydshieh · 2025-01-10T15:51:06Z

tests/generation/test_utils.py

            )
-            self.assertListEqual(low_output.tolist(), high_output.tolist())
+            # The two outputs must match and their shape must be as expected
+            self._check_similar_generate_outputs(low_output, high_output)


This is still required, otherwise 1% of chance to fail for Emu3Vision2TextModelTest

HuggingFaceDocBuilderDev · 2025-01-10T16:13:07Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

zucchini-nlp

LGTM, thanks!

tests/generation/test_utils.py

ydshieh · 2025-01-10T16:29:23Z

tests/models/mobilenet_v1/test_modeling_mobilenet_v1.py::MobileNetV1ModelTest::test_batching_equivalence - AssertionError: tensor(False) is not true : Batched and Single row outputs are not equal in MobileNetV1ForImageClassification for key=logits. Difference=0.48479270935058594.

This is a bit nasty but irrelevant to this PR. Will merge and try to fix the above one next week.

zucchini-nlp · 2025-01-10T16:31:59Z

@ydshieh Cool, feel free to close #29516 after fixing that. We had a small tracker for flaky vision model with that test

ydshieh · 2025-01-10T17:30:20Z

Thanks. The tracker is more for batching tests, but I will check with #35564 next week

ydshieh added 3 commits January 10, 2025 15:43

fix

d199556

fix

0a05158

fix

1ca6b6d

ydshieh requested review from ArthurZucker and Rocketknight1 as code owners January 10, 2025 15:25

ydshieh commented Jan 10, 2025

View reviewed changes

tests/generation/test_utils.py Outdated Show resolved Hide resolved

ydshieh marked this pull request as draft January 10, 2025 15:27

fix

c4f282f

ydshieh marked this pull request as ready for review January 10, 2025 15:49

ydshieh requested a review from zucchini-nlp January 10, 2025 15:50

ydshieh commented Jan 10, 2025

View reviewed changes

zucchini-nlp approved these changes Jan 10, 2025

View reviewed changes

tests/generation/test_utils.py Outdated Show resolved Hide resolved

fix

a2d4f91

ydshieh merged commit 04eae98 into main Jan 10, 2025
24 of 26 checks passed

ydshieh deleted the fix_flaky_test_beam_search_low_memory branch January 10, 2025 16:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix flaky `test_beam_search_low_memory` #35611

Fix flaky `test_beam_search_low_memory` #35611

Uh oh!

ydshieh commented Jan 10, 2025 •

edited

Loading

Uh oh!

Uh oh!

ydshieh Jan 10, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Jan 10, 2025

Uh oh!

zucchini-nlp left a comment

Uh oh!

Uh oh!

ydshieh commented Jan 10, 2025

Uh oh!

Uh oh!

zucchini-nlp commented Jan 10, 2025

Uh oh!

ydshieh commented Jan 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Fix flaky test_beam_search_low_memory #35611

Fix flaky test_beam_search_low_memory #35611

Uh oh!

Conversation

ydshieh commented Jan 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Uh oh!

Uh oh!

ydshieh Jan 10, 2025

Choose a reason for hiding this comment

Uh oh!

HuggingFaceDocBuilderDev commented Jan 10, 2025

Uh oh!

zucchini-nlp left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ydshieh commented Jan 10, 2025

Uh oh!

Uh oh!

zucchini-nlp commented Jan 10, 2025

Uh oh!

ydshieh commented Jan 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Fix flaky `test_beam_search_low_memory` #35611

Fix flaky `test_beam_search_low_memory` #35611

ydshieh commented Jan 10, 2025 •

edited

Loading