Clean up flaky behaviour on Slow CUDA Pytorch Push Tests #4759

DN6 · 2023-08-24T11:43:32Z

Certain Mixin Classes and Tests use the sum of the absolute differences between two an expected and output tensor to check if the models/pipelines are behaving as expected. e.g

max_diff = (image - new_image).abs().sum()
assert max_diff < 1e-5

Using sum leads to more flaky testing behaviour due since small differences over many individual values in the tensors accumulate. Proposing to move this to use the maximum absolute difference between tensors instead.
e.g

max_diff = (image - new_image).abs().max()
assert max_diff < 1e-5

Additionally, the PipelineTesterMixin has a some tests: test_from_save_pretrained, test_from_save_pretrained_variant, test_save_load_float16 etc that tests against precision tolerances but do not currently allow the pipelines inheriting from them to pass in custom precision values, which is leading to a number of failures. This PR adds an expected_max_diff argument to each of these tests to control the precision level.

Fixes # (issue)

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you read our philosophy doc (important for complex PRs)?
Was this discussed/approved via a Github issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

patrickvonplaten

Cool!

HuggingFaceDocBuilderDev · 2023-08-24T11:53:26Z

The documentation is not available anymore as the PR was closed or merged.

patrickvonplaten · 2023-08-24T13:17:56Z

Failing tests are unrelated - feel free to merge! Failing tests will be fixed in: #4761

…#4759) use max diff to compare model outputs

use max diff to compare model outputs

47e7253

DN6 changed the title ~~use max diff to compare model outputs~~ Clean up flaky behaviour on Slow CUDA Pytorch Push Tests Aug 24, 2023

DN6 requested a review from patrickvonplaten August 24, 2023 11:44

patrickvonplaten approved these changes Aug 24, 2023

View reviewed changes

Merge branch 'main' into test-cleanup-use-max-to-compare

30971f7

DN6 merged commit 4f05058 into main Aug 24, 2023

DN6 mentioned this pull request Aug 28, 2023

Test Cleanup Precision issues #4812

Merged

6 tasks

kashif deleted the test-cleanup-use-max-to-compare branch September 13, 2023 13:41

AmericanPresidentJimmyCarter pushed a commit to AmericanPresidentJimmyCarter/diffusers that referenced this pull request Apr 26, 2024

Clean up flaky behaviour on Slow CUDA Pytorch Push Tests (huggingface…

c7b3956

…#4759) use max diff to compare model outputs

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Clean up flaky behaviour on Slow CUDA Pytorch Push Tests #4759

Clean up flaky behaviour on Slow CUDA Pytorch Push Tests #4759

DN6 commented Aug 24, 2023

patrickvonplaten left a comment

HuggingFaceDocBuilderDev commented Aug 24, 2023 •

edited

Loading

patrickvonplaten commented Aug 24, 2023

Clean up flaky behaviour on Slow CUDA Pytorch Push Tests #4759

Clean up flaky behaviour on Slow CUDA Pytorch Push Tests #4759

Conversation

DN6 commented Aug 24, 2023

Before submitting

Who can review?

patrickvonplaten left a comment

Choose a reason for hiding this comment

HuggingFaceDocBuilderDev commented Aug 24, 2023 • edited Loading

patrickvonplaten commented Aug 24, 2023

HuggingFaceDocBuilderDev commented Aug 24, 2023 •

edited

Loading