Remove `@slow` for `test_eager_matches_sdpa_inference` #34558

ydshieh · 2024-11-01T09:06:54Z

What does this PR do?

Removes `@slow` from `test_eager_matches_sdpa_inference` and makes the test less flaky:

Use smaller (shorter) inputs
Use a larger epsilon in norm layers
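
The description does not say why a larger epsilon helps. One plausible reading is that with a tiny epsilon, a norm layer divides near-constant activations by a near-zero standard deviation, so small numerical differences between the eager and SDPA paths get amplified. The sketch below illustrates that effect with a hand-written layer norm in plain Python; the vectors and the epsilon values (`1e-12` and `1.0`) are illustrative assumptions, not the values used in the PR.

```python
import math


def layer_norm(xs, eps):
    """Plain layer norm without learned scale or bias."""
    mean = sum(xs) / len(xs)
    var = sum((x - mean) ** 2 for x in xs) / len(xs)
    return [(x - mean) / math.sqrt(var + eps) for x in xs]


# Two almost-constant activation vectors that differ only by tiny noise,
# standing in for what two attention implementations might produce.
a = [1.0, 1.0001, 1.0, 1.0]
b = [1.0, 1.0, 1.0001, 1.0]


def max_diff(eps):
    """Largest element-wise gap between the two normalized vectors."""
    return max(abs(x - y) for x, y in zip(layer_norm(a, eps), layer_norm(b, eps)))


small_eps_gap = max_diff(1e-12)  # noise is blown up to order 1
large_eps_gap = max_diff(1.0)    # noise stays at roughly its input scale
```

With the tiny epsilon the two outputs disagree by more than 1, while with the large epsilon the gap stays near the original `1e-4` noise, which is well within a typical comparison tolerance.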

HuggingFaceDocBuilderDev · 2024-11-01T09:51:14Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

ydshieh · 2024-11-04T11:53:35Z

tests/generation/test_utils.py

+            for key in ["image_token_index", "image_token_id", "video_token_index", "video_token_id", "vision_start_token_id"]:
+                token_index = getattr(config, key, None)
+                if token_index is not None and token_index < config.get_text_config().vocab_size:
+                    logits_processor_kwargs["bad_words_ids"].append([token_index])


This makes it more general.

`vision_start_token_id` is required for qwen2_vl.
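
As a self-contained sketch of what the loop in the diff does, the snippet below runs the same logic against a stand-in config. The `SimpleNamespace` objects, the token ids, and the `collect_bad_words_ids` helper are hypothetical and exist only for illustration; only the key list and the vocab-size check come from the diff.

```python
from types import SimpleNamespace

SPECIAL_TOKEN_KEYS = [
    "image_token_index",
    "image_token_id",
    "video_token_index",
    "video_token_id",
    "vision_start_token_id",
]


def collect_bad_words_ids(config):
    """Collect special multimodal token ids that generation should never emit."""
    bad_words_ids = []
    vocab_size = config.get_text_config().vocab_size
    for key in SPECIAL_TOKEN_KEYS:
        token_index = getattr(config, key, None)
        # Skip attributes the config lacks and ids outside the text vocabulary.
        if token_index is not None and token_index < vocab_size:
            bad_words_ids.append([token_index])
    return bad_words_ids


# Hypothetical stand-in for a vision-language model config.
text_config = SimpleNamespace(vocab_size=100)
config = SimpleNamespace(
    image_token_id=7,
    vision_start_token_id=9,
    video_token_id=150,  # outside the text vocabulary, so it is skipped
    get_text_config=lambda: text_config,
)

result = collect_bad_words_ids(config)
```

Looking the attribute up by name with a `None` default is what lets one loop cover models that name their special tokens differently.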

gante

Yay fewer slow tests 🙌

Added a few questions/suggestions to see if we can remove a few more overwritten cases 😈

gante · 2024-11-04T17:13:53Z

tests/models/albert/test_modeling_albert.py

+    @parameterized.expand([("float16",), ("bfloat16",), ("float32",)])
+    @require_torch_sdpa
+    @unittest.skip("Albert requires `head_mask` which is currently not done in this test.")
+    def test_eager_matches_sdpa_inference(self):
+        pass


On skips like this, on Albert and other models: the test pulls the main input and the attention mask to manipulate them, finally sending them to the model. We could pop these items from inputs_dict, and then pass **inputs_dict to the model (e.g. model_eager(**prepared_inputs, **inputs_dict)) -- I think then we wouldn't need to skip tests due to missing inputs 🤗

Maybe let's merge and do this in a follow-up PR. It's never worked before.
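
A minimal sketch of the suggestion above, using plain Python stand-ins rather than the real test code. `call_with_all_inputs` and `toy_model` are hypothetical names, and the truncation step only stands in for whatever manipulation the real test performs on the main input and attention mask.

```python
def call_with_all_inputs(model, inputs_dict, main_input_name="input_ids"):
    """Pop the inputs the test manipulates, then forward everything else untouched."""
    remaining = dict(inputs_dict)  # shallow copy so the caller's dict is not mutated
    main_input = remaining.pop(main_input_name)
    attention_mask = remaining.pop("attention_mask", None)

    # The real test would reshape or pad these two here; this sketch only truncates.
    prepared_inputs = {main_input_name: main_input[:2]}
    if attention_mask is not None:
        prepared_inputs["attention_mask"] = attention_mask[:2]

    # Extra model-specific inputs (e.g. `head_mask`) ride along via **remaining.
    return model(**prepared_inputs, **remaining)


def toy_model(input_ids, attention_mask=None, head_mask=None):
    # Hypothetical model that refuses to run without `head_mask`.
    if head_mask is None:
        raise ValueError("head_mask is required")
    return {"n_tokens": len(input_ids), "head_mask": head_mask}


inputs_dict = {"input_ids": [1, 2, 3], "attention_mask": [1, 1, 1], "head_mask": [1, 0]}
output = call_with_all_inputs(toy_model, inputs_dict)
```

Because the popped keys are removed from the copy before unpacking, no keyword is passed twice, and model-specific extras reach the model without the test having to know about them.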

tests/models/idefics/test_modeling_idefics.py

tests/models/mimi/test_modeling_mimi.py

gante · 2024-11-04T17:16:35Z

tests/models/musicgen/test_modeling_musicgen.py


    @parameterized.expand([("float16",), ("bfloat16",), ("float32",)])
    @require_torch_sdpa
-    @slow


Can't we just delete the test? (it has # Copied from tests.test_modeling_common.ModelTesterMixin.test_eager_matches_sdpa_inference and it inherits the mixin, so it should run the original test!)

No, it fails. I didn't dig into why it's failing (input issues), though.

Interesting -- in that case, how does # Copied from work? 👀

# Copied from is only applied to files under src I believe :-) but people sometimes use it in the tests/ 😆
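
The exchange above rests on ordinary Python inheritance: a `# Copied from` line in a test file is just a comment, so the method defined in the test class shadows the mixin's version, and deleting it would make the mixin's version run instead. A toy sketch, with classes that are simplified stand-ins and not the real test classes:

```python
class ModelTesterMixin:
    # Toy stand-in for the shared mixin; the real one lives in tests/test_modeling_common.py.
    def test_eager_matches_sdpa_inference(self):
        return "mixin version"


class OverridingModelTest(ModelTesterMixin):
    # Copied from tests.test_modeling_common.ModelTesterMixin.test_eager_matches_sdpa_inference
    # (to Python this is a plain comment; the method below is what actually runs)
    def test_eager_matches_sdpa_inference(self):
        return "model-specific version"


class InheritingModelTest(ModelTesterMixin):
    # No override: the mixin's test is inherited unchanged.
    pass


overridden = OverridingModelTest().test_eager_matches_sdpa_inference()
inherited = InheritingModelTest().test_eager_matches_sdpa_inference()
```

This is why deleting the model-specific copy is not a no-op: the inherited mixin test would have to cope with that model's inputs on its own.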

tests/models/musicgen_melody/test_modeling_musicgen_melody.py

gante · 2024-11-04T17:20:48Z

Tagging @zucchini-nlp to double-check VLM test changes :)

zucchini-nlp

Thanks, LGTM! Left a few questions and things we can clean up more

src/transformers/models/llava_next_video/modeling_llava_next_video.py

zucchini-nlp · 2024-11-05T10:15:15Z

tests/models/idefics/test_modeling_idefics.py

        return floats_tensor([self.batch_size, self.num_channels, self.image_size, self.image_size])

    @require_torch_sdpa
-    @slow


I think this test doesn't need a skip anymore, since we no longer check whether the model has SDPA layers within this test. But it can be skipped due to the same flakiness.

It still has input preparation issues, for which I added the skip message below to another model test class:

"Idefics requires both text and image inputs which is currently not done in this test."

As mentioned in a reply to Joao's comment, let's try to do it in a follow up PR

tests/models/llava_next_video/test_modeling_llava_next_video.py

zucchini-nlp · 2024-11-05T10:21:01Z

tests/models/qwen2_vl/test_modeling_qwen2_vl.py

        input_ids = ids_tensor([self.batch_size, self.seq_length], self.vocab_size)
        attention_mask = torch.ones(input_ids.shape, dtype=torch.long, device=torch_device)

+        input_ids[:, -1] = self.pad_token_id


For my understanding, is there any reason why the last token has to be a pad?

To avoid an index error in the modeling code:

vision_tokens = input_ids[vision_start_indices + 1]
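
To make the failure mode concrete, here is a sketch that mimics the `+ 1` lookup with plain Python lists instead of tensors. The helper name and the token ids (`9` for the vision-start token, `0` for pad) are assumptions for illustration only.

```python
def tokens_after_vision_start(input_ids, vision_start_token_id):
    """Return the token that follows each vision-start token."""
    starts = [i for i, tok in enumerate(input_ids) if tok == vision_start_token_id]
    # Reading position i + 1 goes out of range when a start token is the last element.
    return [input_ids[i + 1] for i in starts]


VISION_START = 9  # hypothetical id
PAD = 0           # hypothetical id

random_ids = [5, 9, 3, 9]  # randomly generated ids can end on the vision-start token

safe_ids = list(random_ids)
safe_ids[-1] = PAD  # same idea as `input_ids[:, -1] = self.pad_token_id`
following = tokens_after_vision_start(safe_ids, VISION_START)
```

Calling the helper on `random_ids` raises `IndexError`, whereas forcing the last position to the pad token guarantees that every vision-start token has a successor.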

ArthurZucker

thanks 🤗

tests/test_modeling_common.py

ydshieh force-pushed the fix_more_matching branch 4 times, most recently from ad647f2 to 2afd90e · November 4, 2024 09:36

ydshieh added 4 commits November 4, 2024 11:11

update

9c040bd

update

72184d9

update

ac16809

update

631cdec

ydshieh force-pushed the fix_more_matching branch from fcc2bfd to 631cdec · November 4, 2024 10:12

ydshieh changed the title from "run sdpa" to "Make test_eager_matches_sdpa_inference less flaky" · Nov 4, 2024

ydshieh requested a review from gante November 4, 2024 10:23

ydshieh changed the title from "Make test_eager_matches_sdpa_inference less flaky" to "Remove @slow for test_eager_matches_sdpa_inference less flaky" · Nov 4, 2024

ydshieh changed the title from "Remove @slow for test_eager_matches_sdpa_inference less flaky" to "Remove @slow for test_eager_matches_sdpa_inference" · Nov 4, 2024

ydshieh added 3 commits November 4, 2024 12:45

update

271306b

update

7082ba0

update

2779c68

ydshieh commented Nov 4, 2024

update

c18c6c6

gante approved these changes Nov 4, 2024

gante requested a review from zucchini-nlp November 4, 2024 17:20

update

69021b5

ydshieh requested a review from ArthurZucker November 4, 2024 19:46

zucchini-nlp approved these changes Nov 5, 2024

ArthurZucker approved these changes Nov 5, 2024

ydshieh added 2 commits November 5, 2024 15:42

update

f709dbb

update

296c068

ydshieh merged commit f2d5dfb into main Nov 5, 2024

ydshieh deleted the fix_more_matching branch November 5, 2024 15:10

ydshieh mentioned this pull request Nov 28, 2024

Make test_generate_with_static_cache even less flaky #34995

Merged
