
Widen match condition for _can_record_outputs#43762

Closed
molbap wants to merge 2 commits into main from fix_recording_on_reload

Conversation


@molbap molbap commented Feb 5, 2026

In case of module reloading, we currently lose the recording hooks for hidden states and attentions. This PR widens the matching condition a bit.

Should fix #43761; also mentioned in linkedin/Liger-Kernel#1061.

Reproducer for CLIP:

import importlib, torch
from transformers import CLIPVisionConfig, CLIPVisionModel
import transformers.models.clip.modeling_clip as modeling_clip

config = CLIPVisionConfig(hidden_size=32, intermediate_size=64, num_hidden_layers=2, num_attention_heads=4, image_size=30, patch_size=10)
pixel_values = torch.randn(1, 3, 30, 30)

m1 = CLIPVisionModel(config)
o1 = m1(pixel_values=pixel_values, output_hidden_states=True)
print("before reload:", o1.hidden_states is None)  # False: hidden states are recorded

importlib.reload(modeling_clip)

m2 = CLIPVisionModel(config)
o2 = m2(pixel_values=pixel_values, output_hidden_states=True)
print("after reload:", o2.hidden_states is None)  # True on main: hooks were not attached

On main, the model instantiated after the reload currently fails to record hidden states; this is a fix proposal.

Added a test as well (specific to CLIP, which is likely enough).

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

for key, specs in capture_tasks:
    # The second check is for multimodals where only backbone layer suffix is available
    if (specs.target_class is not None and isinstance(module, specs.target_class)) or (
        specs.class_name is not None and name.endswith(specs.class_name)
Member:

This check on class_name was added for AyaVision, but I think it's not needed anymore. At least the tests don't complain 😄

Contributor Author:

aah :D makes sense, wdyt of the added check though? not sure how to handle reloading more elegantly

Member:

I think it's the best we can do with importlib.reload, and it shouldn't backfire with false positives. Lemme approve, I forgot to
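For illustration, the name-based fallback discussed above can be sketched as follows. CaptureSpec, EncoderLayer, and matches are hypothetical stand-ins for the real transformers internals, not the actual PR diff:

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class CaptureSpec:
    # hypothetical stand-in for the spec objects iterated in capture_tasks
    target_class: Optional[type] = None
    class_name: Optional[str] = None

class EncoderLayer:
    """Dummy submodule class for the demo."""

def matches(name: str, module: object, specs: CaptureSpec) -> bool:
    """Return True when a submodule should get a recording hook attached."""
    if specs.target_class is not None:
        if isinstance(module, specs.target_class):
            return True
        # Widened check: after importlib.reload the class object differs
        # but its name does not, so fall back to comparing class names.
        if type(module).__name__ == specs.target_class.__name__:
            return True
    # Name-suffix match for multimodals where only the backbone suffix is known.
    return specs.class_name is not None and name.endswith(specs.class_name)

# A "reloaded" twin of EncoderLayer: same name, different class object.
ReloadedLayer = type("EncoderLayer", (), {})
spec = CaptureSpec(target_class=EncoderLayer)
print(matches("encoder.layers.0", EncoderLayer(), spec))   # True (isinstance)
print(matches("encoder.layers.0", ReloadedLayer(), spec))  # True (name fallback)
```

Comparing by __name__ is coarser than identity, but as noted above it is unlikely to produce false positives in practice, since submodule class names within one model are distinct.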

outputs = model(pixel_values=pixel_values, output_hidden_states=True)
self.assertIsNotNone(outputs.hidden_states)

importlib.reload(modeling_clip)
Member:

maybe we can move it under test_modeling_common.py, make something similar to this test, and reload the module at some point before checking that config.output_attentions works?

def test_attention_outputs(self):
    if not self.has_attentions:
        self.skipTest(reason="Model does not output attentions")

Contributor Author:

yeah I hesitated, I didn't want to add yet another test for 400+ models 🫣 but you may be right

Member:

haha right!

@ArthurZucker (Collaborator) left a comment:

great catch actually! 🤗


molbap commented Feb 6, 2026

I want to think that great minds think alike 👀 @Cyrilvallez's PR #43765 solves more problems with check_model_inputs

@molbap molbap closed this Feb 6, 2026


Development

Successfully merging this pull request may close these issues.

[v5 regression] CLIPVisionModel.forward returns hidden_states=None even when output_hidden_states=True
