Add TF_MODEL_FOR_SEMANTIC_SEGMENTATION_MAPPING #18469

ydshieh merged 7 commits into huggingface:main

Conversation
The documentation is not available anymore as the PR was closed or merged.
    self.assertTrue(loss.shape.as_list() == expected_loss_size or loss.shape.as_list() == [1])

    def check_keras_fit_results(self, val_loss1, val_loss2, atol=1e-2, rtol=1e-3):
For TFSegformerForSemanticSegmentation, we need higher tolerances. See the comment for the change in that model file.
Adding check_keras_fit_results here avoids overwriting test_keras_fit entirely; a rough sketch of the pattern follows.
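As a rough illustration of that pattern (simplified names, not the actual test_modeling_tf_common.py code), a base test mixin can route the comparison through an overridable hook, so a model-specific test class only changes the tolerances:

```python
import numpy as np


class KerasFitMixinSketch:
    """Illustrative mixin: test_keras_fit delegates the loss comparison to a hook."""

    def check_keras_fit_results(self, val_loss1, val_loss2, atol=1e-2, rtol=1e-3):
        # Default tolerances used by most models.
        assert np.allclose(val_loss1, val_loss2, atol=atol, rtol=rtol)

    def test_keras_fit(self):
        # ... build the model and run model.fit() twice with equivalent inputs ...
        val_loss1, val_loss2 = 0.51, 0.52  # placeholder losses for the sketch
        self.check_keras_fit_results(val_loss1, val_loss2)


class SegformerTestSketch(KerasFitMixinSketch):
    # Override only the hook; the inherited test_keras_fit stays untouched.
    def check_keras_fit_results(self, val_loss1, val_loss2, atol=2e-1, rtol=2e-1):
        super().check_keras_fit_results(val_loss1, val_loss2, atol=atol, rtol=rtol)
```

SegformerTestSketch().test_keras_fit() then runs the shared test body with the looser tolerances.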
    def test_dataset_conversion(self):
        super().test_dataset_conversion()

    def check_keras_fit_results(self, val_loss1, val_loss2, atol=2e-1, rtol=2e-1):
Use a higher tolerance for TFSegformerForSemanticSegmentation: this model
- has a BatchNormalization layer,
- also has several dropout layers,
- and has a TFSegformerDropPath layer, which performs a random operation during training.

Together these factors cause the moving average and moving variance statistics to differ between runs, and we end up with a larger validation loss.
I have found that we need larger tolerances for dense prediction tasks like semantic segmentation. It was the case for TFData2VecVisionForSemanticSegmentation as well.
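For context on the DropPath point, here is a minimal stochastic-depth layer (an illustrative sketch, not the actual TFSegformerDropPath code); it randomly drops the whole branch per sample, but only while training:

```python
import tensorflow as tf


class DropPathSketch(tf.keras.layers.Layer):
    """Stochastic depth: randomly zero the whole branch for some samples."""

    def __init__(self, drop_prob: float = 0.1, **kwargs):
        super().__init__(**kwargs)
        self.drop_prob = drop_prob

    def call(self, x: tf.Tensor, training: bool = False) -> tf.Tensor:
        if not training or self.drop_prob == 0.0:
            return x  # deterministic at inference time
        keep_prob = 1.0 - self.drop_prob
        # One Bernoulli draw per sample, broadcast over the remaining dims.
        shape = (tf.shape(x)[0],) + (1,) * (len(x.shape) - 1)
        mask = tf.floor(keep_prob + tf.random.uniform(shape, 0, 1, dtype=x.dtype))
        return (x / keep_prob) * mask
```

Because a fresh mask is sampled on every training step, two otherwise identical fit() runs end up with slightly different losses, which is part of why the looser atol/rtol is needed.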
      def call(self, hidden_states: tf.Tensor, training: bool = False) -> tf.Tensor:
          hidden_states = self.dense(hidden_states)
-         hidden_states = self.dropout(hidden_states)
+         hidden_states = self.dropout(hidden_states, training=training)
We still pass this argument, right?
We can avoid it, since these are supposed to be set automatically during training by the Keras engine. But I find that specifying it explicitly gives me peace of mind.
Thank you for the information!
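A minimal sketch of the pattern being discussed (an illustrative layer, not the actual Segformer module): the outer layer accepts training and forwards it explicitly to sub-layers whose behavior differs between training and inference.

```python
import tensorflow as tf


class MLPBlockSketch(tf.keras.layers.Layer):
    def __init__(self, hidden_size: int = 64, drop_rate: float = 0.1, **kwargs):
        super().__init__(**kwargs)
        self.dense = tf.keras.layers.Dense(hidden_size)
        self.dropout = tf.keras.layers.Dropout(drop_rate)

    def call(self, hidden_states: tf.Tensor, training: bool = False) -> tf.Tensor:
        hidden_states = self.dense(hidden_states)
        # Keras can propagate `training` implicitly through the call context,
        # but forwarding it explicitly makes the dropout behavior unambiguous.
        hidden_states = self.dropout(hidden_states, training=training)
        return hidden_states
```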
      hidden_states = self.linear_fuse(tf.concat(all_hidden_states[::-1], axis=-1))
-     hidden_states = self.batch_norm(hidden_states)
+     hidden_states = self.batch_norm(hidden_states, training=training)
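For reference, a toy example (illustrative shapes) of why the flag matters for BatchNormalization: with training=True the layer normalizes with the current batch statistics and updates its moving averages, while with training=False it uses the stored moving statistics.

```python
import tensorflow as tf

bn = tf.keras.layers.BatchNormalization()
x = tf.random.normal((8, 16))

y_train = bn(x, training=True)   # batch statistics; updates moving_mean / moving_variance
y_infer = bn(x, training=False)  # uses the accumulated moving statistics instead

print(bn.moving_mean.numpy()[:4])  # typically non-zero after the training-mode call
```

This is the moving-statistics behavior referenced in the tolerance discussion above.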
Test failures are

Thank you, @ydshieh, for this. I appreciate the help.
sgugger left a comment:
Thanks a lot for fixing this!
amyeroberts left a comment:
Looks good! Thanks for the fix ❤️
What does this PR do?

The original goal is to fix TFSegformerModelTest.test_keras_fit, but it ends up with the following changes:

- Add TF_MODEL_FOR_SEMANTIC_SEGMENTATION_MAPPING to some __init__ files.
- Pass training arguments in a few layers of TFSegformerModel.
- Extend _prepare_for_class to deal with 2 more image tasks.
- Fix the TFData2VecVisionForSemanticSegmentation loss: we need the batch dimension (without this, test_dataset_conversion failed - this was previously skipped due to the lack of labels). A sketch of this point follows.
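To illustrate that last point, a toy sketch (illustrative shapes, not the actual loss code in the library): keeping the batch dimension means reducing the per-pixel loss over the spatial axes only, rather than collapsing it to a scalar, so downstream utilities such as the dataset-conversion test see a per-sample loss.

```python
import tensorflow as tf

loss_fn = tf.keras.losses.SparseCategoricalCrossentropy(
    from_logits=True, reduction=tf.keras.losses.Reduction.NONE
)

logits = tf.random.normal((2, 4, 4, 3))                        # (batch, height, width, num_labels)
labels = tf.random.uniform((2, 4, 4), 0, 3, dtype=tf.int32)    # per-pixel class ids

per_pixel_loss = loss_fn(labels, logits)                       # shape (2, 4, 4)
per_sample_loss = tf.reduce_mean(per_pixel_loss, axis=[1, 2])  # shape (2,): keeps the batch dim
scalar_loss = tf.reduce_mean(per_pixel_loss)                   # shape (): drops the batch dim
```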