Fix past CI #20967

ydshieh · 2023-01-02T14:07:12Z

What does this PR do?

I tried to launch Past CI (the 2nd round) after #20861, but there are some more fixes required: Past CI images don't install other dependencies, and we need more decorators to skip some tests if they are not installed.

ydshieh · 2023-01-02T14:11:39Z

utils/past_ci_versions.py


 past_versions_testing = {
    "pytorch": {
+        "1.12": {


For building the Past CI docker image with torch 1.12.x

ydshieh · 2023-01-02T14:13:45Z

tests/trainer/test_trainer.py

            self.assertAlmostEqual(b, b1, delta=1e-5)

    @slow
+    @require_accelerate


This test requires accelerate, which is installed in daily CI image, but not past CI images.

(So far the past CI avoids installing many other dependencies, but this could be changed in the future)

ydshieh · 2023-01-02T14:14:48Z

tests/pipelines/test_pipelines_object_detection.py

        )

    @require_torch
+    @require_pytesseract


(So far) Past CI images don't install pytesseract

ydshieh · 2023-01-02T14:16:01Z

tests/models/layoutlmv2/test_tokenization_layoutlmv2.py

                self.assertEqual(sum(tokens_with_offsets["special_tokens_mask"]), added_tokens)

    @require_torch
+    @require_detectron2


(So far) Past CI images don't install detectron2

ydshieh · 2023-01-02T14:19:54Z

tests/models/bert/test_tokenization_bert_tf.py

    import tensorflow as tf

+    if is_tensorflow_text_available():
+        from transformers.models.bert import TFBertTokenizer


This import will trigger

import tensorflow as tf

in src/transformers/models/bert/tokenization_bert_tf.py

ydshieh · 2023-01-02T14:20:07Z

tests/models/gpt2/test_tokenization_gpt2_tf.py

 if is_tf_available():
    import tensorflow as tf

+    if is_keras_nlp_available():


HuggingFaceDocBuilderDev · 2023-01-02T14:21:23Z

The documentation is not available anymore as the PR was closed or merged.

ydshieh · 2023-01-02T14:24:27Z

src/transformers/__init__.py

 # Tensorflow-text-specific objects
 try:
-    if not is_tensorflow_text_available():
+    if not (is_tensorflow_text_available() and is_tf_available()):


I don't like this change much, but the import of TFBertTokenizer and TFGPT2Tokenizer also requires tensorflow too.

cc @Rocketknight1

This should be avoid as it then screws up the dummy creation (you have an old dummy file that should be removed manually). is_tensorflow_text_available() should probably return False if is_tf_available() is false.

sgugger

Thanks for fixing those. I'm okay with most changes except for the new checks (the existing one should check on the others basically)

sgugger · 2023-01-03T14:36:46Z

src/transformers/__init__.py

 # Tensorflow-text-specific objects
 try:
-    if not is_tensorflow_text_available():
+    if not (is_tensorflow_text_available() and is_tf_available()):


This should be avoid as it then screws up the dummy creation (you have an old dummy file that should be removed manually). is_tensorflow_text_available() should probably return False if is_tf_available() is false.

src/transformers/__init__.py

ydshieh · 2023-01-03T15:08:27Z

tests/models/bert/test_tokenization_bert_tf.py

 if is_tf_available():
    import tensorflow as tf

+    if is_tensorflow_text_available():


@sgugger As is_tensorflow_text_available already contains the check for TF, should I revert the change here? Or it's fine to keep it this way?

I think it's fine either way, but why this change?

In current main branch, is_tensorflow_text_available condition is not inside is_tf_available. In past CI image build dockerfile, we install .[dev] then uninstall tensorflow, so tensorflow_text is there but tensorflow is removed. And this causes some tensorflow_text-related tests failed (TFBertTokenizer file will import tensorflow)

In the 1st version of the PR, I keep is_tensorflow_text_available as it is, and use the combination of is_tensorflow_text_available() and is_tensorflow_available(). The change at these 2 lines was necessary at that time.

@sgugger said we should instead change the definition of is_tensorflow_text_available to include is_tensorflow_available directly. After applying that suggestion, we no longer need to change the 2 lines here.

You can unindent this block if you want but honestly we don't really care. The tensorflow import needs to stay as it's used in the file.

ydshieh · 2023-01-03T15:09:01Z

tests/models/bert/test_tokenization_bert_tf.py

            return out["pooler_output"]


+@require_tf


same question - we don't need this anymore, and can keep it is as in current main.

LysandreJik

Great, thank you @ydshieh!

LysandreJik · 2023-01-11T14:03:49Z

tests/models/bert/test_tokenization_bert_tf.py

 if is_tf_available():
    import tensorflow as tf

+    if is_tensorflow_text_available():


I think it's fine either way, but why this change?

ydshieh commented Jan 2, 2023

View reviewed changes

ydshieh requested a review from sgugger January 3, 2023 14:34

ydshieh marked this pull request as ready for review January 3, 2023 14:35

ydshieh requested a review from LysandreJik January 3, 2023 14:35

sgugger reviewed Jan 3, 2023

View reviewed changes

sgugger approved these changes Jan 3, 2023

View reviewed changes

ydshieh commented Jan 3, 2023

View reviewed changes

LysandreJik approved these changes Jan 11, 2023

View reviewed changes

ydshieh added 4 commits January 12, 2023 17:09

Fix for Past CI

bc6df97

make style

838b70e

clean up

3d3f0cd

unindent 2 blocks

bdb6fd2

ydshieh force-pushed the fix_past_ci branch from 5f8fc11 to bdb6fd2 Compare January 12, 2023 16:10

ydshieh merged commit b3a0aad into main Jan 12, 2023

ydshieh deleted the fix_past_ci branch January 12, 2023 17:04

Fix past CI #20967

Fix past CI #20967

Uh oh!

Conversation

ydshieh commented Jan 2, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

HuggingFaceDocBuilderDev commented Jan 2, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sgugger left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ydshieh Jan 11, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

LysandreJik left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

ydshieh commented Jan 2, 2023 •

edited

Loading

HuggingFaceDocBuilderDev commented Jan 2, 2023 •

edited

Loading

ydshieh Jan 11, 2023 •

edited

Loading