Fix some TF GPT-J CI testings by ydshieh · Pull Request #16454 · huggingface/transformers

ydshieh · 2022-03-28T16:48:55Z

What does this PR do?

Fix some TF-GPT-J CI testing (scheduled)

test_mixed_precision: require some casting
test_saved_model_creation and test_saved_model_creation_extended: require shape_list instead of shape
test_model_from_pretrained: skip for now otherwise GPU OOM

With the changes this PR, only the following test fails: test_gptj_sample_max_time: for example

transformers/tests/gptj/test_modeling_tf_gptj.py

Line 413 in c85547a

model.generate(input_ids, do_sample=False, max_time=MAX_TIME, max_length=256)

the PT gives a quite short generation sequence (say 19), while TF gives a sequence of length 256, and it takes much more time and therefore fails the tests.

I feel this remaining issue is better to be addressed in another PR.

ydshieh · 2022-03-28T16:59:10Z

I need to check why

@unittest.skipIf(len(tf.config.list_physical_devices("GPU")) > 0, "skip testing on GPU for now to avoid GPU OOM.")

causes problems in other tests (torch, pipeline etc ...)

HuggingFaceDocBuilderDev · 2022-03-28T17:01:00Z

The documentation is not available anymore as the PR was closed or merged.

sgugger

Thanks for fixing!

gante

Thank for looking at these, @ydshieh 🙏

gante · 2022-03-28T20:21:44Z

Regarding the max_time/generate test -- when I first reviewed the GPT-J PR, I had little understanding of the current state of TF generate. Now I can tell that this test makes no sense :D Contrarily to PT's generate, TF generate has no time-based stopping criteria, so it is natural that the test fails. I'd remove it.

ydshieh · 2022-03-28T20:39:10Z

Regarding the max_time/generate test -- when I first reviewed the GPT-J PR, I had little understanding of the current state of TF generate. Now I can tell that this test makes no sense :D Contrarily to PT's generate, TF generate has no time-based stopping criteria, so it is natural that the test fails. I'd remove it.

OK, thank you for the feedback.

But just curious (off-topic): TF generate has no time-based stopping criteria --> do we plan to support this in the future ??

(not very sure, but I remembered before TF can generate short sequences too. And if TF can't stop earlier, it looks like a quite big drawback ..? Anyway, we shouldn't discuss this generation thing in this PR.)

gante · 2022-03-28T20:59:08Z

But just curious (off-topic): TF generate has no time-based stopping criteria --> do we plan to support this in the future ??

(not very sure, but I remembered before TF can generate short sequences too. And if TF can't stop earlier, it looks like a quite big drawback ..? Anyway, we shouldn't discuss this generation thing in this PR.)

The plan we have for the refactoring does not mention extras like the stopping criteria, so I can only tell that it probably won't happen in the next 2-3 months :) We can generate short sentences with TF if we pass the max_length argument, where generate() generates up to max_length tokens.

ydshieh added 3 commits March 28, 2022 11:03

Fix for test_mixed_precision

07b7a3e

Fix test_saved_model_creation by using shape_list instead of shape

293d815

skit test_model_from_pretrained on GPU for now to avoid GPU OOM

c85bc4f

ydshieh requested review from LysandreJik, Rocketknight1, gante and sgugger March 28, 2022 16:49

fix style

376c520

fix

e90f4b3

ydshieh marked this pull request as draft March 28, 2022 17:28

ydshieh added 2 commits March 28, 2022 19:32

fix

24017d1

fix style

09cf271

ydshieh changed the title ~~Fix tf gptj ci testings~~ Fix some TF GPT-J CI testings Mar 28, 2022

sgugger approved these changes Mar 28, 2022

View reviewed changes

gante approved these changes Mar 28, 2022

View reviewed changes

ydshieh marked this pull request as ready for review March 28, 2022 21:01

ydshieh added 2 commits March 28, 2022 23:06

remove redundant skip condition

8ae261e

skip for now

ec42c78

ydshieh merged commit 86cff21 into huggingface:main Mar 29, 2022

ydshieh deleted the fix_tf_gptj_ci_testings branch March 29, 2022 16:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix some TF GPT-J CI testings#16454

Fix some TF GPT-J CI testings#16454
ydshieh merged 9 commits intohuggingface:mainfrom
ydshieh:fix_tf_gptj_ci_testings

ydshieh commented Mar 28, 2022

Uh oh!

ydshieh commented Mar 28, 2022 •

edited

Loading

Uh oh!

HuggingFaceDocBuilderDev commented Mar 28, 2022 •

edited

Loading

Uh oh!

sgugger left a comment

Uh oh!

gante left a comment

Uh oh!

gante commented Mar 28, 2022

Uh oh!

ydshieh commented Mar 28, 2022 •

edited

Loading

Uh oh!

gante commented Mar 28, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

ydshieh commented Mar 28, 2022

What does this PR do?

Uh oh!

ydshieh commented Mar 28, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

HuggingFaceDocBuilderDev commented Mar 28, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sgugger left a comment

Choose a reason for hiding this comment

Uh oh!

gante left a comment

Choose a reason for hiding this comment

Uh oh!

gante commented Mar 28, 2022

Uh oh!

ydshieh commented Mar 28, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gante commented Mar 28, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

ydshieh commented Mar 28, 2022 •

edited

Loading

HuggingFaceDocBuilderDev commented Mar 28, 2022 •

edited

Loading

ydshieh commented Mar 28, 2022 •

edited

Loading