Fix some TF GPT-J CI testings#16454
Conversation
|
I need to check why causes problems in other tests (torch, pipeline etc ...) |
|
The documentation is not available anymore as the PR was closed or merged. |
|
Regarding the |
OK, thank you for the feedback. But just curious (off-topic): (not very sure, but I remembered before TF can generate short sequences too. And if TF can't stop earlier, it looks like a quite big drawback ..? Anyway, we shouldn't discuss this generation thing in this PR.) |
The plan we have for the refactoring does not mention extras like the stopping criteria, so I can only tell that it probably won't happen in the next 2-3 months :) We can generate short sentences with TF if we pass the |
What does this PR do?
Fix some TF-GPT-J CI testing (scheduled)
test_mixed_precision: require some castingtest_saved_model_creationandtest_saved_model_creation_extended: requireshape_listinstead ofshapetest_model_from_pretrained: skip for now otherwise GPU OOMWith the changes this PR, only the following test fails:
test_gptj_sample_max_time: for exampletransformers/tests/gptj/test_modeling_tf_gptj.py
Line 413 in c85547a
the PT gives a quite short generation sequence (say 19), while TF gives a sequence of length 256, and it takes much more time and therefore fails the tests.
I feel this remaining issue is better to be addressed in another PR.