Fix inverted conditional in TF common test! by Rocketknight1 · Pull Request #22540 · huggingface/transformers

Rocketknight1 · 2023-04-03T16:02:32Z

Noticed a rather alarming conditional being backwards in the test_pt_tf_model_equivalence common test. This probably resulted in a lot of tests being skipped!


Rocketknight1 · 2023-04-03T16:20:03Z

As expected this has raised a few bugs in the cross-test that were silent before - I'll see what I can do in this PR

gante

The change makes sense!

Re broken tests (which probably need to be fixed/skipped before merging) -- it means that the loss calculation has issues, correct?

Rocketknight1 · 2023-04-03T17:17:15Z

Most likely - I'll investigate them all soon!


Rocketknight1 · 2023-04-04T17:13:24Z

Quick summary of the fixes needed:

ESM: TFEsmForTokenClassification copied the computation from TFBertForTokenClassification, but this has some slightly odd BERT-specific behaviour and doesn't mask -100 in the same way as other models. Replaced it with the loss block from TFRobertaForTokenClassification and all tests pass.
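The masking convention mentioned above can be sketched without any framework. This is a minimal illustration, not the actual loss block from either model class: the function name `masked_token_loss` and the list-based inputs are assumptions made for the example. It only shows what "masking -100" means, namely that positions labelled -100 contribute nothing to the mean loss.

```python
import math

IGNORE_INDEX = -100  # label value that marks a position as "do not score"


def masked_token_loss(logits, labels, ignore_index=IGNORE_INDEX):
    """Mean cross-entropy over positions whose label is not ignore_index.

    logits: list of per-token score lists, labels: list of ints.
    Hypothetical helper for illustration only.
    """
    total, count = 0.0, 0
    for row_logits, label in zip(logits, labels):
        if label == ignore_index:
            continue  # masked position: skipped entirely
        # Numerically stable log-sum-exp for the softmax normaliser.
        m = max(row_logits)
        log_z = m + math.log(sum(math.exp(x - m) for x in row_logits))
        total += log_z - row_logits[label]
        count += 1
    return total / count if count else 0.0


example_logits = [[0.0, 0.0], [5.0, -5.0], [0.0, 0.0]]
example_labels = [0, IGNORE_INDEX, 1]
example_loss = masked_token_loss(example_logits, example_labels)
```

Changing the logits of the masked middle token leaves `example_loss` unchanged, which is the property two implementations must share before their losses can be compared.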

GPT2: For model classes that take rank-3 inputs (e.g. MultipleChoice or DoubleHeads), when output_hidden_states=True, inputs have their second two dims flattened internally in the main model stem. This means that the output hidden_states are rank-3 (bsz, seq_len * num_choices, hidden_dim) and not rank-4 (bsz, num_choices, seq_len, hidden_dim). However, the PT model un-flattens the output for the final hidden_states, which means the last hidden state is rank-4, unlike the others, which remain rank-3. In the old TF model, all hidden states were rank-3. I modified the TF code to un-flatten the last hidden state in the same way.
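To make the shape mismatch concrete, here is a small sketch using plain nested lists. It follows the shapes exactly as described in the comment above and is not taken from the transformers code; the helpers `unflatten_choices` and `shape_of` are hypothetical names introduced for the example.

```python
def unflatten_choices(hidden, num_choices):
    """Reshape (bsz, num_choices * seq_len, hidden_dim) nested lists
    into (bsz, num_choices, seq_len, hidden_dim)."""
    restored = []
    for sample in hidden:
        if len(sample) % num_choices != 0:
            raise ValueError("flattened length is not divisible by num_choices")
        seq_len = len(sample) // num_choices
        # Cut the flattened axis back into num_choices chunks of seq_len rows.
        restored.append(
            [sample[c * seq_len:(c + 1) * seq_len] for c in range(num_choices)]
        )
    return restored


def shape_of(nested):
    """Return the shape of a regular nested list as a tuple."""
    dims = []
    while isinstance(nested, list):
        dims.append(len(nested))
        nested = nested[0]
    return tuple(dims)


bsz, num_choices, seq_len, hidden_dim = 2, 3, 4, 5
flat_hidden = [
    [[0.0] * hidden_dim for _ in range(num_choices * seq_len)]
    for _ in range(bsz)
]
last_hidden = unflatten_choices(flat_hidden, num_choices)
```

With these toy sizes, `shape_of(flat_hidden)` is rank-3 and `shape_of(last_hidden)` is rank-4, so an equivalence test that compares shapes fails unless both frameworks apply the same final reshape.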

HUBERT: Loss computation, especially for CTC, overflows a lot with the default labels. This produces many inf values and makes it very hard to compare TF and PT losses. I skipped PT-TF equivalence testing for the losses, but kept it for all non-loss outputs.

Wav2Vec2: Same as HUBERT
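A rough sketch of why inf losses defeat a tolerance check, and of what skipping only the loss looks like. This is not the common test's code: `mismatched_outputs`, its `skip_keys` parameter, and the dict-of-lists output format are assumptions made for illustration.

```python
import math


def mismatched_outputs(pt_outputs, tf_outputs, skip_keys=("loss",), tol=1e-5):
    """Return the names of outputs whose values differ by more than tol.

    Outputs are dicts mapping a name to a flat list of floats.
    Hypothetical helper for illustration only.
    """
    mismatched = []
    for name, pt_vals in pt_outputs.items():
        if name in skip_keys:
            continue  # e.g. skip the loss but still check everything else
        tf_vals = tf_outputs.get(name)
        if tf_vals is None or len(tf_vals) != len(pt_vals):
            mismatched.append(name)
            continue
        # "not (diff <= tol)" also flags NaN differences such as inf - inf.
        if any(not (abs(a - b) <= tol) for a, b in zip(pt_vals, tf_vals)):
            mismatched.append(name)
    return mismatched


pt_out = {"loss": [math.inf], "logits": [0.1, 0.2]}
tf_out = {"loss": [math.inf], "logits": [0.1, 0.2]}

with_loss_skipped = mismatched_outputs(pt_out, tf_out)
with_loss_checked = mismatched_outputs(pt_out, tf_out, skip_keys=())
```

Even when both frameworks overflow to the same inf, the difference inf - inf is NaN, so a difference-based check cannot pass on the loss. Skipping that one key keeps the comparison meaningful for the remaining outputs.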

XGLM: The PT XGLM model does a weird thing where it shifts labels by 1 and then adds pad_token_id as the final label to all samples. I'm not sure this is correct, but I modified the TF code to do the same. It's possible the TF code is the right one here though, in which case we should revert it and change the PT code instead.
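The label handling described above can be written out on plain lists. This is a sketch of the behaviour as stated in the comment, not the model's code; `shift_labels_with_pad` is a hypothetical name and the pad id of 1 is an arbitrary value chosen for the example.

```python
def shift_labels_with_pad(labels, pad_token_id):
    """Shift each label row left by one and append pad_token_id at the end.

    labels: list of per-sample label lists. Hypothetical helper for
    illustration only.
    """
    return [row[1:] + [pad_token_id] for row in labels]


example_pad_id = 1  # arbitrary pad id, an assumption for this example
shifted = shift_labels_with_pad([[5, 6, 7], [8, 9, 10]], example_pad_id)
```

Each row keeps its length, so the logits and labels still line up position by position. The open question raised in the comment is whether the appended pad token should count towards the loss at all.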

Rocketknight1 · 2023-04-04T17:33:28Z

@gante I fixed all the bugs that this surfaced, explained above ^

cc @sgugger for final review too

sgugger

Thanks a lot for fixing the condition in the base test and all the subsequent failures.


* Fix inverted conditional in TF common test!
* Make the same change in the PT tests file
* Make sure hidden states for GPT2 have the same output shape in PT/TF
* Minor fix to PT implementation of token classification loss
* Skip loss equivalence test for TFHubert because it keeps overflowing to inf
* Compute LM loss for TF the (weird) way it's computed in PT
* Skip loss equivalence test for Wav2Vec2 for the same reason as Hubert
* Fix - don't try to access the hidden states property when output is a tuple

ydshieh · 2023-04-05T09:41:12Z

Thank you for the fix @Rocketknight1 ❤️ . And I apologize for the mistake I introduced ...


Rocketknight1 requested review from gante and ydshieh April 3, 2023 16:02

gante approved these changes Apr 3, 2023


Rocketknight1 added 4 commits April 4, 2023 17:09

Fix inverted conditional in TF common test!

3a816e7

Make the same change in the PT tests file

ee9b8c5

Make sure hidden states for GPT2 have the same output shape in PT/TF

1ce4421

Minor fix to PT implementation of token classification loss

12cbb74

Rocketknight1 force-pushed the fix_pt_tf_equivalence_test branch from df36a70 to 12cbb74 April 4, 2023 16:09

Rocketknight1 added 2 commits April 4, 2023 17:38

Skip loss equivalence test for TFHubert because it keeps overflowing to inf

b9d40b5

Compute LM loss for TF the (weird) way it's computed in PT

28cc650

Skip loss equivalence test for Wav2Vec2 for the same reason as Hubert

65f48a7

Rocketknight1 requested a review from sgugger April 4, 2023 17:33

sgugger approved these changes Apr 4, 2023


Fix - don't try to access the hidden states property when output is a tuple

2120e9d

Rocketknight1 merged commit edb704b into main Apr 4, 2023

Rocketknight1 deleted the fix_pt_tf_equivalence_test branch April 4, 2023 20:59

damianoamatruda mentioned this pull request Jan 24, 2025

Fix XGLM loss computation (PyTorch and TensorFlow) #35878

Merged


Rocketknight1 merged 8 commits into main from fix_pt_tf_equivalence_test
