[MNLI example] Prevent overwriting matched with mismatched metrics by eldarkurtic · Pull Request #16475 · huggingface/transformers

eldarkurtic · 2022-03-29T10:43:28Z

What does this PR do?

The fix appends _mm to the metric names for the mnli-mm (mismatched) evaluation dataset so that they no longer overwrite the eval metrics from the mnli (matched) dataset, and writes both sets together in eval_results.json and all_results.json. The MNLI metrics now look like this:

{
    "eval_accuracy": 0.36067244014263883,
    "eval_accuracy_mm": 0.35506509357200977,
    "eval_loss": 1.158889889717102,
    "eval_loss_mm": 1.1670204401016235,
    "eval_runtime": 16.4691,
    "eval_runtime_mm": 15.9496,
    "eval_samples": 9815,
    "eval_samples_mm": 9832,
    "eval_samples_per_second": 595.964,
    "eval_samples_per_second_mm": 616.44,
    "eval_steps_per_second": 18.641,
    "eval_steps_per_second_mm": 19.311
}
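As a minimal sketch of the behavior described above (not the actual run_glue.py code), the snippet below shows why the suffix is needed. The helper name `combine_mnli_metrics` and the shortened, rounded metric values are hypothetical and only serve the illustration.

```python
# Hypothetical illustration; this is not the code from run_glue.py.
def combine_mnli_metrics(results):
    """Merge per-task metric dicts, suffixing mnli-mm keys with "_mm"."""
    combined = {}
    for task, metrics in results:
        if task == "mnli-mm":
            # Rename every key so it cannot collide with the matched metrics.
            metrics = {k + "_mm": v for k, v in metrics.items()}
        if "mnli" in task:
            combined.update(metrics)
    return combined


# Rounded, illustrative values for the matched and mismatched sets.
matched = {"eval_accuracy": 0.3607, "eval_loss": 1.1589}
mismatched = {"eval_accuracy": 0.3551, "eval_loss": 1.1670}

# Without the suffix, the second update silently replaces the first.
overwritten = {}
overwritten.update(matched)
overwritten.update(mismatched)

combined = combine_mnli_metrics([("mnli", matched), ("mnli-mm", mismatched)])
print(sorted(combined))
# ['eval_accuracy', 'eval_accuracy_mm', 'eval_loss', 'eval_loss_mm']
```

Because both evaluation sets report identical key names, a plain `dict.update` keeps only the last set written, which is the overwrite this PR prevents.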

@sgugger, @patil-suraj

HuggingFaceDocBuilderDev · 2022-03-29T11:09:57Z

The documentation is not available anymore as the PR was closed or merged.

sgugger

Thanks a lot for your PR! I just have two comments and it should be good to be merged.

Also make sure to run make style on your branch when you're done so the quality check passes.

examples/pytorch/text-classification/run_glue.py

sgugger · 2022-03-29T11:20:48Z

+            if task == "mnli-mm":
+                metrics = {k + "_mm": v for k, v in metrics.items()}
+            if "mnli" in task:
+                combined.update(metrics)


I'd regroup this slightly differently:

Suggested change

-            if task == "mnli-mm":
-                metrics = {k + "_mm": v for k, v in metrics.items()}
-            if "mnli" in task:
-                combined.update(metrics)
+            if task == "mnli-mm":
+                combined.update({k + "_mm": v for k, v in metrics.items()})
+            elif task == "mnli":
+                combined.update(metrics)

eldarkurtic

With this regrouping, trainer.log_metrics("eval", metrics) would no longer receive the "_mm" keys, so the stdout would be the same as before (that's the main reason why the keys in metrics are updated). Given this, should we still regroup, so that only the JSON log files are affected and not the stdout?

sgugger

Ah, understood. LGTM as is then :-)
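The difference between the two variants can be sketched as follows. This is an illustration under stated assumptions: `log_metrics` is a hypothetical stand-in for trainer.log_metrics that only records the keys it receives, the function names `as_merged` and `as_suggested` are made up, and the logging call is assumed to come after the update, as the discussion implies.

```python
# Hypothetical stand-in for trainer.log_metrics: records the keys it is given.
logged = []


def log_metrics(split, metrics):
    logged.append((split, sorted(metrics)))


def as_merged(task, metrics, combined):
    # Variant kept in the PR: rename keys first, so both the
    # logged metrics and the combined dict carry the "_mm" suffix.
    if task == "mnli-mm":
        metrics = {k + "_mm": v for k, v in metrics.items()}
    if "mnli" in task:
        combined.update(metrics)
    log_metrics("eval", metrics)


def as_suggested(task, metrics, combined):
    # Suggested regrouping: only the combined dict gets "_mm" keys,
    # while the metrics passed to logging keep their original names.
    if task == "mnli-mm":
        combined.update({k + "_mm": v for k, v in metrics.items()})
    elif task == "mnli":
        combined.update(metrics)
    log_metrics("eval", metrics)


combined_a, combined_b = {}, {}
mismatched = {"eval_accuracy": 0.3551}  # rounded, illustrative value

as_merged("mnli-mm", mismatched, combined_a)
as_suggested("mnli-mm", mismatched, combined_b)

print(logged)
# [('eval', ['eval_accuracy_mm']), ('eval', ['eval_accuracy'])]
```

Both variants produce the same combined dictionary for the JSON files; they differ only in the key names that reach the logging call, which is why the original grouping was kept.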

sgugger · 2022-03-29T14:38:11Z

The CI failure is unrelated to this PR (actually investigating it right now), so we can merge safely :-)

Prevent overwriting matched with mismatched metrics

3e5bd4b

sgugger approved these changes Mar 29, 2022

Fix style

cccfcc0

sgugger merged commit 5216607 into huggingface:main Mar 29, 2022

eldarkurtic mentioned this pull request Mar 29, 2022

Fix overwriting of MNLI metrics neuralmagic/sparseml#654

Merged

sgugger mentioned this pull request Mar 29, 2022

Fix example test and test_fetcher for examples #16478

Merged

sgugger merged 2 commits into huggingface:main from eldarkurtic:fix-mnli-example
