Adding support for bf16_full_eval by bhargaveede · Pull Request #610 · huggingface/optimum-habana

bhargaveede · 2023-12-21T09:29:32Z

What does this PR do?

Fixes # (issue)

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

HuggingFaceDocBuilderDev · 2023-12-21T09:35:14Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

bhargaveede · 2023-12-21T13:30:27Z

@regisss I have enabled bf16_full_eval and verified the same with t5 inference.
Can you review it?

vivekgoe · 2023-12-22T07:49:27Z

@bhargaveede please also update (1) summarization README file to include this extra argument for t5-3b prediction example, (2) performance targets in test for both G1 and G2.

bhargaveede · 2023-12-27T06:18:42Z

@regisss Can your review and merge this?

bhargaveede · 2024-01-02T04:37:25Z

@regisss Can we merge this?

vivekgoe · 2024-01-02T05:49:04Z

Looks good to me. We can merge it once @regisss also reviews.

regisss · 2024-01-02T09:18:46Z

I'll review it this week 👍

regisss · 2024-01-02T19:13:09Z

            ("facebook/bart-large-cnn", "Habana/bart", 4.691, 26.0688, 2, 1),
-            ("t5-3b", "Habana/t5", 2.28, 21.56, 2, 1),
+            ("t5-3b", "Habana/t5", 2.88, 21.56, 2, 1),
        ],
    }
 else:
    # Gaudi1 CI baselines
    MODELS_TO_TEST = {
        "bf16": [
            ("facebook/bart-large-cnn", "Habana/bart", 2.588, 26.0688, 2, 1),
-            ("t5-3b", "Habana/t5", 0.585, 21.72, 2, 1),
+            ("t5-3b", "Habana/t5", 0.98, 21.56, 2, 1),


For Gaudi1, I get a RougeLsum of 21.3831 and a throughput of 1.005. It doesn't matter much since the test passes (no need to update the numbers).
For Gaudi2 however, runs are not reproducible it seems. I get different RougeLsum from one run to another, is it something you also observed?

I didn't get different RougeLsum. When I added perf numbers, I ran the test twice to check and I was getting same RogueLsum. Let me check again.

Interesting, did you run it with Synapse 1.13?

I could see the variation. However, I'm seeing variation on v1.9-release too for the test "test_run_summarization_t5-small_multi_card". Can you confirm if it's same on your end

I cannot run multi-card tests on my Gaudi2 instance at the moment but if you observed the same behavior for "test_run_summarization_t5-small_multi_card" it means that this "issue" was already there before.
Anyway, tests still pass so I'm going to merge it and I'll investigate that later.

Adding support for bf16_full_eval

c23fca0

bhargaveede requested a review from regisss as a code owner December 21, 2023 09:29

bhargaveede added the run-test Run CI for PRs from external contributors label Dec 21, 2023

bhargaveede requested a review from vivekgoe December 21, 2023 09:29

vivekgoe reviewed Dec 22, 2023

View reviewed changes

Comment thread optimum/habana/transformers/trainer.py

Comment thread optimum/habana/transformers/trainer.py

Bhargav added 2 commits December 27, 2023 07:45

Adding changes for converting model for fp16 flag

44dfc97

Changing perf numbers

9f62538

bhargaveede added run-test Run CI for PRs from external contributors and removed run-test Run CI for PRs from external contributors labels Dec 27, 2023

vivekgoe approved these changes Jan 2, 2024

View reviewed changes

regisss reviewed Jan 2, 2024

View reviewed changes

regisss approved these changes Jan 3, 2024

View reviewed changes

regisss merged commit dd02a7b into huggingface:main Jan 3, 2024

jychen21 pushed a commit to jychen21/optimum-habana that referenced this pull request Feb 27, 2024

Adding support for bf16_full_eval (huggingface#610)

83dd5ef

Conversation

bhargaveede commented Dec 21, 2023

What does this PR do?

Before submitting

Uh oh!

HuggingFaceDocBuilderDev commented Dec 21, 2023

Uh oh!

bhargaveede commented Dec 21, 2023

Uh oh!

Uh oh!

Uh oh!

vivekgoe commented Dec 22, 2023

Uh oh!

bhargaveede commented Dec 27, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

bhargaveede commented Jan 2, 2024

Uh oh!

vivekgoe commented Jan 2, 2024

Uh oh!

regisss commented Jan 2, 2024

Uh oh!

regisss Jan 2, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

bhargaveede Jan 3, 2024

Choose a reason for hiding this comment

Uh oh!

regisss Jan 3, 2024

Choose a reason for hiding this comment

Uh oh!

bhargaveede Jan 3, 2024

Choose a reason for hiding this comment

Uh oh!

regisss Jan 3, 2024

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

bhargaveede commented Dec 27, 2023 •

edited

Loading

regisss Jan 2, 2024 •

edited

Loading