Run Llama2 with torch.compile on Gaudi2 by kausikmaiti · Pull Request #616 · huggingface/optimum-habana

kausikmaiti · 2023-12-28T06:57:45Z

What does this PR do?

Fixes # (issue)

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

Signed-off-by: kausik <kmaiti@habana.ai>

vivekgoe · 2024-01-01T08:04:06Z

@kausikmaiti looks good to me. @regisss please review and help merge this if it looks ok to you. Thanks.

HuggingFaceDocBuilderDev · 2024-01-01T08:06:32Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

libinta

@kausikmaiti I have following questions

what's the default cmd for llama2? torch compile or use_lazy_mode? if the default is torch.compile, please also update the readme
what's the performance comparison with use_lazy_mode and use_hpu_graph?
any depenency to the to-be-released docker?

kausikmaiti · 2024-01-04T05:54:50Z

@kausikmaiti I have following questions

what's the default cmd for llama2? torch compile or use_lazy_mode? if the default is torch.compile, please also update the readme

what's the performance comparison with use_lazy_mode and use_hpu_graph?

any depenency to the to-be-released docker?

@libinta, Please find my answers below.

Here, I am just adding support for torch.compile. But, default mode is still unchanged. So, we don't need to update readme here.
This is the very first step of (functional) enablement. Performance data is not yet available.
Yes. It has dependency on the content of release 1.14.

bhargaveede · 2024-01-04T09:26:22Z

    model, tokenizer, generation_config = initialize_model(args, logger)

+    use_lazy_mode = True
+    if args.torch_compile:


We could avoid this by using args.torch_compile directly in 312 and 492 with lazy_mode = not args.torch_compile.

Yes, that can be done of course. But, I wanted to have the decision making about use_lazy_mode in one place for all. A secondary reason is that I plan to include additional checks in future PR to control use_lazy_mode.

regisss · 2024-01-04T18:01:39Z

Yes. It has dependency on the content of release 1.14.

So we should not merge it before 1.14 is released @libinta right?

bhargaveede · 2024-01-05T06:30:21Z

            model = wrap_in_hpu_graph(model)
+
+    if args.torch_compile:
+        model = get_torch_compiled_model(model)


@kausikmaiti Can we add model specific check as generation using torch.compile isn't verified on models

Added model specific check in separate commit. Please review.

…lama2 Signed-off-by: kausik <kmaiti@habana.ai>

regisss · 2024-01-11T10:19:10Z

@kausikmaiti Let's also add a test to: https://github.com/huggingface/optimum-habana/blob/main/tests/test_text_generation_example.py

You can define a new test at the end of this file:

@pytest.mark.parametrize("model_name, baseline", MODELS_TO_TEST["torch_compile"])
def test_text_generation_torch_compile(model_name: str, baseline: float, token: str):
    _test_text_generation(model_name, baseline, token, torch_compile=True)

and adding a new torch_compile entry in the dict here:

optimum-habana/tests/test_text_generation_example.py

Line 15 in 979c132

MODELS_TO_TEST = {

Signed-off-by: kausik <kmaiti@habana.ai>

kalyanjk · 2024-01-22T10:29:24Z

os.environ["WORLD_SIZE"] = "0" ? WORLD_SIZE should be set to 1 for 1x runs

As you mentioned offline, WORLD_SIZE setting does not matter, as I'm not using deepspeed / gaudi_spawn.py script.
Also as per my observation, if I don't set WORLD_SIZE=0, due to the logic like "use_deepspeed = args.world_size > 0", setup_distributed_model() gets called and the test fails at very early stage while importing deepspeed. This is not the expectation.

Run Llama2 with torch.compile on Gaudi2

e8c6162

Signed-off-by: kausik <kmaiti@habana.ai>

kausikmaiti requested a review from regisss as a code owner December 28, 2023 06:57

kausikmaiti mentioned this pull request Dec 28, 2023

Run Llama2 with torch.compile on Gaudi2 #605

Closed

vivekgoe added the run-test Run CI for PRs from external contributors label Jan 1, 2024

vivekgoe self-requested a review January 1, 2024 08:03

vivekgoe requested a review from libinta January 1, 2024 08:04

vivekgoe approved these changes Jan 2, 2024

View reviewed changes

libinta reviewed Jan 2, 2024

View reviewed changes

regisss reviewed Jan 3, 2024

View reviewed changes

Comment thread examples/text-generation/utils.py

bhargaveede reviewed Jan 4, 2024

View reviewed changes

libinta added the synapse1.14 label Jan 5, 2024

bhargaveede reviewed Jan 5, 2024

View reviewed changes

kausikmaiti and others added 2 commits January 7, 2024 12:27

Added model specific check to enable torch.compile support only for L…

e13946f

…lama2 Signed-off-by: kausik <kmaiti@habana.ai>

Merge branch 'main' into llama2_with_torch_compile_on_gaudi2_1

11239b2

Added a test

03596e6

Signed-off-by: kausik <kmaiti@habana.ai>

kalyanjk reviewed Jan 22, 2024

View reviewed changes

regisss approved these changes Jan 23, 2024

View reviewed changes

regisss added run-test Run CI for PRs from external contributors and removed run-test Run CI for PRs from external contributors labels Jan 23, 2024

regisss merged commit 1ecc732 into huggingface:main Jan 23, 2024

jychen21 pushed a commit to jychen21/optimum-habana that referenced this pull request Feb 27, 2024

Run Llama2 with torch.compile on Gaudi2 (huggingface#616)

8b81d91

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Run Llama2 with torch.compile on Gaudi2#616

Run Llama2 with torch.compile on Gaudi2#616
regisss merged 4 commits into
huggingface:mainfrom
kausikmaiti:llama2_with_torch_compile_on_gaudi2_1

kausikmaiti commented Dec 28, 2023

Uh oh!

vivekgoe commented Jan 1, 2024

Uh oh!

HuggingFaceDocBuilderDev commented Jan 1, 2024

Uh oh!

libinta left a comment

Uh oh!

Uh oh!

kausikmaiti commented Jan 4, 2024

Uh oh!

bhargaveede Jan 4, 2024

Uh oh!

kausikmaiti Jan 5, 2024

Uh oh!

regisss commented Jan 4, 2024

Uh oh!

bhargaveede Jan 5, 2024

Uh oh!

kausikmaiti Jan 7, 2024

Uh oh!

regisss commented Jan 11, 2024

Uh oh!

kalyanjk Jan 22, 2024

Uh oh!

kausikmaiti Jan 22, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

Conversation

kausikmaiti commented Dec 28, 2023

What does this PR do?

Before submitting

Uh oh!

vivekgoe commented Jan 1, 2024

Uh oh!

HuggingFaceDocBuilderDev commented Jan 1, 2024

Uh oh!

libinta left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

kausikmaiti commented Jan 4, 2024

Uh oh!

bhargaveede Jan 4, 2024

Choose a reason for hiding this comment

Uh oh!

kausikmaiti Jan 5, 2024

Choose a reason for hiding this comment

Uh oh!

regisss commented Jan 4, 2024

Uh oh!

bhargaveede Jan 5, 2024

Choose a reason for hiding this comment

Uh oh!

kausikmaiti Jan 7, 2024

Choose a reason for hiding this comment

Uh oh!

regisss commented Jan 11, 2024

Uh oh!

kalyanjk Jan 22, 2024

Choose a reason for hiding this comment

Uh oh!

kausikmaiti Jan 22, 2024

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants