[Gemma v2] Enable gemma v2 on gaudi by billishyahao · Pull Request #1280 · huggingface/optimum-habana

billishyahao · 2024-08-21T09:33:02Z

What does this PR do?

This patch aims to enable gemma2 both training and inferencing on gaudi device. Also static shape support is enabled as well. The performance results are shown as below:

python run_generation.py --model_name_or_path ../../../gemma-2-9b/ --bf16 --use_kv_cache --reuse_cache --use_hpu_graphs --max_input_tokens 128 --max_new_tokens 128 --bf16 --batch_size 4

Input/outputs:
input 1: ('DeepSpeed is a machine learning framework',)
output 1: ('DeepSpeed is a machine learning framework that enables training of large-scale deep learning models on a single GPU or across multiple GPUs. It is designed to be easy to use and highly scalable, making it a powerful tool for researchers and practitioners working with large-scale deep learning models.\n\nDeepSpeed is built on top of PyTorch, a popular deep learning framework, and provides a set of tools and libraries that make it easy to train large-scale models. It includes features such as zero-shot inference, which allows models to be used for inference without the need for retraining, and distributed training, which enables models to be trained across multiple GPUs.\n\nDeepSpeed is also',)


Stats:
--------------------------------------------------------------------------------------------------------------
Throughput (including tokenization) = 347.9053767194258 tokens/second
Number of HPU graphs                = 20
Memory allocated                    = 20.74 GB
Max memory allocated                = 20.82 GB
Total memory available              = 94.62 GB
Graph compilation duration          = 5.919518291004351 seconds
--------------------------------------------------------------------------------------------------------------

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

billishyahao · 2024-09-09T09:55:48Z

Fixed all comments @ssarkar2 feel free to review again.

billishyahao · 2024-10-08T03:44:30Z

Rebase the patch to address the conflict wth main.

billishyahao · 2024-10-08T06:24:44Z

@regisss Could you approve this ?

billishyahao · 2024-11-13T04:54:37Z

Rebase the patch to address the conflict wth main at 11/13

Luca-Calabria · 2024-11-13T10:21:18Z

Hi @billishyahao @libinta I would like to help merging this PR. It seems it needs just a rebase and if needed fixes after rebasing.

emascarenhas · 2024-11-13T15:26:45Z

@ssarkar2 , You approved this PR earlier, so assuming this can go ahead now that it is rebased with latest habana-main branch?

emascarenhas · 2024-11-13T15:34:34Z

@libinta , We could start run-test on this while we are waiting for Sayantan to confirm.

nedo99 · 2024-11-18T16:22:01Z

@regisss could we start the tests so we can merge this PR?

github-actions · 2024-11-20T09:02:35Z

The code quality check failed, please run make style.

HuggingFaceDocBuilderDev · 2024-11-20T09:05:57Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Luca-Calabria · 2024-11-20T10:25:51Z

As you can see I created a new PR #1504 as fork of this one to add the fixes on style. This because we don't have write access to this repo.

emascarenhas · 2024-11-24T05:40:12Z

@Luca-Calabria , Please close this as it is no longer needed?

billishyahao force-pushed the enablege2 branch from e82c033 to 237457a Compare August 22, 2024 07:43

billishyahao changed the title ~~[Gemma2] Enable gemma2 on gaudi~~ [Gemma 2] Enable gemma 2 on gaudi Aug 22, 2024

billishyahao changed the title ~~[Gemma 2] Enable gemma 2 on gaudi~~ [Gemma v2] Enable gemma v2 on gaudi Aug 22, 2024

billishyahao marked this pull request as ready for review August 22, 2024 08:05

billishyahao requested review from bhargaveede, regisss, ssarkar2 and vivekgoe as code owners August 22, 2024 08:05

libinta added the review wip label Aug 31, 2024

ssarkar2 suggested changes Sep 5, 2024

View reviewed changes

Comment thread optimum/habana/transformers/models/gemma2/modeling_gemma2.py Outdated

Comment thread optimum/habana/transformers/models/gemma2/modeling_gemma2.py

billishyahao requested a review from ssarkar2 September 9, 2024 09:55

ssarkar2 approved these changes Sep 25, 2024

View reviewed changes

billishyahao added 5 commits October 8, 2024 03:33

[Gemma2] Enable gemma2 on gaudi

60668a4

update doc and code

7a6b330

add rope fusion

8d5f4af

enable trim_logit and reuse_cache for gemma2

31bb6a4

update the license

307ffba

billishyahao force-pushed the enablege2 branch from b265053 to 307ffba Compare October 8, 2024 03:43

Merge branch 'main' into enablege2

dd31ef6

libinta added run-test Run CI for PRs from external contributors and removed review wip labels Nov 14, 2024

Luca-Calabria mentioned this pull request Nov 20, 2024

[Gemma2] Enable Gemma2 Inference on Gaudi #1504

Merged

regisss closed this Nov 25, 2024

regisss mentioned this pull request Mar 7, 2025

fix gemma-2-27b text generation pytest #1828

Closed

Conversation

billishyahao commented Aug 21, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Before submitting

Uh oh!

Uh oh!

Uh oh!

billishyahao commented Sep 9, 2024

Uh oh!

billishyahao commented Oct 8, 2024

Uh oh!

billishyahao commented Oct 8, 2024

Uh oh!

billishyahao commented Nov 13, 2024

Uh oh!

Luca-Calabria commented Nov 13, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

emascarenhas commented Nov 13, 2024

Uh oh!

emascarenhas commented Nov 13, 2024

Uh oh!

nedo99 commented Nov 18, 2024

Uh oh!

github-actions Bot commented Nov 20, 2024

Uh oh!

HuggingFaceDocBuilderDev commented Nov 20, 2024

Uh oh!

Luca-Calabria commented Nov 20, 2024

Uh oh!

emascarenhas commented Nov 24, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

8 participants

billishyahao commented Aug 21, 2024 •

edited

Loading

Luca-Calabria commented Nov 13, 2024 •

edited

Loading