add support for TinyLlama model by tjs-intel · Pull Request #693 · huggingface/optimum-habana

tjs-intel · 2024-02-07T00:10:24Z

⚠️ Do not merge this PR before Habana DeepSpeed 1.15 is released ⚠️

What does this PR do?

The TinyLlama model only has checkpoints in the form of model.safetensors. This checkpoint needs to be included in the list of checkpoints that is passed to DeepSpeed in order for the model to function properly when initialized with DeepSpeed.

This PR adds the safetensor checkpoints to the list of checkpoints passed to DeepSpeed.

Note: This change requires an upstream commit in microsoft/DeepSpeed to be merged downstream to HabanaAI/DeepSpeed in order for DeepSpeed to support the provided safetensor format.
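
For illustration, here is a minimal sketch of the idea described above: gather the checkpoint files for a model, including `.safetensors` files, and write them to a JSON file that a DeepSpeed-style loader could consume. The helper names (`collect_checkpoint_files`, `write_checkpoints_json`), the suffix list, and the JSON layout are assumptions made for this example. They are not taken from the diff in this PR.

```python
import json
from pathlib import Path

# Hypothetical suffix list for this sketch; the PR's actual pattern may differ.
CHECKPOINT_SUFFIXES = (".safetensors", ".bin", ".pt")


def collect_checkpoint_files(model_dir):
    """Return sorted checkpoint paths under model_dir.

    If any .safetensors files exist (single-file or sharded), only those
    are returned; otherwise every .bin/.pt file found is returned.
    """
    files = [
        p for p in Path(model_dir).rglob("*")
        if p.is_file() and p.suffix in CHECKPOINT_SUFFIXES
    ]
    safetensors = sorted(str(p) for p in files if p.suffix == ".safetensors")
    if safetensors:
        return safetensors
    return sorted(str(p) for p in files)


def write_checkpoints_json(model_dir, out_path):
    """Write the checkpoint list to out_path and return the written data.

    The {"type", "checkpoints", "version"} layout is an assumption here.
    """
    data = {
        "type": "ds_model",
        "checkpoints": collect_checkpoint_files(model_dir),
        "version": 1.0,
    }
    with open(out_path, "w") as f:
        json.dump(data, f)
    return data
```

A model directory that holds only `model.safetensors`, as described for TinyLlama, would then yield a one-entry checkpoint list instead of an empty one.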

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

regisss · 2024-02-07T07:41:58Z

@mandy-li Do you know if this commit will be part of Habana DeepSpeed 1.15?

mandy-li · 2024-02-07T08:05:12Z

> @mandy-li Do you know if this commit will be part of Habana DeepSpeed 1.15?

@regisss, @tjs-intel, DS changes should be submitted to the internal Habana deepspeed-fork repo.

regisss · 2024-02-07T08:26:28Z

> > @mandy-li Do you know if this commit will be part of Habana DeepSpeed 1.15?
>
> @regisss, @tjs-intel, DS changes should be submitted to internal habana deepspeed-fork repo

deepspeedai/DeepSpeed@c8c57b8 is part of the v0.13 release of DeepSpeed. I guess it should be part of Habana DeepSpeed 1.15 or 1.16, no?

mandy-li · 2024-02-07T08:38:56Z

> > > @mandy-li Do you know if this commit will be part of Habana DeepSpeed 1.15?
> >
> > @regisss, @tjs-intel, DS changes should be submitted to internal habana deepspeed-fork repo
>
> microsoft/DeepSpeed@c8c57b8 is part of the v0.13 release of DeepSpeed. I guess it should be part of Habana DeepSpeed 1.15 or 1.16 no?

Yes, it will be in 1.15, but we modified it in the internal Habana DS repo.
@tjs-intel, this PR should go to the Habana OH fork to be tested internally.

tjs-intel · 2024-02-08T16:12:22Z

OH-fork PR here: HabanaAI#25

tjs-intel · 2024-02-14T17:34:28Z

HabanaAI#25 has been merged

regisss · 2024-02-19T09:37:14Z

> HabanaAI#25 has been merged

So we need to wait for 1.15 to be merged to merge this PR, right?

tjs-intel · 2024-02-20T22:27:23Z

@regisss I will leave that discussion to the maintainers.

The consequence of merging this before the HabanaAI/DeepSpeed 1.15 release is that the latest released version of HabanaAI/DeepSpeed (1.14) does not support safetensors. If there are any models with both .bin and .safetensors checkpoints, this change will prefer the .safetensors checkpoint and send that to DeepSpeed instead of the .bin checkpoint. This will cause DeepSpeed 1.14 to raise an exception since the .safetensors checkpoint cannot be unpickled.
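
To make the failure mode concrete, here is a small sketch of why a pickle-based loader rejects a safetensors file. It uses only the Python standard library and builds a header-only blob laid out like a safetensors file (an 8-byte little-endian header length followed by a JSON header). The function names are hypothetical, and the real DeepSpeed 1.14 code path is assumed rather than shown.

```python
import json
import pickle
import struct


def fake_safetensors_bytes():
    """Build a header-only blob laid out like a safetensors file (no tensors)."""
    header = json.dumps({"__metadata__": {"format": "pt"}}).encode("utf-8")
    # safetensors files start with the JSON header length as a little-endian u64.
    return struct.pack("<Q", len(header)) + header


def try_unpickle(blob):
    """Attempt to unpickle blob; report the error instead of raising."""
    try:
        pickle.loads(blob)
    except pickle.UnpicklingError as err:
        return f"UnpicklingError: {err}"
    return "ok"


if __name__ == "__main__":
    # A genuine pickle payload round-trips; the safetensors-style blob does not.
    print(try_unpickle(pickle.dumps({"weight": [1.0, 2.0]})))
    print(try_unpickle(fake_safetensors_bytes()))
```

The blob's first bytes are a length field, not a pickle opcode stream, so the unpickler gives up immediately. This is the kind of exception described above for a loader that only understands pickled `.bin` checkpoints.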

HuggingFaceDocBuilderDev · 2024-02-21T00:06:17Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

regisss

Thanks for the clear explanation @tjs-intel! In that case, we'll wait for the release of SynapseAI v1.15 to merge this PR.
Changes look good to me, could you also run the following to make the code style check pass please?

pip install -U ruff
make style

regisss · 2024-03-25T17:51:40Z

Merging in the synapse_1.15 branch

tjs-intel requested review from bhargaveede, regisss, ssarkar2 and vivekgoe as code owners February 7, 2024 00:10

vidyasiv reviewed Feb 7, 2024

Comment thread optimum/habana/transformers/generation/utils.py Outdated

tjs-intel force-pushed the support-safetensors-oh branch from 620fd07 to 595cb67 Compare February 8, 2024 16:21

regisss mentioned this pull request Feb 11, 2024

enable falcon-180b inference #697

Closed

3 tasks

Add support for safetensors and sharded checkpoints

e3f8822

tjs-intel force-pushed the support-safetensors-oh branch from 595cb67 to e3f8822 Compare February 13, 2024 20:04

regisss added the synapse 1.15 label Feb 21, 2024

regisss reviewed Feb 21, 2024

make style

2619e26

regisss added the run-test label (Run CI for PRs from external contributors) Feb 22, 2024

regisss approved these changes Feb 22, 2024

regisss mentioned this pull request Feb 25, 2024

Issue running meta-llama/Llama-2-13b-chat-hf huggingface/tgi-gaudi#54

Closed

4 tasks

regisss changed the base branch from main to synapse_1.15 March 25, 2024 17:51

regisss merged commit e06fb93 into huggingface:synapse_1.15 Mar 25, 2024

tjs-intel deleted the support-safetensors-oh branch March 25, 2024 19:14

add support for TinyLlama model #693: regisss merged 2 commits into huggingface:synapse_1.15 from tjs-intel:support-safetensors-oh
