
enable falcon-180b inference#697

Closed
schoi-habana wants to merge 8 commits into
mainfrom
dev/schoi/falcon_180b_loading_ckpt

Conversation

@schoi-habana
Collaborator

1. The current code can load only .bin or .pt checkpoints, whereas the Falcon-180B checkpoint files are in .safetensors format. This change enables loading checkpoint files in .safetensors format.
2. The ValueError "The following model_kwargs are not used by the model: ['token_type_ids']" is addressed by a workaround that removes 'token_type_ids' from the model_kwargs.
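
For readers without the diff at hand, the two changes can be illustrated with a minimal sketch. This is not the code from this PR: the helper names `find_checkpoint_files` and `drop_token_type_ids` are hypothetical, and the order in which extensions are tried (.safetensors first, then .bin, then .pt) is an assumption made for illustration.

```python
from pathlib import Path

# Hypothetical illustration, not the PR's implementation.
# Assumed preference order: .safetensors first, then the legacy formats.
CHECKPOINT_PATTERNS = ("*.safetensors", "*.bin", "*.pt")


def find_checkpoint_files(model_dir):
    """Return the checkpoint files of the first format found in model_dir."""
    root = Path(model_dir)
    for pattern in CHECKPOINT_PATTERNS:
        files = sorted(str(p) for p in root.glob(pattern))
        if files:
            return files
    return []


def drop_token_type_ids(model_inputs):
    """Return a copy of the tokenizer output without 'token_type_ids'.

    Falcon does not accept this key, so leaving it in the kwargs passed to
    generation triggers the "not used by the model" ValueError quoted above.
    """
    return {k: v for k, v in model_inputs.items() if k != "token_type_ids"}
```

In a generation script, the filtered dictionary would then typically be unpacked into the generation call in place of the raw tokenizer output.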

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@schoi-habana schoi-habana force-pushed the dev/schoi/falcon_180b_loading_ckpt branch from 6659dca to fbf1bd2 Compare February 8, 2024 23:26
@regisss
Collaborator

regisss commented Feb 11, 2024

@schoi-habana From what I understood in this PR, loading safetensors checkpoints with DeepSpeed is not yet supported by Habana's DeepSpeed fork v1.14. So my understanding is that it will work once v1.15 is released, right?

@schoi-habana
Collaborator Author

@regisss That's true. Please merge this after the 1.15.0 release.

@schoi-habana schoi-habana force-pushed the dev/schoi/falcon_180b_loading_ckpt branch from a24421c to e40e9cc Compare March 7, 2024 01:33
@schoi-habana schoi-habana mentioned this pull request Mar 7, 2024
3 tasks
@schoi-habana
Collaborator Author

This PR is no longer needed, as #773 is a permanent solution.

@schoi-habana schoi-habana deleted the dev/schoi/falcon_180b_loading_ckpt branch March 19, 2024 00:49
gplutop7 pushed a commit to HabanaAI/optimum-habana-fork that referenced this pull request Oct 15, 2025
gplutop7 pushed a commit to HabanaAI/optimum-habana-fork that referenced this pull request Nov 6, 2025


4 participants