
Update ckpt loading #773

Merged
regisss merged 11 commits into main from dev/schoi/falcon_180b_loading_ckpt_new
Mar 27, 2024


@schoi-habana
Collaborator

PR#15 reads the set of ckpt file names from the index JSON file.
When OH downloads files from the hub instead of loading them from a cache dir, get_repo_root() skips downloading the index JSON file, so the code from PR#15 fails to load the file names.
This PR instead scans the path and returns a list of the file names that match the pattern.
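
The directory-scanning approach described above can be sketched roughly as follows. This is a minimal illustration, not the code added in `optimum/habana/checkpoint_utils.py`: the function name `find_checkpoint_files`, its `extension` parameter, and the `.safetensors` default are all assumptions made for the example.

```python
from pathlib import Path


def find_checkpoint_files(model_dir, extension=".safetensors"):
    """Return the sorted names of files in model_dir that end with `extension`.

    Hypothetical helper for illustration only; the name, signature, and
    default extension are assumptions, not the PR's actual implementation.
    """
    root = Path(model_dir)
    # Globbing the directory relies only on the files present on disk,
    # so it works even when the index JSON file was never downloaded.
    return sorted(p.name for p in root.glob(f"*{extension}") if p.is_file())
```

Because the result depends only on what is actually on disk, the same call should behave the same way for a cache dir and for a freshly downloaded snapshot.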

Tested the models in cache dir:
Falcon-7b, Falcon-40b, Falcon-180b, Llama-70b

Tested the models downloaded from HF hub:
Falcon-7b, Falcon-40b, TinyLlama

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

@schoi-habana
Collaborator Author

Please note that this is to fix errors from #697.

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@libinta added the run-test (Run CI for PRs from external contributors) and synapse 1.15 labels on Mar 19, 2024
@libinta libinta mentioned this pull request Mar 19, 2024
3 tasks
@schoi-habana schoi-habana mentioned this pull request Mar 19, 2024
3 tasks
Comment thread optimum/habana/transformers/generation/utils.py
Comment thread optimum/habana/checkpoint_utils.py Outdated
@schoi-habana schoi-habana mentioned this pull request Mar 25, 2024
3 tasks
@regisss regisss merged commit cc85226 into main Mar 27, 2024
@regisss regisss deleted the dev/schoi/falcon_180b_loading_ckpt_new branch March 27, 2024 12:36

Labels

run-test (Run CI for PRs from external contributors), synapse 1.15


5 participants