You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'd like for it to be configurable in values.yaml so that I don't have to pull models from s3 and can download them directly from huggingface hub via the functionality in huggingface text-generation-inference image.
The text was updated successfully, but these errors were encountered:
don't have to pull models from s3 and can download them directly from huggingface hub via the functionality in huggingface text-generation-inference image.
There is another block that refered on if model is downloaded through s3 or directly from HF
init:
s3:
enabled: false
bucketURL: s3://k8s-model-zephyr/llm/deployment/HuggingFaceH4/zephyr-7b-beta
If s3 enabled: false it would download it directly from HF, if not, it would use bucketURL
Currently HUGGINGFACE_OFFLINE=1 is hardcoded in the helm template.
I'd like for it to be configurable in values.yaml so that I don't have to pull models from s3 and can download them directly from huggingface hub via the functionality in huggingface text-generation-inference image.
The text was updated successfully, but these errors were encountered: