Make `HUGGINGFACE_OFFLINE` configurable #7

slyt · 2024-07-15T22:06:22Z

Currently HUGGINGFACE_OFFLINE=1 is hardcoded in the helm template.

I'd like for it to be configurable in values.yaml so that I don't have to pull models from s3 and can download them directly from huggingface hub via the functionality in huggingface text-generation-inference image.

slyt · 2024-07-15T22:13:23Z

I was able to overwrite it with:

extraEnvVars:
  - name: HUGGINGFACE_OFFLINE
    value: "0"

voatsap · 2024-07-23T21:30:49Z

Will take a look on this @romanprog please check

voatsap · 2024-07-29T11:49:45Z

don't have to pull models from s3 and can download them directly from huggingface hub via the functionality in huggingface text-generation-inference image.

There is another block that refered on if model is downloaded through s3 or directly from HF
init:
s3:
enabled: false
bucketURL: s3://k8s-model-zephyr/llm/deployment/HuggingFaceH4/zephyr-7b-beta

If s3 enabled: false it would download it directly from HF, if not, it would use bucketURL

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make `HUGGINGFACE_OFFLINE` configurable #7

Make `HUGGINGFACE_OFFLINE` configurable #7

slyt commented Jul 15, 2024

slyt commented Jul 15, 2024

voatsap commented Jul 23, 2024

voatsap commented Jul 29, 2024

Make HUGGINGFACE_OFFLINE configurable #7

Make HUGGINGFACE_OFFLINE configurable #7

Comments

slyt commented Jul 15, 2024

slyt commented Jul 15, 2024

voatsap commented Jul 23, 2024

voatsap commented Jul 29, 2024

Make `HUGGINGFACE_OFFLINE` configurable #7

Make `HUGGINGFACE_OFFLINE` configurable #7