Adding default cpu support. Removing max tokens for huggingface warnings #366

Merged: 1 commit into main on Jun 22, 2023

Conversation

DhruvaBansal00 (Contributor)

No description provided.

@nihit (Contributor) left a comment:

lgtm otherwise

```diff
@@ -38,7 +38,9 @@ def __init__(self, config: AutolabelConfig, cache: BaseCache = None) -> None:
         # initialize HF pipeline
         tokenizer = AutoTokenizer.from_pretrained(self.model_name)
         quantize_bits = self.model_params["quantize"]
-        if quantize_bits == 8:
+        if not torch.cuda.is_available():
+            model = AutoModelForSeq2SeqLM.from_pretrained(self.model_name)
```
@nihit (Contributor) commented on the added CPU fallback:

Is model quantization only respected for GPU inference? Makes sense, but just want to confirm.

@DhruvaBansal00 (Contributor, Author) replied:
Yeah, see bitsandbytes-foundation/bitsandbytes#40; quantization isn't really supported on CPU.
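(For context, a minimal sketch of how the resulting device/quantization branching might look. Only the CPU fallback comes from the diff above; the standalone function, the 8-bit branch, and the `load_in_8bit` / `device_map` arguments are assumptions for illustration, not the exact merged code.)

```python
import torch
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer


def load_model(model_name: str, quantize_bits: int = 8):
    # Hypothetical illustration of the branching discussed in this PR.
    tokenizer = AutoTokenizer.from_pretrained(model_name)

    if not torch.cuda.is_available():
        # No GPU: load the full-precision model on CPU and ignore the quantize
        # setting, since bitsandbytes 8-bit quantization requires CUDA.
        model = AutoModelForSeq2SeqLM.from_pretrained(model_name)
    elif quantize_bits == 8:
        # GPU available: load the model in 8-bit via bitsandbytes (assumed branch).
        model = AutoModelForSeq2SeqLM.from_pretrained(
            model_name, load_in_8bit=True, device_map="auto"
        )
    else:
        # GPU available, no quantization requested.
        model = AutoModelForSeq2SeqLM.from_pretrained(model_name, device_map="auto")

    return model, tokenizer
```

With a guard like this in place, a machine without CUDA simply loads the full-precision model and the quantize setting is ignored, which is what the question above confirms.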

DhruvaBansal00 merged commit 6e0bea4 into main on Jun 22, 2023.
DhruvaBansal00 deleted the huggingface-local-cpu branch on Jun 22, 2023 at 23:42.