
Allow loading of .safetensors through GPTQ-for-LLaMa #529

Merged 1 commit into oobabooga:main on Mar 25, 2023

Conversation

@EyeDeck (Contributor) commented Mar 24, 2023

Quantized models were hardcoded to load only .pt files, but qwopqwop200's repo already works with .safetensors if we just pass in the other file extension.

With this PR, it looks for models in the order:

  • models\model-#bit.safetensors
  • models\subfolder\model-#bit.safetensors
  • models\model-#bit.pt
  • models\subfolder\model-#bit.pt

Seems to work fine, but I've only tested it on one janky model that I ran through the quantizer myself.

@Ph0rk0z (Contributor) commented Mar 24, 2023

Yes, except we all have to re-quantize 100 GB of models for this.

@EyeDeck (Contributor, Author) commented Mar 24, 2023

Why would that be necessary? I mean, it's going to need to be done to support newer versions of GPTQ-for-LLaMa at some point, but that's not related to this PR.

@oobabooga (Owner) commented

Thanks, this is handy! More updates on GPTQ will come after #530.

@oobabooga merged commit 3da633a into oobabooga:main on Mar 25, 2023
Ph0rk0z pushed a commit to Ph0rk0z/text-generation-webui-testing that referenced this pull request on Apr 17, 2023: "Allow loading of .safetensors through GPTQ-for-LLaMa"