Allow loading of .safetensors through GPTQ-for-LLaMa by EyeDeck · Pull Request #529 · oobabooga/textgen

EyeDeck · 2023-03-24T01:39:17Z

Quantized models were hardcoded to only load .pt, but qwopqwop200's repo already works with .safetensors if we just pass in the other file extension.

With this PR, the loader looks for the model file in this order:

models\model-#bit.safetensors
models\subfolder\model-#bit.safetensors
models\model-#bit.pt
models\subfolder\model-#bit.pt

Seems to work fine, but I only tested on one janky model I ran through the quantizer myself.
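
For readers who want to see the lookup order spelled out, here is a minimal sketch of it in Python. It is not the code from this PR: the function name `find_quantized_checkpoint` and its parameters are hypothetical, and it assumes the `#` in `model-#bit` stands for the bit width and that `subfolder` is whatever directory the model lives in under `models`.

```python
from pathlib import Path
from typing import Optional


def find_quantized_checkpoint(
    models_dir: Path, subfolder: str, model_name: str, bits: int
) -> Optional[Path]:
    """Return the first matching checkpoint, or None if nothing is found.

    Hypothetical helper illustrating the order described above:
    .safetensors is tried before .pt, and for each extension the
    top-level models directory is tried before the subfolder.
    """
    for ext in (".safetensors", ".pt"):
        filename = f"{model_name}-{bits}bit{ext}"
        candidates = (
            models_dir / filename,
            models_dir / subfolder / filename,
        )
        for candidate in candidates:
            if candidate.is_file():
                return candidate
    return None
```

A caller would then hand the returned path to whatever loader it uses, falling back to an error message when the result is `None`.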

Ph0rk0z · 2023-03-24T11:49:42Z

Yes, except we all have to re-quantize 100 GB of models for this.

EyeDeck · 2023-03-24T18:03:23Z

Why would that be necessary? I mean, it's going to need to be done to support newer versions of GPTQ-for-LLaMa at some point, but that's not related to this PR.

oobabooga · 2023-03-25T02:50:53Z

Thanks, this is handy! More updates on GPTQ will come after #530.

Allow loading of .safetensors through GPTQ-for-LLaMa


dcfd866

oobabooga mentioned this pull request Mar 24, 2023

Move to updated GPTQ with new PTB and C4 eval #541

Closed

oobabooga merged commit 3da633a into oobabooga:main Mar 25, 2023

Ph0rk0z pushed a commit to Ph0rk0z/text-generation-webui-testing that referenced this pull request Apr 17, 2023

Merge pull request oobabooga#529 from EyeDeck/main

20987d9

Allow loading of .safetensors through GPTQ-for-LLaMa

oobabooga merged 1 commit into oobabooga:main from EyeDeck:main
