Fix LoRa weight merging in export #19

antimatter15 · 2023-03-16T07:52:06Z

I was trying to export the model for use with llama.cpp but noticed that it just copied weights from the base model verbatim. I'm suspect there's a "right" way to do this, but this was the approach that worked for me.

tloen · 2023-03-16T08:04:43Z

Have you tested that the existing code doesn't work? Assuming you're not loading the foundation model in 8bit, the call to .eval() should merge the LoRA weights into the base weights.

antimatter15 · 2023-03-16T08:10:26Z

Yeah I've tested the existing code and it didn't seem to work. Looking at the code for the underlying Peft implementation I don't see why .eval() would merge weights. All the weight merging logic is located inside the .train() method.

…

On Thu, Mar 16, 2023 at 1:04 AM Eric J. Wang ***@***.***> wrote: Have you tested that the existing code doesn't work? Assuming you're not loading the foundation model in 8bit, the call to .eval() should merge the LoRA weights into the base weights. — Reply to this email directly, view it on GitHub <#19 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAAHKZSF7CV2AF2JIOFQ2W3W4LCSNANCNFSM6AAAAAAV425TNQ> . You are receiving this because you authored the thread.Message ID: ***@***.***>

tloen · 2023-03-16T08:22:36Z

eval() should be an alias for train(false), and merge_weights should default to true. I'll look into it tomorrow morning.

tloen · 2023-03-16T08:42:57Z

Can't hurt

Fix LoRa weight merging

dde8995

tloen merged commit 6681523 into tloen:main Mar 16, 2023

gyunggyung mentioned this pull request Mar 18, 2023

[20230319] Weekly AI ArXiv 만담 시즌2 - 10회차 jungwoo-ha/WeeklyArxivTalk#76

Open

jethro254wt mentioned this pull request Mar 29, 2023

llama_model_load: loading model from 'models/7B/ggml-model-q4_0.bin' cocktailpeanut/dalai#251

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix LoRa weight merging in export #19

Fix LoRa weight merging in export #19

antimatter15 commented Mar 16, 2023

tloen commented Mar 16, 2023

antimatter15 commented Mar 16, 2023 via email

tloen commented Mar 16, 2023

tloen commented Mar 16, 2023

Fix LoRa weight merging in export #19

Fix LoRa weight merging in export #19

Conversation

antimatter15 commented Mar 16, 2023

tloen commented Mar 16, 2023

antimatter15 commented Mar 16, 2023 via email

tloen commented Mar 16, 2023

tloen commented Mar 16, 2023