
How can I merge the LoRA weights into the base model? #74

Open
pantDevesh opened this issue Jun 16, 2024 · 7 comments

Comments

@pantDevesh

Is there a script for this?

@mkserge

mkserge commented Jun 17, 2024

You can do something like this:

import safetensors.torch
from mistral_inference.model import Transformer

# Load the base model, apply the LoRA adapter on top, then save the merged weights.
model = Transformer.from_folder(args.model_path, device="cuda:0")
model.load_lora("/path/to/lora.safetensors", device="cuda:0")
safetensors.torch.save_model(model, "/path/to/merged.safetensors")
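
To sanity-check the merged file, safetensors can restore it into a freshly built model (a minimal sketch; load_model is the counterpart of save_model):

# Rebuild the architecture from the base folder, then restore the merged weights.
model = Transformer.from_folder(args.model_path, device="cuda:0")
safetensors.torch.load_model(model, "/path/to/merged.safetensors")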

@forest520

How can I perform inference with a LoRA model from Python code when save_adapters = True?
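
For reference, inference with a saved adapter looks roughly like this (a sketch following the mistral_inference README pattern; all paths are placeholders, and on mistral_inference >= 1.2 the Transformer import moves to mistral_inference.transformer, as noted further down):

from mistral_common.protocol.instruct.messages import UserMessage
from mistral_common.protocol.instruct.request import ChatCompletionRequest
from mistral_common.tokens.tokenizers.mistral import MistralTokenizer
from mistral_inference.model import Transformer  # >= 1.2: mistral_inference.transformer
from mistral_inference.generate import generate

# Load the base model, then apply the saved adapter on top of it.
tokenizer = MistralTokenizer.from_file("/path/to/tokenizer.model.v3")
model = Transformer.from_folder("/path/to/base_model", device="cuda:0")
model.load_lora("/path/to/lora.safetensors")

# Encode a chat prompt, generate, and decode the completion.
request = ChatCompletionRequest(messages=[UserMessage(content="Hello!")])
tokens = tokenizer.encode_chat_completion(request).tokens
out_tokens, _ = generate(
    [tokens], model, max_tokens=64, temperature=0.0,
    eos_id=tokenizer.instruct_tokenizer.tokenizer.eos_id,
)
print(tokenizer.instruct_tokenizer.tokenizer.decode(out_tokens[0]))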

@kehuitt

kehuitt commented Jul 17, 2024

(Quoting @mkserge's merge snippet above.)

When I run this, I get ImportError: cannot import name 'Transformer' from 'mistral_inference.model'. I'm on mistral_inference==1.2.0. How can I fix this? Thanks!

@pandora-s-git (Collaborator)

Try from mistral_inference.transformer import Transformer, as the package was very recently updated with the Codestral Mamba release!
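
For reference, the merge snippet from earlier with the updated import:

import safetensors.torch
from mistral_inference.transformer import Transformer  # new location in mistral_inference >= 1.2

model = Transformer.from_folder(args.model_path, device="cuda:0")
model.load_lora("/path/to/lora.safetensors", device="cuda:0")
safetensors.torch.save_model(model, "/path/to/merged.safetensors")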

@kehuitt

kehuitt commented Jul 20, 2024

A single GPU doesn't seem to be able to load the entire Mixtral-8x7B-Instruct-v0.1 model. How should I merge the model using multiple cards? Thanks!

@leloss

leloss commented Aug 20, 2024

(Quoting @kehuitt's multi-GPU question above.)

Apparently, the only merging method available today relies on loading everything onto a single device, which forces us to rent a 40 GB-GPU instance such as the p4d.24xlarge even for the 7B model. Someone please correct me if I'm wrong.
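
One workaround that avoids GPU memory limits altogether is to merge on CPU at the tensor level, since mathematically the merge is just W_merged = W + scaling * (B @ A). A minimal sketch, assuming the adapter stores "<prefix>.lora_A.weight" / "<prefix>.lora_B.weight" pairs that correspond to a base tensor "<prefix>.weight", and that scaling matches the value used in training (both are assumptions; check your checkpoint's actual key names before relying on this):

import torch
import safetensors.torch

# Load both checkpoints on CPU; no GPU is needed for the merge itself.
base = safetensors.torch.load_file("/path/to/consolidated.safetensors", device="cpu")
lora = safetensors.torch.load_file("/path/to/lora.safetensors", device="cpu")
scaling = 2.0  # assumption: lora_alpha / rank from your training config

for key, a in lora.items():
    if not key.endswith("lora_A.weight"):
        continue
    prefix = key[: -len("lora_A.weight")]
    b = lora[prefix + "lora_B.weight"]
    target = prefix + "weight"
    # Accumulate in float32 for accuracy, then cast back to the base dtype.
    merged = base[target].to(torch.float32) + scaling * (b.to(torch.float32) @ a.to(torch.float32))
    base[target] = merged.to(base[target].dtype)

safetensors.torch.save_file(base, "/path/to/merged.safetensors")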

@abhishekdhankar95

abhishekdhankar95 commented Sep 10, 2024

mistral-finetune requires torch==2.2, whereas mistral-inference requires torch==2.3.0 for all but its first release.
Is there any way to have the two of them in the same conda environment without conflicting requirements?
