Re-enable manual LoRA adapter free #19983
Open
PopFlamingo wants to merge 2 commits into ggml-org:master from
Conversation
ggerganov reviewed Mar 2, 2026
ggerganov approved these changes Mar 5, 2026
This PR proposes re-enabling manual LoRA adapter free (the llama_adapter_lora_free function), which had previously been deprecated as part of #18490.

Motivation

After reading the discussion on why llama_adapter_lora_free was deprecated and made a no-op, my understanding is that it was considered to have no real use case. However, I am currently working on a project where the ability to unload adapters is very important due to memory constraints (mobile devices). Without llama_adapter_lora_free, we have to fully unload and re-load the model, and lose any contexts (and their cached tokens) associated with it, just to free the memory used by those LoRAs. I am not aware of any other way to achieve this.

Summary of changes

The llama_adapter_lora_free function has been re-enabled and un-deprecated. Calling llama_adapter_lora_free remains optional: the LoRA will still be freed, if necessary, when its parent model is released. Documentation comments have been updated to reflect this new behavior. Concretely, we make sure to remove the freed LoRA from the ownership list of its owning model to prevent double frees.
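The lifecycle described above can be sketched as follows. This is a minimal usage sketch, assuming the llama.cpp C API as declared in llama.h (function names and signatures may differ between versions, and the adapter path is a placeholder):

```c
#include "llama.h"

// Sketch: load, use, and manually free a LoRA adapter without
// tearing down the model or its contexts.
void lora_lifecycle(struct llama_model * model, struct llama_context * ctx) {
    // Load a LoRA adapter; its lifetime is tracked by the parent model.
    // "adapter.gguf" is a placeholder path.
    struct llama_adapter_lora * lora =
        llama_adapter_lora_init(model, "adapter.gguf");
    if (lora == NULL) {
        return; // failed to load the adapter
    }

    // Attach the adapter to a context with a scaling factor.
    llama_set_adapter_lora(ctx, lora, 1.0f);

    // ... run inference with the adapter applied ...

    // Detach the adapter from the context before freeing it.
    llama_rm_adapter_lora(ctx, lora);

    // With this PR, this call reclaims the adapter's memory immediately
    // and removes it from the model's ownership list (avoiding a double
    // free when the model itself is later released).
    llama_adapter_lora_free(lora);

    // Freeing manually remains optional: any adapters still owned by the
    // model when it is released are freed automatically with it.
}
```

The key point of the change is the last two comments: the explicit free becomes a real free again rather than a no-op, while skipping it stays safe.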
Other related issues
Issue #19153 would benefit from this change as well, since I don't think the requested feature could be implemented without llama_adapter_lora_free.