
FIX Model with nested all-linear target modules #2391

Open
wants to merge 1 commit into
base: main

Conversation

BenjaminBossan
Member

Resolves #2390

There was a bug in PEFT when adding a LoRA adapter with target_modules='all-linear' (e.g. via add_adapter) to a model that already had LoRA adapters applied. The resolution of 'all-linear' would then target, for instance, lora_A and lora_B, leading to nested LoRA adapters. With this fix, those nested modules are excluded and the correct layers are targeted.
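To make the failure mode concrete, here is a minimal reproduction sketch based on the description above; the base model ("facebook/opt-125m") and the adapter name ("second") are illustrative choices, not taken from the linked issue:

    from transformers import AutoModelForCausalLM
    from peft import LoraConfig, get_peft_model

    # First adapter: target every linear layer.
    model = AutoModelForCausalLM.from_pretrained("facebook/opt-125m")
    peft_model = get_peft_model(model, LoraConfig(target_modules="all-linear"))

    # Second adapter: before this fix, resolving 'all-linear' here could also pick
    # up the nested lora_A / lora_B / base_layer submodules created by the first
    # adapter, nesting LoRA layers inside LoRA layers.
    peft_model.add_adapter("second", LoraConfig(target_modules="all-linear"))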

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

for suffix, child in module.named_modules():
    if suffix:
        module_names_to_exclude.add(f"{prefix}.{suffix}")

Collaborator

If we subtract all the base tuner layers anyway, why gather the linear ones explicitly in the first place and not just extract all the base tuner layers?

Member Author

I'm not sure if I fully understand your comment. Here is what happens:

1: Old code

    for name, module in model.named_modules():
        if isinstance(module, linear_classes):
            linear_module_names.add(name)

Adds all linear modules, which is mostly correct, except that if LoRA is already applied, it will also add lora_A, lora_B, base_layer, etc.
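As a toy illustration (my own minimal example, not code from PEFT), this is what the old check collects once LoRA has been applied to a small custom model:

    import torch.nn as nn
    from peft import LoraConfig, get_peft_model

    class Toy(nn.Module):
        def __init__(self):
            super().__init__()
            self.proj = nn.Linear(8, 8)

        def forward(self, x):
            return self.proj(x)

    peft_model = get_peft_model(Toy(), LoraConfig(target_modules=["proj"]))

    # The old check collects every nn.Linear, including the nested LoRA internals:
    linear_classes = (nn.Linear,)
    linear_module_names = {
        name for name, module in peft_model.named_modules() if isinstance(module, linear_classes)
    }
    # e.g. {'base_model.model.proj.base_layer',
    #       'base_model.model.proj.lora_A.default',
    #       'base_model.model.proj.lora_B.default'}
    # Note that the lora.Linear wrapper at 'base_model.model.proj' is *not* an
    # nn.Linear, so it is missing here, which is why step 2 below is needed.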

2: New code

        elif isinstance(module, BaseTunerLayer) and any(n in type(module).__name__ for n in linear_names):
            linear_module_names.add(name)

Here we check for lora.Linear etc., which would be present if LoRA is already applied, and add those too, since those are the targets we want (as they replace the original nn.Linear).
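Continuing the toy example from step 1 (again just an illustration; linear_names here is a simplified stand-in for the names PEFT actually checks), this picks up the LoRA wrapper itself:

    from peft.tuners.tuners_utils import BaseTunerLayer

    linear_names = ("Linear",)  # simplified assumption for illustration
    for name, module in peft_model.named_modules():
        if isinstance(module, BaseTunerLayer) and any(n in type(module).__name__ for n in linear_names):
            linear_module_names.add(name)
    # 'base_model.model.proj' (a peft.tuners.lora.layer.Linear, whose class name
    # is 'Linear') is now included as well.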

3: New code

    for prefix, module in model.named_modules():
        if isinstance(module, BaseTunerLayer):
            for suffix, child in module.named_modules():
                if suffix:
                    module_names_to_exclude.add(f"{prefix}.{suffix}")

Here we collect the nested lora_A, lora_B, base_layer, etc. submodules that may have been wrongly added in step 1, so they can be excluded from the targets.

In summary, step 3 is the fix for accidentally targeting nested nn.Linear layers, as originally reported in #2390. Step 2 was necessary because otherwise we would not actually update the lora.Linear (et al.) layers when adding a second adapter.
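Tying the three steps together on the same toy model, here is a simplified sketch of the resulting resolution (not the exact PEFT implementation):

    module_names_to_exclude = set()
    for prefix, module in peft_model.named_modules():
        if isinstance(module, BaseTunerLayer):
            for suffix, child in module.named_modules():
                if suffix:  # skip the empty name of the BaseTunerLayer itself
                    module_names_to_exclude.add(f"{prefix}.{suffix}")

    target_modules = linear_module_names - module_names_to_exclude
    # -> {'base_model.model.proj'}: the lora.Linear wrapper is targeted, while its
    #    nested lora_A / lora_B / base_layer submodules are excluded.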


Successfully merging this pull request may close these issues.

Bug: Using 2 LoRA configs with target_modules='all-linear' leads to nested LoRA layers