Fail early for unsupported PEFT + Liger Kernel in DPO by albertvillanova · Pull Request #5709 · huggingface/trl

albertvillanova · 2026-05-06T12:05:31Z

Fail early for unsupported PEFT + Liger Kernel in DPO.

This pull request introduces a check to prevent the use of the Liger DPO loss with PEFT models, as this combination is not yet supported. It also adds a corresponding test to ensure that initializing a DPOTrainer with both Liger Kernel and PEFT raises a clear error.

Changes

Validation and error handling improvements:

Added a check in the DPOTrainer initializer that raises a NotImplementedError, with a clear error message, when use_liger_kernel is enabled and the model is a PEFT model.
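As a rough illustration of the fail-fast guard described above, here is a minimal sketch that runs without TRL or PEFT installed. `SketchConfig`, `FakePeftModel`, `validate_liger_and_peft`, and the local `is_peft_model` are hypothetical stand-ins invented for this example; only the flag name `use_liger_kernel` and the error message come from the PR. The real check lives inside the DPOTrainer initializer and may be structured differently.

```python
from dataclasses import dataclass


@dataclass
class SketchConfig:
    """Hypothetical stand-in for the trainer config; only the relevant flag."""

    use_liger_kernel: bool = False


class FakePeftModel:
    """Hypothetical stand-in for a PEFT-wrapped model."""


def is_peft_model(model) -> bool:
    # Stand-in for the real helper, so the sketch runs without PEFT installed.
    return isinstance(model, FakePeftModel)


def validate_liger_and_peft(args: SketchConfig, model) -> None:
    """Reject the unsupported combination up front, before any training work."""
    if args.use_liger_kernel and is_peft_model(model):
        raise NotImplementedError("Liger DPO loss is not implemented for PEFT models.")


# Supported combinations pass silently.
validate_liger_and_peft(SketchConfig(use_liger_kernel=False), FakePeftModel())
validate_liger_and_peft(SketchConfig(use_liger_kernel=True), object())
```

Running the validation at construction time means the user sees the error immediately, rather than after data loading and model setup when the first loss is computed.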

Testing enhancements:

Added a new test (test_init_fails_with_peft_and_liger) in tests/test_dpo_trainer.py to verify that initializing DPOTrainer with both Liger Kernel and PEFT fails as expected, raising the correct error.
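The shape of such a regression test might look like the sketch below. It uses the standard-library `unittest` module and a hypothetical `SketchTrainer` class in place of the real DPOTrainer, so it is not the test added in tests/test_dpo_trainer.py; only the test name and the error message are taken from the PR.

```python
import unittest


class FakePeftModel:
    """Hypothetical stand-in for a PEFT-wrapped model."""


class SketchTrainer:
    """Hypothetical stand-in for DPOTrainer; only the init-time guard is modeled."""

    def __init__(self, model, use_liger_kernel: bool = False):
        if use_liger_kernel and isinstance(model, FakePeftModel):
            raise NotImplementedError("Liger DPO loss is not implemented for PEFT models.")
        self.model = model


class TestInitGuard(unittest.TestCase):
    def test_init_fails_with_peft_and_liger(self):
        # The error must surface during construction, before any training step.
        with self.assertRaisesRegex(NotImplementedError, "not implemented for PEFT"):
            SketchTrainer(FakePeftModel(), use_liger_kernel=True)

    def test_init_succeeds_without_liger(self):
        # Supported configurations are unaffected by the guard.
        trainer = SketchTrainer(FakePeftModel(), use_liger_kernel=False)
        self.assertIsInstance(trainer.model, FakePeftModel)
```

Asserting on both the exception type and part of the message guards against the check being replaced by a different, less informative failure.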

Note

Low Risk
Adds an early validation guard and a regression test. Supported training flows are unchanged; the only behavioral difference is that an already-unsupported configuration now produces a clearer error sooner.

Overview
Prevents unsupported configurations by failing fast when initializing DPOTrainer with use_liger_kernel=True on a PEFT-wrapped model, raising a clear NotImplementedError.

Adds a regression test (test_init_fails_with_peft_and_liger) to ensure the PEFT+Liger combination errors at init time, and removes the now-redundant PEFT check from the Liger loss computation path.

Reviewed by Cursor Bugbot for commit 623f168. Bugbot is set up for automated code reviews on this repo.

HuggingFaceDocBuilderDev · 2026-05-06T12:08:28Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

qgallouedec · 2026-05-06T13:32:05Z

+            if is_peft_model(model):
+                raise NotImplementedError("Liger DPO loss is not implemented for PEFT models.")


maybe we could remove this then?

trl/trl/trainer/dpo_trainer.py, lines 1101 to 1102 in 62a66af:

    if is_peft_model(model):
        raise NotImplementedError("Liger DPO loss is not implemented for PEFT models.")

albertvillanova · 2026-05-06

I thought the same! But Codex told me to keep it... I should have followed my intuition.

albertvillanova added 2 commits May 6, 2026 10:37

Make DPO fail fast for peft + liger-kernel

2706112

Test DPO fail fast for peft + liger-kernel

62a66af

qgallouedec reviewed May 6, 2026

albertvillanova added 2 commits May 6, 2026 15:39

Remove error raising from _compute_loss_liger

70540c1

Merge remote-tracking branch 'upstream/main' into dpo-fail-fast-peft-…

623f168

qgallouedec approved these changes May 6, 2026

albertvillanova merged commit 19d007e into main May 6, 2026
13 checks passed

albertvillanova deleted the dpo-fail-fast-peft-liger-kernel branch May 6, 2026 14:29

albertvillanova merged 4 commits into main from dpo-fail-fast-peft-liger-kernel
