Align KTO with DPO: Align _precompute_ref_logps #5714

Merged
albertvillanova merged 8 commits into main from align-kto-dpo-precompute_ref_logps on May 7, 2026