Ensure Equivalence With DPO Padding and Padding Free #34
Conversation
Signed-off-by: Yu Chin Fabian Lim <[email protected]>
open_instruct/dpo_tune_cache.py
Outdated
```python
    simpo_loss,
    wpo_loss,
)
from open_instruct.padding_free_collator import (
```
Intentionally deleted?
Yes, there is a dup two lines below.
open_instruct/dpo_utils.py
Outdated
```python
if not packing:
    concatenated_batch = concatenated_inputs(batch)
    try:
        pad_token_id = model.tokenizer.pad_token_id
```
Do models usually have a tokenizer attr? I'm finding they don't, for bamba and granite. Unless the attr is set somewhere after init.
They usually do, though the model is typed as torch.nn.Module, so I thought that in the OI use case it may not be guaranteed.
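For context, a minimal sketch of the kind of fallback being discussed; this is not the PR's actual code, and `resolve_pad_token_id` is a hypothetical helper name:

```python
import torch

def resolve_pad_token_id(model: torch.nn.Module, default: int = 0) -> int:
    # Prefer a tokenizer attached to the model, if one exists.
    tokenizer = getattr(model, "tokenizer", None)
    if tokenizer is not None and tokenizer.pad_token_id is not None:
        return tokenizer.pad_token_id
    # Fall back to the model config, which HF models usually carry.
    config = getattr(model, "config", None)
    if config is not None and getattr(config, "pad_token_id", None) is not None:
        return config.pad_token_id
    # Last resort: the hard-coded default the old code assumed.
    return default
```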
Signed-off-by: Yu Chin Fabian Lim <[email protected]>
Signed-off-by: Yu Chin Fabian Lim <[email protected]>
This reverts commit 266f214.
Signed-off-by: Yu Chin Fabian Lim <[email protected]>
garrett361 left a comment
LGTM, just one question.
Also, the quality checks are failing.
Signed-off-by: Yu Chin Fabian Lim <[email protected]>
Signed-off-by: Yu Chin Fabian Lim <[email protected]>
@garrett361 Fixed the conflict and linted the files effectively changed in this commit. But because we had to pull upstream, there are a lot of files that are not linted.
Actually I believe that padding-free (packing) is correct.
- This PR fixes the non-packing case, where `concatenated_forward` assumed `pad_token_id = 0` by mistake.
- `seq_idx` in the padding-free `concatenated_forward` was computed wrongly; see the `preference_span_search` function.

Checking
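To illustrate the two issues described above, a minimal, hypothetical sketch (not the PR's implementation; `build_seq_idx` and `label_mask` are illustrative names only):

```python
import torch

def build_seq_idx(seq_lengths: list[int]) -> torch.Tensor:
    # In a padding-free (packed) batch, each token must be tagged with the
    # index of the sequence it belongs to, e.g. lengths [3, 2] -> [0, 0, 0, 1, 1].
    return torch.repeat_interleave(
        torch.arange(len(seq_lengths)), torch.tensor(seq_lengths)
    )

def label_mask(labels: torch.Tensor, pad_token_id: int) -> torch.Tensor:
    # The non-packing path should mask with the tokenizer's real pad token id,
    # not a hard-coded 0 as the old concatenated_forward assumed.
    return labels != pad_token_id

# Example: two packed sequences of lengths 3 and 2.
print(build_seq_idx([3, 2]))  # tensor([0, 0, 0, 1, 1])
```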