🔀 Rename `get_batch_sample` and add `num_items_in_batch` to `compute_loss` #2246

qgallouedec · 2024-10-18T09:22:37Z

What does this PR do?

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a GitHub issue? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

HuggingFaceDocBuilderDev · 2024-10-18T09:31:22Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

trl/trainer/orpo_trainer.py

trl/trainer/cpo_trainer.py

lewtun

Thanks for the fast fix! LGTM and I left a comment about some follow-up cleaning we can do with these generate methods in favour of extending the LogCompletions callback

lewtun · 2024-10-18T18:43:02Z

trl/trainer/bco_trainer.py

        return SequentialSampler(self.train_dataset)

-    def get_batch_samples(self, model, batch: Dict[str, torch.LongTensor]) -> Tuple[str, str]:
+    def generate_from_model_and_ref(self, model, batch: Dict[str, torch.LongTensor]) -> Tuple[str, str]:


Note for future: now that we have LogCompletions callback, it might be possible to enable the generative aspects of this method directly as a callback. We'd probably have to extend the LogCompletions callback to check if a reference model exists and generate for that too, but that seems better than having this code duplicated all over our preference trainers

…loss` (huggingface#2246)

qgallouedec added 2 commits October 18, 2024 09:01

get_batch_sample -> generate_from_model[_and_ref]

979c5c5

add num_items_in_batch=None

ada53cf

num_items_in_batch in training_step

10bffa0

qgallouedec changed the title ~~Rename get_batch_sample and add num_items_in_batch to compute_loss~~ 🔀 Rename get_batch_sample and add num_items_in_batch to compute_loss Oct 18, 2024

qgallouedec requested review from kashif and lewtun October 18, 2024 10:09

qgallouedec marked this pull request as ready for review October 18, 2024 10:09

qgallouedec requested a review from edbeeching October 18, 2024 10:09

kashif approved these changes Oct 18, 2024

View reviewed changes

qgallouedec commented Oct 18, 2024

View reviewed changes

trl/trainer/orpo_trainer.py Outdated Show resolved Hide resolved

qgallouedec commented Oct 18, 2024

View reviewed changes

trl/trainer/cpo_trainer.py Outdated Show resolved Hide resolved

Fix return type hint

ca2d98f

qgallouedec mentioned this pull request Oct 18, 2024

TypeError: XPOTrainer.training_step() takes 3 positional arguments but 4 were given #2247

Closed

4 tasks

qgallouedec linked an issue Oct 18, 2024 that may be closed by this pull request

TypeError: XPOTrainer.training_step() takes 3 positional arguments but 4 were given #2247

Closed

4 tasks

lewtun approved these changes Oct 18, 2024

View reviewed changes

qgallouedec merged commit 31b7820 into main Oct 18, 2024

qgallouedec deleted the rename_get_batch_sample branch October 18, 2024 19:02

This was referenced Oct 24, 2024

Conflict between last version of Transformers.Trainer and DPOTrainer.get_batch_samples #2275

Open

trl dpo AttributeError: 'generator' object has no attribute 'generate' #2292

Closed

yxliu-TAMU pushed a commit to mincheolseong/ECEN743-GRPO-Project-Proposal that referenced this pull request Apr 20, 2025

🔀 Rename get_batch_sample and add num_items_in_batch to `compute_…

ab0a687

…loss` (huggingface#2246)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

🔀 Rename `get_batch_sample` and add `num_items_in_batch` to `compute_loss` #2246

🔀 Rename `get_batch_sample` and add `num_items_in_batch` to `compute_loss` #2246

Uh oh!

qgallouedec commented Oct 18, 2024 •

edited

Loading

Uh oh!

HuggingFaceDocBuilderDev commented Oct 18, 2024

Uh oh!

Uh oh!

Uh oh!

lewtun left a comment

Uh oh!

lewtun Oct 18, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

🔀 Rename get_batch_sample and add num_items_in_batch to compute_loss #2246

🔀 Rename get_batch_sample and add num_items_in_batch to compute_loss #2246

Uh oh!

Conversation

qgallouedec commented Oct 18, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Before submitting

Who can review?

Uh oh!

HuggingFaceDocBuilderDev commented Oct 18, 2024

Uh oh!

Uh oh!

Uh oh!

lewtun left a comment

Choose a reason for hiding this comment

Uh oh!

lewtun Oct 18, 2024

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

🔀 Rename `get_batch_sample` and add `num_items_in_batch` to `compute_loss` #2246

🔀 Rename `get_batch_sample` and add `num_items_in_batch` to `compute_loss` #2246

qgallouedec commented Oct 18, 2024 •

edited

Loading