
Conversation

@zucchini-nlp (Member)

What does this PR do?

Fixes #29551. Strictly speaking, there was nothing to fix: as @gante already explained, the logits and scores for contrastive decoding are identical in that setting because no logits processors were applied.

This PR changes all in-place operations on scores to out-of-place ones, so that the logits and scores are actually different when logits processors are used. Most of this is carried over from the PR for compile compatibility, where we also had to get rid of in-place operations.
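
For illustration, a minimal sketch of the out-of-place pattern (the class name ExampleTemperatureWarper is made up for this example; the real processors live in transformers' logits_process module):

```python
import torch


class ExampleTemperatureWarper:
    """Hypothetical processor illustrating the out-of-place pattern."""

    def __init__(self, temperature: float):
        self.temperature = temperature

    def __call__(self, input_ids: torch.LongTensor, scores: torch.FloatTensor) -> torch.FloatTensor:
        # Out-of-place: the division allocates a new tensor, so the raw logits
        # the caller keeps a reference to are left untouched.
        scores_processed = scores / self.temperature
        return scores_processed
```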

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a Github issue or the forum? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the
    documentation guidelines, and
    here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@gante

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.


@gante gante left a comment


Looking good 👍

Missing: a test that confirms logits == scores when no processors are used in generate, and logits != scores otherwise.
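
A minimal sketch of what such a test could look like (the tiny checkpoint and the exact generate flags are assumptions for illustration, not the test that ended up in the PR):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "hf-internal-testing/tiny-random-gpt2"  # any small causal LM works
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
inputs = tokenizer("Hello", return_tensors="pt")

# No logits processors: raw logits and scores should be identical.
out = model.generate(
    **inputs, max_new_tokens=3, do_sample=False,
    return_dict_in_generate=True, output_logits=True, output_scores=True,
)
assert all(torch.allclose(l, s) for l, s in zip(out.logits, out.scores))

# With warpers applied (e.g. temperature during sampling), they should differ.
out = model.generate(
    **inputs, max_new_tokens=3, do_sample=True, temperature=0.5,
    return_dict_in_generate=True, output_logits=True, output_scores=True,
)
assert any(not torch.allclose(l, s) for l, s in zip(out.logits, out.scores))
```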


- scores.scatter_(1, input_ids, score)
+ scores = scores.scatter(1, input_ids, score)
  return scores
Contributor

nit: we should do return scores_processed or return scores in ALL processors, for the sake of keeping a consistent pattern


@gante gante left a comment


The changes look good 👍

One big question though: no doctests should have changed in the process. Do you know what is causing the change?

@gante gante requested a review from amyeroberts March 20, 2024 11:11

@amyeroberts amyeroberts left a comment


Very nice 🔥 Thanks for working on this!

Quite a few of the changes don't seem to be necessary, but I'm assuming it's for consistency of having scores_processed returned.

Comment on lines +290 to +291
scores_processed = scores / self.temperature
return scores_processed
Contributor

I find it surprising that this causes an in-place modification

In [8]: import torch

In [9]: x = torch.Tensor([1, 2, 3, 4])

In [10]: x
Out[10]: tensor([1., 2., 3., 4.])

In [11]: id(x)
Out[11]: 4339929136

In [12]: x /= 2

In [13]: id(x)
Out[13]: 4339929136

In [14]: x = x / 5

In [15]: id(x)
Out[15]: 10943842160

Member Author

No, it was not causing any modifications on scores itself. That naming is for consistency only :)
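
For completeness, a small illustrative snippet (not from the PR) of the distinction: on tensors, the augmented assignment mutates the storage that the caller's raw logits still point to, while the plain binary op allocates a new tensor:

```python
import torch

raw_logits = torch.tensor([1.0, 2.0, 3.0])
scores = raw_logits  # the generation loop keeps a reference to the raw logits

scores /= 2          # in-place: mutates the shared storage
print(raw_logits)    # tensor([0.5000, 1.0000, 1.5000]) -- raw logits changed too

scores = scores / 5  # out-of-place: binds `scores` to a brand-new tensor
print(raw_logits)    # tensor([0.5000, 1.0000, 1.5000]) -- unchanged from here on
print(scores)        # tensor([0.1000, 0.2000, 0.3000])
```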

  indices_to_remove = sorted_indices_to_remove.scatter(1, sorted_indices, sorted_indices_to_remove)
- scores = scores.masked_fill(indices_to_remove, self.filter_value)
- return scores
+ scores_processed = scores.masked_fill(indices_to_remove, self.filter_value)
Contributor

Same for all the masked_fill calls here

Member Author

yep, naming consistency purposes again
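
For reference, a small illustrative snippet (not from the PR): masked_fill is already out-of-place, unlike its in-place masked_fill_ counterpart, so these hunks only rename the result:

```python
import torch

scores = torch.tensor([[0.1, 0.2, 0.3]])
mask = torch.tensor([[False, True, False]])

# Out-of-place: returns a new tensor; `scores` itself is unchanged.
scores_processed = scores.masked_fill(mask, -float("inf"))
print(scores)            # tensor([[0.1000, 0.2000, 0.3000]])
print(scores_processed)  # tensor([[0.1000, -inf, 0.3000]])

# The in-place variant would overwrite `scores` directly:
# scores.masked_fill_(mask, -float("inf"))
```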

Comment on lines +1430 to +1432
scores_processed = torch.full_like(scores, -math.inf)
scores_processed[:, self.bos_token_id] = 0
return scores_processed
Contributor

A lot nicer :)

@gante gante merged commit fadb053 into huggingface:main Mar 21, 2024


Development

Successfully merging this pull request may close these issues.

  • Contrastive decoding "raw" logits and scores are identical (#29551)
