
Conversation

@nicola-decao (Contributor) commented on Sep 22, 2020

Before submitting

  • Was this discussed/approved via a GitHub issue? (no need for typos, doc improvements)
  • Did you read the contributor guideline?
  • Did you make sure to update the docs?
  • Did you write any new necessary tests?

What does this PR do?

This adds a new decoding strategy, `search.PrefixConstrainedBeamSearch`, that constrains the vocabulary for the next generated token given a prefix (the tokens generated so far during beam search). An end user only needs to pass the optional argument `prefix_allowed_tokens_fn` to `.generate` or `.sample` to activate `PrefixConstrainedBeamSearch`. `prefix_allowed_tokens_fn(batch_id, tokens)` is a callback that, given the `batch_id` and the `tokens` generated so far, returns the list of allowed tokens for the next generation step.
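For illustration, here is a minimal standalone sketch of such a callback. The token ids, the `ALLOWED` prefix table, and the `mask_scores` helper are invented for this example; in fairseq the strategy applies the mask to the model's log-probabilities internally, so only the callback itself would be user code.

```python
# Hypothetical prefix table: maps the tokens generated so far to the
# token ids that may follow them. Token ids here are arbitrary.
ALLOWED = {
    (): [5, 7],     # at the first step, only token 5 or 7 may be generated
    (5,): [9],      # after token 5, only token 9 is allowed
    (7,): [2, 9],   # after token 7, tokens 2 and 9 are allowed
}


def prefix_allowed_tokens_fn(batch_id, tokens):
    """Given the batch index and the tokens generated so far,
    return the list of token ids allowed at the next step."""
    return ALLOWED.get(tuple(tokens), [])


def mask_scores(scores, allowed):
    """Illustrative helper: keep the score of allowed token ids and
    set every other token's score to -inf, so constrained beam search
    can never select a disallowed continuation."""
    allowed = set(allowed)
    return [s if i in allowed else float("-inf")
            for i, s in enumerate(scores)]
```

With this, a constrained decoding step would call `prefix_allowed_tokens_fn(batch_id, tokens_so_far)` and mask the model's scores before picking the top beam candidates.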

Did you have fun?

YES! 🙃

@nicola-decao (Contributor, Author)

The test failed on something that is not part of this pull request.

@myleott left a comment

You can ignore the test_translation_multi_simple_epoch test failure (the psutil import failure has been fixed in trunk).

But the test_ensemble_sequence_generator (tests.test_sequence_generator.TestJitSequenceGenerator) failure seems related (see comment below).

```diff
         if num_remaining_sent == 0:
             break
-        if isinstance(self.search, search.PrefixConstrainedBeamSearch) and step >= max_len:
+        if self.search.stop_on_max_len and step >= max_len:
```
👍

@facebook-github-bot (Contributor) left a comment
@myleott has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@nicola-decao nicola-decao requested a review from myleott October 6, 2020 15:38
@facebook-github-bot (Contributor) left a comment
@myleott has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@nicola-decao (Contributor, Author)

> @myleott has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

I am not a Facebook employee so I cannot see the warnings and why this fails.

@fabiopetroni

> @myleott has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
>
> I am not a Facebook employee so I cannot see the warnings and why this fails.

I'm taking care of this :)

@facebook-github-bot (Contributor)

@myleott merged this pull request in 086fe1c.

jinyiyang-jhu pushed a commit to jinyiyang-jhu/fairseq-jyang that referenced this pull request Feb 26, 2021
Summary:
# Before submitting

- [ ] Was this discussed/approved via a Github issue? (no need for typos, doc improvements)
- [x] Did you read the [contributor guideline](https://github.com/pytorch/fairseq/blob/master/CONTRIBUTING.md)?
- [x] Did you make sure to update the docs?
- [x] Did you write any new necessary tests?

## What does this PR do?
This adds a new decoding strategy, `search.PrefixConstrainedBeamSearch`, that constrains the vocabulary for the next generated token given a prefix (the tokens generated so far during beam search). An end user only needs to pass the optional argument `prefix_allowed_tokens_fn` to `.generate` or `.sample` to activate `PrefixConstrainedBeamSearch`. `prefix_allowed_tokens_fn(batch_id, tokens)` is a callback that, given the `batch_id` and the `tokens` generated so far, returns the list of allowed tokens for the next generation step.

## Did you have fun?
YES! 🙃

Pull Request resolved: facebookresearch/fairseq#2646

Reviewed By: fabiopetroni

Differential Revision: D24006805

Pulled By: myleott

fbshipit-source-id: 40b1a866c6ea9f936272db27e2a020b18dbf8164
caltia pushed a commit to caltia/fairseq that referenced this pull request Jul 8, 2025

4 participants