[WIP] Add Flax diverse group search #24508
Conversation
sanchit-gandhi
left a comment
This looks promising already @yeandy! Left some comments regarding the design below. In addition, could we add a few tests to confirm that:
- Group beam search runs when we call `model.generate`
- That group beam search is jit'able
- And that we get equivalence with PyTorch
```python
trace: bool = True,
params: Optional[Dict[str, jnp.ndarray]] = None,
num_return_sequences: Optional[int] = None,
num_beam_groups: Optional[int] = 1,
```
In PyTorch we define a separate beam search method for group beam search:
transformers/src/transformers/generation/utils.py
Line 3375 in 33b5ef5
def group_beam_search(
We only trigger this method if num_beam_groups>1:
transformers/src/transformers/generation/utils.py
Line 1426 in 33b5ef5
is_group_beam_gen_mode = (
My opinion is that we should have a separate group beam search method in Flax as well, rather than adding to the existing one. IMO this is cleaner for the reader and more compartmentalised to build on.
cc @gante as well for Flax generate design decision
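For concreteness, a minimal sketch of what this dispatch could look like on the Flax side (the helper name and mode strings are illustrative assumptions, not from this PR):

```python
# Hypothetical dispatch helper, mirroring PyTorch's is_group_beam_gen_mode check
def get_generation_mode(num_beams: int, num_beam_groups: int) -> str:
    if num_beam_groups > 1:
        if num_beam_groups > num_beams:
            raise ValueError("`num_beam_groups` has to be smaller or equal to `num_beams`")
        return "group_beam_search"  # route to the separate group beam search method
    if num_beams > 1:
        return "beam_search"
    return "greedy_search"
```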
Thanks @sanchit-gandhi!
My first commit was to get a prototype working for num_beam_groups=1. I intend to refactor the beam search logic to make sure it works for other num_beam_groups sizes.
- Will do.
- My current logic is jittable, as I've been doing some testing from this example. Are there tests in the HF repo that explicitly test whether a function is jittable? Or is it sufficient to have an E2E test that jits the function?
- Will do.
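For reference, one self-contained way to check jittability outside the test suite is to trace the function with `jax.jit` and compare against the eager result (a toy sketch, not an existing HF test helper):

```python
import jax
import jax.numpy as jnp

def beam_step(log_probs):
    # toy stand-in for one step of (group) beam search
    return jax.lax.top_k(log_probs, k=2)

x = jnp.log(jnp.array([0.1, 0.2, 0.3, 0.4]))
eager_vals, eager_idx = beam_step(x)
jit_vals, jit_idx = jax.jit(beam_step)(x)
# if tracing succeeded and the outputs match, beam_step is jittable
assert jnp.allclose(eager_vals, jit_vals)
assert (eager_idx == jit_idx).all()
```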
My opinion is that we should have a separate group beam search method in Flax as well, rather than adding to the existing one.
+1 :)
(btw, there was a recent bugfix on the PT side, might be relevant here)
Awesome, sounds good @yeandy! Excited to see how this pans out!
```python
add_penalty = ~did_topk_just_finished | beams_in_batch_are_full
topk_log_probs += add_penalty * np.array(-1.0e7)

# Add additional logic for diverse beam search
```
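For context, the "additional logic" referenced here is presumably along the lines of the Hamming diversity penalty that PyTorch applies via `HammingDiversityLogitsProcessor`. A minimal JAX sketch, with assumed names and shapes:

```python
import jax.numpy as jnp

def apply_diversity_penalty(scores, previous_group_tokens, diversity_penalty, vocab_size):
    # count how often each token was already chosen by earlier beam groups
    # at this decoding step, then penalise those tokens for the current group
    token_frequency = jnp.bincount(previous_group_tokens, length=vocab_size)
    return scores - diversity_penalty * token_frequency
```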
Nice! My only nit is that we try and avoid lambda functions in transformers - would you be able to re-write these as standard function definitions please?
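For illustration, the kind of rewrite being asked for (the lambda below is a hypothetical example, not the actual code from the diff):

```python
# before: a lambda, which transformers style avoids
get_group_offset = lambda group_idx, group_size: group_idx * group_size

# after: a standard, named function definition
def get_group_offset(group_idx, group_size):
    # index of the first beam belonging to this group
    return group_idx * group_size
```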
Yes
Lovely, thanks!
This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread. Please note that issues that do not follow the contributing guidelines are likely to be ignored.

Hey @yeandy! This PR is looking in good shape - thanks for your efforts so far! Would you like to go all the way and see it to completion? Happy to help with the remainder of the integration!

Hey @sanchit-gandhi. Due to other commitments, I currently don't have bandwidth to continue this. And the timeline for me to get to this is unknown right now. If someone else wants to work on this, I'm ok with that.

Thanks for letting me know @yeandy! Best of luck with your other commitments, I hope they go well 🤗 Opening this one up to the community to complete!

For those wondering about the status of this PR: it seems all TF/Flax support has been deprecated, so this PR is no longer in scope.

Yes, this should have been closed long ago!
What does this PR do?
Mimics #9006, but for Flax.
We want to match how PyTorch's logic accounts for `group_size` and `num_beam_groups` here and here.
Fixes # (issue)
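As a rough illustration of the grouping arithmetic being mirrored here (a sketch; the variable names follow the PyTorch implementation, the loop itself is illustrative):

```python
num_beams = 6
num_beam_groups = 3
group_size = num_beams // num_beam_groups  # beams per group, as in PyTorch

for group_idx in range(num_beam_groups):
    group_start = group_idx * group_size
    group_end = min(group_start + group_size, num_beams)
    # beams[group_start:group_end] form one group; tokens already chosen by
    # earlier groups at this step drive the diversity penalty for this group
    print(f"group {group_idx}: beams {group_start}..{group_end - 1}")
```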
Before submitting
- Did you read the contributor guideline, Pull Request section?
- Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
- Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.