
Conversation

@baskrahmer
Contributor

What does this PR do?

What the title says :)
To make the tests with the key-value cache work, the input format had to be changed to be compatible with the Bloom cache format. Perhaps this could be automated by adding it to Transformers directly (for the generate method this is done in huggingface/transformers#20213).
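For context, a minimal sketch of the layout conversion in question, assuming the standard cache layout `[batch_size, num_heads, seq_length, head_dim]`. The helper name is hypothetical, not this PR's code (Transformers ships similar conversion helpers on the Bloom model for the generate path, per the linked PR):

```python
import torch

def to_bloom_cache(past_key_values):
    """Hypothetical helper: reshape a standard KV cache into Bloom's layout.

    Standard layout (per layer): key and value of shape
        [batch_size, num_heads, seq_length, head_dim]
    Bloom layout (per layer):
        key:   [batch_size * num_heads, head_dim, seq_length]
        value: [batch_size * num_heads, seq_length, head_dim]
    """
    converted = []
    for key, value in past_key_values:
        batch_size, num_heads, seq_length, head_dim = key.shape
        converted.append((
            # fuse batch and head dims, then swap seq_length/head_dim for the key
            key.reshape(batch_size * num_heads, seq_length, head_dim).transpose(1, 2),
            value.reshape(batch_size * num_heads, seq_length, head_dim),
        ))
    return tuple(converted)
```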

Fixes #1175 (BetterTransformer support for BLOOM)

This is a duplicate of PR #1187, but I messed up the rebase there, so I closed that one ;)

Before submitting

  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

@nlpcat

nlpcat commented Aug 4, 2023

Can I ask when this PR will be merged to optimize BLOOM? PyTorch just added a memory-efficient attention optimization that supports attention bias: pytorch/pytorch#104310
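For reference, a hedged sketch of what that PyTorch change enables: passing an additive attention bias (such as BLOOM's ALiBi bias) to `scaled_dot_product_attention` so the memory-efficient backend can be used. All shapes and values below are illustrative assumptions:

```python
import torch
import torch.nn.functional as F

# Illustrative shapes: [batch, num_heads, seq_len, head_dim]
q = torch.randn(2, 8, 128, 64, dtype=torch.float16, device="cuda")
k = torch.randn(2, 8, 128, 64, dtype=torch.float16, device="cuda")
v = torch.randn(2, 8, 128, 64, dtype=torch.float16, device="cuda")

# Additive attention bias, e.g. ALiBi: [batch, num_heads, seq_len, seq_len]
alibi_bias = torch.randn(2, 8, 128, 128, dtype=torch.float16, device="cuda")

# With pytorch/pytorch#104310, the memory-efficient kernel can take an
# arbitrary attn_mask/bias instead of falling back to the math path.
out = F.scaled_dot_product_attention(q, k, v, attn_mask=alibi_bias)
```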


@fxmarty
Contributor

fxmarty commented Aug 7, 2023

Thank you, I'll make sure it is merged this week!

Contributor

@fxmarty fxmarty left a comment


LGTM, thanks a lot for the addition @baskrahmer! It's working smoothly.

About untangling batch_size from self.num_heads: was it necessary for numerical equivalence?

@fxmarty fxmarty merged commit 456b28f into huggingface:main Aug 11, 2023
@baskrahmer
Contributor Author

@fxmarty thanks for the review. IIRC, the batch size is untangled because Bloom's key/value cache is structured a bit differently from that of other autoregressive language models.
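A minimal sketch of what that structural difference implies, with illustrative names and sizes (assumptions, not the PR's code): since Bloom fuses batch and heads into the cache's leading dimension, the batch size is not a standalone dimension and has to be derived from it.

```python
import torch

num_heads, head_dim, seq_length = 8, 64, 16
# Bloom key layout: [batch_size * num_heads, head_dim, seq_length]
past_key = torch.randn(2 * num_heads, head_dim, seq_length)

# batch_size must be recovered from the fused leading dimension
batch_size = past_key.shape[0] // num_heads
assert past_key.shape[0] == batch_size * num_heads
```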

