
Conversation

@staghado (Contributor) commented Dec 9, 2023

What does this PR do?

This PR adds Flash Attention 2 support for the MusicGen model. It is based on the Bart example and is a WIP for now.
I could not test the model because FA2 is not yet supported on T4 GPUs.
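For reference, the general FA2 pattern (as in the Bart example) looks roughly like the sketch below. This is illustrative only, not the final code: the class name and the simplified self-attention-only forward are assumptions, and the real port also has to handle cross-attention, masks, and caching.

import torch
from flash_attn import flash_attn_func  # requires flash-attn >= 2.0 and an Ampere+ GPU

class MusicgenFlashAttention2(torch.nn.Module):
    def __init__(self, embed_dim: int, num_heads: int, dropout: float = 0.0):
        super().__init__()
        self.num_heads = num_heads
        self.head_dim = embed_dim // num_heads
        self.dropout = dropout
        self.q_proj = torch.nn.Linear(embed_dim, embed_dim)
        self.k_proj = torch.nn.Linear(embed_dim, embed_dim)
        self.v_proj = torch.nn.Linear(embed_dim, embed_dim)
        self.out_proj = torch.nn.Linear(embed_dim, embed_dim)

    def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:
        # hidden_states must be fp16/bf16 and on CUDA for the flash-attn kernels
        bsz, seq_len, _ = hidden_states.size()
        # flash_attn_func expects (batch, seq_len, num_heads, head_dim)
        q = self.q_proj(hidden_states).view(bsz, seq_len, self.num_heads, self.head_dim)
        k = self.k_proj(hidden_states).view(bsz, seq_len, self.num_heads, self.head_dim)
        v = self.v_proj(hidden_states).view(bsz, seq_len, self.num_heads, self.head_dim)
        attn = flash_attn_func(q, k, v, dropout_p=self.dropout if self.training else 0.0, causal=True)
        return self.out_proj(attn.reshape(bsz, seq_len, -1))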

Fixes #27552

@sanchit-gandhi @ylacombe

@ylacombe (Contributor) commented:

Hey @staghado, thanks for taking care of this, let us know when it's ready to be reviewed!

@circuluspibo left a comment:

working test

@staghado (Contributor, Author) commented Jan 6, 2024

I have conducted some tests on an A10 GPU:
- The code seems to work without errors when _supports_flash_attn_2 is set to True for MusicgenForConditionalGeneration, but the model does not load with FA2 unless this is specified by hand. Maybe the flag needs to be added at the class level in MusicgenForConditionalGeneration?
- There is no difference in generation speed between eager attention and FA2:

(Screenshot from 2024-01-06 21-50-49: generation-speed comparison, eager vs. FA2)
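For reference, a minimal sketch of how such a comparison can be timed; the checkpoint, prompt, and generation length here are assumptions, not necessarily what was actually run:

import time
import torch
from transformers import AutoProcessor, MusicgenForConditionalGeneration

processor = AutoProcessor.from_pretrained("facebook/musicgen-small")
inputs = processor(text=["80s pop track with bassy drums"], return_tensors="pt").to("cuda")

for impl in ["eager", "flash_attention_2"]:
    model = MusicgenForConditionalGeneration.from_pretrained(
        "facebook/musicgen-small",
        torch_dtype=torch.float16,
        attn_implementation=impl,
    ).to("cuda")
    torch.cuda.synchronize()
    start = time.perf_counter()
    model.generate(**inputs, max_new_tokens=256)
    torch.cuda.synchronize()
    print(impl, f"{time.perf_counter() - start:.2f}s")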

@sanchit-gandhi (Contributor) commented:

cc @ylacombe, could you possibly circle back here when you get the chance?

@ylacombe (Contributor) left a comment:

Hi @staghado, thanks for the update here!

I believe you have to add _supports_flash_attn_2 to MusicgenForConditionalGeneration and MusicgenForCausalLM, otherwise they won't be flagged as supporting FA2!

Also, did you make sure to pass attn_implementation="flash_attention_2" to from_pretrained, as indicated here? Let me know.
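Concretely, a rough sketch of both pieces (a fragment of modeling_musicgen.py, not self-contained; the exact base classes follow the existing file, and the checkpoint name is illustrative):

# modeling_musicgen.py: flag each pretrained-model class that supports FA2
class MusicgenForCausalLM(MusicgenPreTrainedModel):
    _supports_flash_attn_2 = True
    ...

class MusicgenForConditionalGeneration(PreTrainedModel):
    _supports_flash_attn_2 = True
    ...

# user side: opt in at load time
model = MusicgenForConditionalGeneration.from_pretrained(
    "facebook/musicgen-small", attn_implementation="flash_attention_2"
)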

Besides that, the modeling code looks good, except for some formatting changes that shouldn't happen: a comma added at the end of the line, all along the modeling file.

You can also add Musicgen to the list of supported models here.

Let me know if you need further help!

staghado and others added 4 commits January 15, 2024 13:10
Co-authored-by: Yoach Lacombe <52246514+ylacombe@users.noreply.github.com>
Co-authored-by: Yoach Lacombe <52246514+ylacombe@users.noreply.github.com>
@staghado (Contributor, Author) commented Jan 15, 2024

Hi @ylacombe,

I confirm that the model was instantiated as described here, with the exception of torch_dtype=torch.float16 instead of torch_dtype=torch.bfloat16, because some operations did not seem to implement bfloat16.
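That is, the load call looked roughly like the following (the checkpoint name here is illustrative):

model = MusicgenForConditionalGeneration.from_pretrained(
    "facebook/musicgen-small",
    torch_dtype=torch.float16,  # bfloat16 hit ops without bf16 support
    attn_implementation="flash_attention_2",
)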

@staghado staghado changed the title from "[WIP] Adding FA2 support for MusicGen" to "Adding FA2 support for MusicGen" on Jan 25, 2024
@github-actions (Contributor) commented:

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

@ylacombe (Contributor) left a comment:

Hey @staghado, thanks for iterating here! And sorry for the delay!

There are still some changes that appeared because of added commas, but otherwise this looks good to me!

cc @amyeroberts for a review! (By the way, there's no need for additional tests, right?)

Comment on lines +1074 to +1080
for v in [
    hidden_states,
    next_cache,
    all_hidden_states,
    all_self_attns,
    all_cross_attentions,
]

Should be a one-liner.

Suggested change:
-for v in [
-    hidden_states,
-    next_cache,
-    all_hidden_states,
-    all_self_attns,
-    all_cross_attentions,
-]
+for v in [hidden_states, next_cache, all_hidden_states, all_self_attns, all_cross_attentions]

Comment on lines +1336 to +1341
torch.ones(
    (bsz, num_codebooks, max_length),
    dtype=torch.long,
    device=input_ids.device,
)
* -1

Same
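That is, collapsed to one line (same arguments, unchanged behavior):

torch.ones((bsz, num_codebooks, max_length), dtype=torch.long, device=input_ids.device) * -1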

Comment on lines +1362 to +1363
    torch.ones((channel_codebooks, max_length), dtype=torch.bool),
    diagonal=max_length - channel_codebooks + 1,

Same

Comment on lines +1521 to +1523
    input_ids,
    generation_config.pad_token_id,
    generation_config.eos_token_id,

Same

Comment on lines +1782 to +1784
    self.text_encoder,
    self.decoder._modules[decoder_base_model_prefix],
    self.decoder.base_model_prefix,

Same

Comment on lines +1945 to +1947
    text_encoder_pretrained_model_name_or_path,
    **kwargs_text_encoder,
    return_unused_kwargs=True,

Same here and for all the other changes below

Successfully merging this pull request may close these issues: Flash Attention 2 for audio/musicgen