[`kernels`] If flash attention2 is not installed / fails to import (cc on our cluster) default to kernels by ArthurZucker · Pull Request #40178 · huggingface/transformers

ArthurZucker · 2025-08-14T18:17:40Z

Improves handling of FlashAttention2 and adds a community kernel fallback

Updated `_check_and_adjust_attn_implementation` to set the attention implementation to `kernels-community/flash-attn` when FlashAttention2 is requested but is not installed or fails to import, so that the fallback happens automatically and the kernel is registered properly.
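For readers following along, the fallback decision can be sketched roughly as below. This is a simplified illustration under stated assumptions, not the code in `modeling_utils.py`: `resolve_attn_implementation` and `is_importable` are hypothetical names, and the kernel loading and registration done by the real `_check_and_adjust_attn_implementation` are left out.

```python
import importlib
import importlib.util

# Repo id named in the PR description.
FALLBACK_KERNEL = "kernels-community/flash-attn"


def is_importable(module_name):
    """Hypothetical helper: True only if the module is found AND imports cleanly."""
    try:
        if importlib.util.find_spec(module_name) is None:
            return False
        importlib.import_module(module_name)
    except Exception:
        # Covers the "installed but fails to import" case from the PR title.
        return False
    return True


def resolve_attn_implementation(requested, flash_available=None, kernels_available=None):
    """Hypothetical sketch of the fallback decision.

    The two *_available flags exist only so the logic can be exercised
    without touching the local environment; when left as None they are probed.
    """
    if requested != "flash_attention_2":
        return requested  # other implementations are left untouched

    if flash_available is None:
        flash_available = is_importable("flash_attn")
    if flash_available:
        return requested

    if kernels_available is None:
        kernels_available = is_importable("kernels")
    if kernels_available:
        return FALLBACK_KERNEL  # fall back to the community kernel

    raise ImportError(
        "flash_attention_2 was requested, but neither flash_attn nor kernels could be imported"
    )
```

Importing the module, rather than only checking that it is installed, is what lets the sketch cover a broken install as well as a missing one.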

Testing Improvements:

Modified `require_flash_attn` in `testing_utils.py` so that tests run if either FlashAttention2 or the community kernel is available, which broadens test coverage and makes the tests more reliable.
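The relaxed test requirement could look something like the sketch below. It assumes a `unittest.skipUnless`-based decorator; `require_flash_attn_or_kernels` and `_has_module` are hypothetical names, and the actual `require_flash_attn` in `testing_utils.py` may check availability differently.

```python
import importlib.util
import unittest


def _has_module(name):
    """Hypothetical helper: True if the top-level module can be located."""
    try:
        return importlib.util.find_spec(name) is not None
    except (ImportError, ValueError):
        return False


def require_flash_attn_or_kernels(test_case, flash_available=None, kernels_available=None):
    """Hypothetical decorator: skip unless flash_attn OR kernels is available.

    The keyword flags are only there to make the behaviour easy to check;
    when left as None the environment is probed.
    """
    if flash_available is None:
        flash_available = _has_module("flash_attn")
    if kernels_available is None:
        kernels_available = _has_module("kernels")
    return unittest.skipUnless(
        flash_available or kernels_available,
        "test requires Flash Attention 2 or the community flash-attn kernel",
    )(test_case)
```

Used as a plain decorator (`@require_flash_attn_or_kernels`) on a test method, the test is skipped only when neither backend can be found.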

HuggingFaceDocBuilderDev · 2025-08-14T18:30:35Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

ArthurZucker · 2025-08-15T09:29:18Z

run-slow: flash_attention_2

ArthurZucker · 2025-08-20T13:10:57Z

run-slow: flash_attention_2

ArthurZucker · 2025-08-20T13:16:04Z

run-slow: flash_attention_2

Cyrilvallez

I think the following is a bit more appropriate, although this code part is starting to be quite convoluted

ArthurZucker · 2025-08-28T13:34:52Z

run-slow: flash_attention_2

ArthurZucker · 2025-08-28T13:46:22Z

run-slow: flash_attention_2

Cyrilvallez · 2025-08-28T14:20:12Z

Confirmed on tests locally that it works! Merging

first step if flash not installed but you set to use it

20ba3bf

ArthurZucker added 2 commits August 14, 2025 19:05

try importing

c5e6ec5

now default to using it

febe83e

ArthurZucker and others added 3 commits August 15, 2025 11:30

Merge branch 'main' into kernels-by-default

3e133a3

update our tests as well

856a732

wow yesterday I was not awake

f230552

ArthurZucker force-pushed the kernels-by-default branch from 2257696 to f230552 on August 15, 2025 09:55

ArthurZucker added 4 commits August 15, 2025 09:57

fixup

f2ef0b1

style

45ccc81

lol the fix was very very simple

e6a0755

Merge branch 'kernels-by-default' of github.com:huggingface/transformers into kernels-by-default

f744f09

`RUN python3 -m pip install --no-cache-dir git+https://github.com/huggingface/kernels@main#egg=kernels` for updated dockers

1198c40

ArthurZucker added Flash Attention kernels labels Aug 20, 2025

Cyrilvallez reviewed Aug 20, 2025

Comment thread src/transformers/modeling_utils.py Outdated

Comment thread src/transformers/modeling_utils.py Outdated

Cyrilvallez and others added 3 commits August 28, 2025 15:10

Merge branch 'main' into kernels-by-default

5f17bc8

push review comments

5413c3d

fix

08e9b64

Cyrilvallez merged commit 851b8f2 into main Aug 28, 2025
23 of 25 checks passed

Cyrilvallez deleted the kernels-by-default branch August 28, 2025 14:20

albertvillanova mentioned this pull request Oct 17, 2025

Replace unittest skipTest from transformers with pytest.skip huggingface/trl#4297

Merged

Cyrilvallez merged 14 commits into main from kernels-by-default


3 participants
