Conversation

@fegin (Contributor) commented Oct 7, 2025

With max-autotune, FlexAttention is not deterministic even if `torch.use_deterministic_algorithms(True)` is set, so deterministic mode should also disable max-autotune. This PR adds a flag to turn off max-autotune; since it is a debug flag, it is implemented as an environment variable.

The flag is `DISABLE_TORCHTITAN_FLEX_MAX_AUTOTUNE`.
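A minimal sketch of how such a flag could gate FlexAttention compilation; the helper function and the `"max-autotune-no-cudagraphs"` mode string are illustrative assumptions, and only the `DISABLE_TORCHTITAN_FLEX_MAX_AUTOTUNE` variable name comes from this PR:

```python
import os

import torch
from torch.nn.attention.flex_attention import flex_attention


def _flex_attention_compile_mode():
    # Hypothetical helper: skip max-autotune when the debug flag is set or
    # when deterministic algorithms are requested, since max-autotune's
    # benchmarking makes kernel selection (and thus results) nondeterministic.
    if os.environ.get("DISABLE_TORCHTITAN_FLEX_MAX_AUTOTUNE") == "1":
        return None  # fall back to torch.compile's default mode
    if torch.are_deterministic_algorithms_enabled():
        return None
    return "max-autotune-no-cudagraphs"


# Compile FlexAttention once with the chosen mode; mode=None means default.
compiled_flex_attention = torch.compile(
    flex_attention, mode=_flex_attention_compile_mode()
)
```

Reading the environment variable at compile time keeps the debug switch out of the config surface, which matches the PR's rationale for using an environment variable for a debug-only flag.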
@meta-cla bot added the CLA Signed label (managed by the Meta Open Source bot) on Oct 7, 2025
@fegin changed the title from "Add an environment variable to disable FlexAttention max-autotune" to "Disable FlexAttention max-autotune when deterministic is used" on Oct 7, 2025
@fegin requested a review from @wwwjn on October 7, 2025 at 23:00
@wwwjn (Contributor) left a comment

LGTM!

@fegin merged commit 41eff53 into main on Oct 8, 2025
8 checks passed
githubsgi pushed a commit to githubsgi/torchtitan that referenced this pull request Oct 13, 2025
…h#1808)

With max-autotune, FlexAttention is not deterministic even if
torch.use_deterministic_algorithms is True. When deterministic mode is
set, we should also remove the usage of `max-autotune`.
githubsgi pushed further commits to githubsgi/torchtitan referencing this pull request on Oct 15, Oct 16 (twice), and Oct 29 (twice), 2025, each with the same commit message as above.