Fix gluon attention example for fp8 by masahi · Pull Request #10399 · triton-lang/triton

masahi · 2026-05-28T00:00:13Z

Currently, running python python/examples/gluon/01-attention-forward.py fails with:

python/examples/gluon/01-attention-forward.py:386:14: error: source and result must have the same logical storage size (4096 vs 2048)
    p_tmem = s_tmem._reinterpret(config.dtype, [config.SPLIT_M, 2 * config.BLOCK_N], config.p_tmem_layout)

This happens when the benchmark is run with fp8. Running pytest on python/examples/gluon/01-attention-forward.py does not fail only because the tests do not cover fp8.

The regression was introduced by #10243, but the underlying problem, code that hardcodes a fixed column count such as config.BLOCK_N // 2, has been there for a long time. It previously worked only because the convention for reinterpret was loose.
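The size mismatch in the error message can be reproduced with plain arithmetic. The sketch below is not the patch from this PR and does not call any Triton or Gluon API. It assumes that the "logical storage size" is the column count times the element bit width, that s_tmem holds fp32 values, and that BLOCK_N is 128; these assumptions are consistent with the reported 4096 vs 2048, but they are not confirmed by the source. The helper names row_storage_bits and reinterpreted_cols are hypothetical.

```python
# Illustrative arithmetic only: no Triton/Gluon calls are made here.
# Assumptions (not confirmed by the PR): s_tmem stores fp32 values,
# BLOCK_N == 128, and storage size == columns * element bit width.

FP32_BITS, FP16_BITS, FP8_BITS = 32, 16, 8
BLOCK_N = 128  # assumed value, chosen because it matches the error message


def row_storage_bits(num_cols: int, elem_bits: int) -> int:
    """Bits occupied by one row of num_cols elements (hypothetical helper)."""
    return num_cols * elem_bits


def reinterpreted_cols(src_cols: int, src_bits: int, dst_bits: int) -> int:
    """Column count that keeps the row storage size unchanged (hypothetical helper)."""
    total = row_storage_bits(src_cols, src_bits)
    if total % dst_bits != 0:
        raise ValueError("source row size is not a multiple of the target width")
    return total // dst_bits


source_bits = row_storage_bits(BLOCK_N, FP32_BITS)

# A hardcoded factor of 2 preserves the size for 16-bit types...
fp16_bits = row_storage_bits(2 * BLOCK_N, FP16_BITS)
# ...but halves it for 8-bit types, which a strict size check rejects.
fp8_bits = row_storage_bits(2 * BLOCK_N, FP8_BITS)

# Deriving the column count from the bit widths works for both cases.
fp16_cols = reinterpreted_cols(BLOCK_N, FP32_BITS, FP16_BITS)
fp8_cols = reinterpreted_cols(BLOCK_N, FP32_BITS, FP8_BITS)

print(source_bits, fp16_bits, fp8_bits)
print(fp16_cols, fp8_cols)
```

Under these assumptions, a fixed factor such as 2 * config.BLOCK_N (or config.BLOCK_N // 2 in the other direction) is only correct when the target type is half the width of fp32; deriving the factor from the element width covers fp8 as well.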

fixed gluon attention example for fp8

e59c648

masahi requested a review from ptillet as a code owner May 28, 2026 00:00

masahi requested review from Mogball and lezcano and removed request for ptillet May 28, 2026 00:00

Mogball approved these changes May 28, 2026

View reviewed changes

masahi merged commit 6650ee3 into triton-lang:main May 28, 2026
10 checks passed

masahi deleted the fix-gluon-attn branch May 28, 2026 06:29

masahi merged 1 commit into triton-lang:main from masahi:fix-gluon-attn
