Infer src/dst of allowReorder reshapes by neildhar · Pull Request #9926 · triton-lang/triton

neildhar · 2026-04-04T03:06:42Z

Always infer the src/dst of reshapes, even if allowReorder is set. The
result is valid for allowReorder reshapes, even if there isn't a single
canonical encoding. When the existing encoding is one of the possible
results, we prefer that to minimize changes.

This allows inference to always succeed on reshapes, and any heuristics
on whether to use the inferred value can be maintained by the caller.

One example I identified while looking at this was that allowReorder
reshapes will currently fail backward remat in RemoveLayoutConversions
if the reshape cannot be rematerialised with the same source encoding.
This PR instead changes RemoveLayoutConversions to check specifically
for whether the reshape has been marked as efficient, and otherwise
just do the remat. (this is a potentially perf sensitive change)

neildhar · 2026-04-04T03:08:15Z

This is intended to supersede #9906 which somehow got merged while trying to update my stack. See the discussion there.

`inferDstEncoding` currently tries to unconditionally build a sliced encoding from the src encoding it is given. But this is incorrect if the source is rank 1, since we can't take a slice of a rank 1 tensor.

Always infer the src/dst of reshapes, even if allowReorder is set. The result is valid for allowReorder reshapes, even if there isn't a single canonical encoding. When the existing encoding is one of the possible results, we prefer that to minimize changes. This allows inference to always succeed on reshapes, and any heuristics on whether to use the inferred value can be maintained by the caller. One example I identified while looking at this was that allowReorder reshapes will currently fail backward remat in RemoveLayoutConversions if the reshape cannot be rematerialised with the same source encoding. This PR instead changes RemoveLayoutConversions to check specifically for whether the reshape has been marked as efficient, and otherwise just do the remat. (this is a potentially perf sensitive change)

This reverts commit 5f96878.

…erify allowReorder reshapes (#9905)" (#9983)

Always infer the src/dst of reshapes, even if allowReorder is set. The result is valid for allowReorder reshapes, even if there isn't a single canonical encoding. When the existing encoding is one of the possible results, we prefer that to minimize changes. This allows inference to always succeed on reshapes, and any heuristics on whether to use the inferred value can be maintained by the caller. One example I identified while looking at this was that allowReorder reshapes will currently fail backward remat in RemoveLayoutConversions if the reshape cannot be rematerialised with the same source encoding. This PR instead changes RemoveLayoutConversions to check specifically for whether the reshape has been marked as efficient, and otherwise just do the remat. (this is a potentially perf sensitive change)

Reland of #9926. Always infer the src/dst of reshapes, even if allowReorder is set. The result is valid for allowReorder reshapes, even if there isn't a single canonical encoding. When the existing encoding is one of the possible results, we prefer that to minimize changes. This allows inference to always succeed on reshapes, and any heuristics on whether to use the inferred value can be maintained by the caller. One example I identified while looking at this was that allowReorder reshapes will currently fail backward remat in RemoveLayoutConversions if the reshape cannot be rematerialised with the same source encoding. This PR instead changes RemoveLayoutConversions to check specifically for whether the reshape has been marked as efficient, and otherwise just do the remat. (this is a potentially perf sensitive change)

Always infer the src/dst of reshapes, even if allowReorder is set. The result is valid for allowReorder reshapes, even if there isn't a single canonical encoding. When the existing encoding is one of the possible results, we prefer that to minimize changes. This allows inference to always succeed on reshapes, and any heuristics on whether to use the inferred value can be maintained by the caller. One example I identified while looking at this was that allowReorder reshapes will currently fail backward remat in RemoveLayoutConversions if the reshape cannot be rematerialised with the same source encoding. This PR instead changes RemoveLayoutConversions to check specifically for whether the reshape has been marked as efficient, and otherwise just do the remat. (this is a potentially perf sensitive change)

…d Revert "Verify allowReorder reshapes (triton-lang#9905)" (triton-lang#9983)

Reland of #9926. Always infer the src/dst of reshapes, even if allowReorder is set. The result is valid for allowReorder reshapes, even if there isn't a single canonical encoding. When the existing encoding is one of the possible results, we prefer that to minimize changes. This allows inference to always succeed on reshapes, and any heuristics on whether to use the inferred value can be maintained by the caller. One example I identified while looking at this was that allowReorder reshapes will currently fail backward remat in RemoveLayoutConversions if the reshape cannot be rematerialised with the same source encoding. This PR instead changes RemoveLayoutConversions to check specifically for whether the reshape has been marked as efficient, and otherwise just do the remat. (this is a potentially perf sensitive change)

Reland of triton-lang#9926. Always infer the src/dst of reshapes, even if allowReorder is set. The result is valid for allowReorder reshapes, even if there isn't a single canonical encoding. When the existing encoding is one of the possible results, we prefer that to minimize changes. This allows inference to always succeed on reshapes, and any heuristics on whether to use the inferred value can be maintained by the caller. One example I identified while looking at this was that allowReorder reshapes will currently fail backward remat in RemoveLayoutConversions if the reshape cannot be rematerialised with the same source encoding. This PR instead changes RemoveLayoutConversions to check specifically for whether the reshape has been marked as efficient, and otherwise just do the remat. (this is a potentially perf sensitive change)

neildhar changed the base branch from main to neildhar/pr9924 April 4, 2026 03:06

This was referenced Apr 4, 2026

Fix inferDstEncoding for rank 1 reduction #9925

Merged

Verify allowReorder reshapes #9905

Merged

neildhar mentioned this pull request Apr 4, 2026

Infer src/dst of allowReorder reshapes #9906

Merged

neildhar force-pushed the neildhar/pr9924 branch from 8f111e7 to 0fd4805 Compare April 4, 2026 20:21

neildhar force-pushed the neildhar/pr9926 branch from 3a9768a to 5ceb73f Compare April 4, 2026 20:21

neildhar added 2 commits April 4, 2026 16:11

Fix inferDstEncoding for rank 1 reduction

2d5292a

`inferDstEncoding` currently tries to unconditionally build a sliced encoding from the src encoding it is given. But this is incorrect if the source is rank 1, since we can't take a slice of a rank 1 tensor.

neildhar force-pushed the neildhar/pr9926 branch from 5ceb73f to 0b219b4 Compare April 5, 2026 01:00

neildhar force-pushed the neildhar/pr9924 branch from 0fd4805 to 2d5292a Compare April 5, 2026 01:00

Base automatically changed from neildhar/pr9924 to main April 6, 2026 22:04

neildhar marked this pull request as ready for review April 6, 2026 22:05

neildhar requested review from peterbell10 and ptillet as code owners April 6, 2026 22:05

neildhar requested review from ThomasRaoux and lezcano April 6, 2026 22:05

ThomasRaoux approved these changes Apr 7, 2026

View reviewed changes

neildhar merged commit 5f96878 into main Apr 7, 2026
20 of 23 checks passed

neildhar deleted the neildhar/pr9926 branch April 7, 2026 00:38

ThomasRaoux added a commit to ThomasRaoux/triton that referenced this pull request Apr 9, 2026

Revert "Infer src/dst of allowReorder reshapes (triton-lang#9926)"

0ea2474

This reverts commit 5f96878.

ThomasRaoux added a commit that referenced this pull request Apr 9, 2026

Revert "Infer src/dst of allowReorder reshapes (#9926)" and Revert "V…

c84e9d4

…erify allowReorder reshapes (#9905)" (#9983)

This was referenced Apr 10, 2026

Fix constraints in isExpensiveCat #9995

Merged

Add a verifier for CatOp #9996

Merged

neildhar mentioned this pull request Apr 10, 2026

[RELAND] Infer src/dst of allowReorder reshape #9997

Merged

plognjen pushed a commit to plognjen/triton that referenced this pull request Apr 14, 2026

Revert "Infer src/dst of allowReorder reshapes (triton-lang#9926)" an…

b7c5eac

…d Revert "Verify allowReorder reshapes (triton-lang#9905)" (triton-lang#9983)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Infer src/dst of allowReorder reshapes#9926

Infer src/dst of allowReorder reshapes#9926
neildhar merged 2 commits into
mainfrom
neildhar/pr9926

neildhar commented Apr 4, 2026 •

edited

Loading

Uh oh!

neildhar commented Apr 4, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

neildhar commented Apr 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

neildhar commented Apr 4, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

neildhar commented Apr 4, 2026 •

edited

Loading