
[BACKEND] Relax patterns to move sink broadcast and hoist convert#2331

ptillet merged 1 commit into main from arith_reduction on Sep 18, 2023

@ThomasRaoux
Collaborator

Improve the patterns that sink broadcasts to reduce arithmetic density, and also hoist convert above expand_dims to do less work.

This addresses comments in #2274.
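
As a rough illustration of why sinking a broadcast below an elementwise op reduces arithmetic, here is a minimal sketch in plain Python. Nested lists stand in for tensors; this is not Triton IR or the pass itself, and the helper names (`broadcast_rows`, `elementwise`, `scale`) are hypothetical.

```python
# Illustrative only: nested lists stand in for tensors. Not Triton code.

def broadcast_rows(col, n):
    """Broadcast a (M, 1) column to (M, n) by repeating each row's value."""
    return [[row[0]] * n for row in col]

def elementwise(fn, tensor, counter):
    """Apply fn to every element and record how many times fn ran."""
    out = []
    for row in tensor:
        counter[0] += len(row)
        out.append([fn(v) for v in row])
    return out

def scale(v):
    # Stand-in for any pure elementwise arithmetic op.
    return v * 2.0 + 1.0

col = [[1.0], [2.0], [3.0], [4.0]]  # shape (4, 1)
N = 8

# Broadcast first, then compute: the op runs on all 4 * 8 elements.
ops_before = [0]
result_before = elementwise(scale, broadcast_rows(col, N), ops_before)

# Compute first, then broadcast (broadcast "sunk" below the op):
# the op runs on only 4 elements.
ops_after = [0]
result_after = broadcast_rows(elementwise(scale, col, ops_after), N)

assert result_before == result_after
print(ops_before[0], ops_after[0])  # 32 4
```

Both orderings give the same values because the op is pure and elementwise; only the number of evaluations changes.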

@ThomasRaoux
Collaborator Author

ThomasRaoux commented Sep 18, 2023

based on PR comments from @peterbell10

@github-actions

⚠️ This PR does not produce bitwise identical kernels as the branch it's merged against. Please check artifacts for details. Download the output file here.

@ptillet ptillet merged commit 3a848e2 into main Sep 18, 2023
@ptillet ptillet deleted the arith_reduction branch September 18, 2023 22:08
alexander-zinoviev pushed a commit to alexander-zinoviev/triton that referenced this pull request Sep 21, 2023
…iton-lang#2331)

Improve the patterns that sink broadcasts to reduce arithmetic density,
and also hoist convert above expand_dims to do less work.

This addresses comments in triton-lang#2274
binarman added a commit to binarman/triton that referenced this pull request Oct 26, 2023
This PR fixes the layout propagation algorithm in the RemoveLayoutConversions pass.
In some cases during the rewriteSlice process, only one output layout of a reduce operation with multiple outputs
is rewritten, which breaks the assumption that both outputs should have the same layout.

This change is a minimal part of the triton-lang#2331 change plus a
small lit test for regression testing.
alefimov-amd pushed a commit to binarman/triton that referenced this pull request Oct 26, 2023
This PR fixes the layout propagation algorithm in the RemoveLayoutConversions pass.
In some cases during the rewriteSlice process, only one output layout of a reduce operation with multiple outputs
is rewritten, which breaks the assumption that both outputs should have the same layout.

This change is a minimal part of the triton-lang#2331 change plus a
small lit test for regression testing.
binarman added a commit to binarman/triton that referenced this pull request Oct 30, 2023
This PR fixes the layout propagation algorithm in the RemoveLayoutConversions pass.
In some cases during the rewriteSlice process, only one output layout of a reduce operation with multiple outputs
is rewritten, which breaks the assumption that both outputs should have the same layout.

This change is a minimal part of the triton-lang#2331 change plus a
small lit test for regression testing.
zhanglx13 pushed a commit to ROCm/triton that referenced this pull request Nov 1, 2023
* [RemoveLayoutConversions] Fix reduce failed infer type error

This PR fixes the layout propagation algorithm in the RemoveLayoutConversions pass.
In some cases during the rewriteSlice process, only one output layout of a reduce operation with multiple outputs
is rewritten, which breaks the assumption that both outputs should have the same layout.

This change is a minimal part of the triton-lang#2331 change plus a
small lit test for regression testing.

* fix combine test

* Fix issue with incorrect inference layout of make_range output result
jataylo pushed a commit to ROCm/triton that referenced this pull request Nov 3, 2023
* [RemoveLayoutConversions] Fix reduce failed infer type error

This PR fixes the layout propagation algorithm in the RemoveLayoutConversions pass.
In some cases during the rewriteSlice process, only one output layout of a reduce operation with multiple outputs
is rewritten, which breaks the assumption that both outputs should have the same layout.

This change is a minimal part of the triton-lang#2331 change plus a
small lit test for regression testing.

* fix combine test

* Fix issue with incorrect inference layout of make_range output result
pingzhuu pushed a commit to siliconflow/triton that referenced this pull request Apr 2, 2024
…iton-lang#2331)

Improve the patterns that sink broadcasts to reduce arithmetic density,
and also hoist convert above expand_dims to do less work.

This addresses comments in triton-lang#2274