[GEN] Add sub_group_reduce operator#1214
Merged
whitneywhtsang merged 3 commits intollvm-targetfrom May 31, 2024
Merged
Conversation
07b8f7b to
6849f19
Compare
6a8c524 to
cadaf9f
Compare
Signed-off-by: Whitney Tsang <whitney.tsang@intel.com>
Signed-off-by: Whitney Tsang <whitney.tsang@intel.com>
cadaf9f to
343a89b
Compare
victor-eds
reviewed
May 31, 2024
Signed-off-by: Whitney Tsang <whitney.tsang@intel.com>
victor-eds
approved these changes
May 31, 2024
whitneywhtsang
added a commit
that referenced
this pull request
Jun 3, 2024
address code review comment: #1214 (comment) Signed-off-by: Whitney Tsang <whitney.tsang@intel.com>
wdziurdz
pushed a commit
that referenced
this pull request
Apr 7, 2026
This fix is needed for #1179. Currently ocloc installed in CRI environment as a part of NEO has support only for CRI architecture and no other GPUs are recognized by it, so this is a quick change to allow benchmarks to build for CRI. --------- Signed-off-by: Gregory Shimansky <gregory.shimansky@intel.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
The
gen.sub_group_reduceoperation is invoked by all work items in a subgroup, each of them providing a $value. The $size argument is used to form groups of $size consecutive work items called clusters. Each cluster performs the reduction operation identified by $kind. The result of the cluster reduction is propagated to the work items belonging to that cluster.It lowers to either WaveAll or WaveClustered depending on the given size.