CK Tile GEMM Compute V2 (2 LDS Ping Pong mechanism) #1853
Merged
40 commits
c6eb7c3
comp v4 setup
3e0047a
add a file
cca67d1
Finished the coding of the feature, Compiler not in the way we suppos…
66a183d
Update some of the code to better format
4931698
get tback the restrict variable name, need to switch out to solve the…
71352c4
Solve the compiler issue on SHMEM conflict
dec32dc
Finish the feature and merge with develop on the computeV2
d1e7177
roll back to compute pipeline
b2c7d77
Add the changes from include/ck_tile
6db81a1
Address the comments
3b30146
pre-merge with the develop branch need to fix the bug
800cf89
Merge from internal (#1857)
illsilin 987cc54
Finish the integration to develop and have the correct result
d1715c0
Fix the gtest compilation error
bd09b37
Fix the gemm_basic error
4db6526
clang format
6774dda
switch the default pipeline to V3
2bef550
restore cron trigger (#1863)
illsilin c2bb46f
fix the benchmark basic script
5bb041b
add vectorloads on non-k dim for memory pipelines (#1856)
jakpiase 7409674
Solving the Review comments
feb656d
Support for dtypes (fp8, bf8, bf16 and fp16) for the ck_tile/03_gemm …
kylasa b5d201d
CK Tile - small fix to hotloop scheduler & KPack value. (#1867)
aosewski 34612ef
address the new comments
e3402c9
fix a small bug on the old
ae4243d
Add a host mx gemm reference kernel (#1864)
geyyer f49de49
External CI: enable amd-develop branch trigger (#1859)
danielsu-amd 9ba504b
merge with the develop support the fp8 with computev4
4106dfa
Merge branch 'develop' of https://github.com/ROCm/composable_kernel i…
2003487
Merge branch 'develop' into ck_tile/gemm_compute_v4
2154151
Solve FMHA error
96b135f
clang format
df6042c
Fix the memory pipleine
884a2f7
Merge branch 'develop' into ck_tile/gemm_compute_v4
illsilin a9df418
Merge branch 'develop' of https://github.com/ROCm/composable_kernel i…
ef2b53a
Merge branch 'develop' of https://github.com/ROCm/composable_kernel i…
7dc420a
Solve merge conflict and add the gtest for compv4
4658f2f
Merge branch 'develop' of https://github.com/ROCm/composable_kernel i…
2672ead
Merge branch 'develop' into ck_tile/gemm_compute_v4
1160b99
sync with develop
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| | | @@ -1,2 +1,5 @@ |
| 1 | 1 | add_executable(tile_example_gemm_basic EXCLUDE_FROM_ALL gemm_basic.cpp) |
| 2 | 2 | add_executable(tile_example_gemm_universal EXCLUDE_FROM_ALL universal_gemm.cpp) |
| | 3 | + target_compile_options(tile_example_gemm_universal PRIVATE |
| | 4 | + -mllvm -enable-noalias-to-md-conversion=0 |
| | 5 | + ) |
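
The PR title refers to a two-LDS ping-pong mechanism. As a rough, host-only illustration of the general double-buffering idea (not the CK Tile pipeline itself), the sketch below computes a dot product tile by tile while alternating between two staging buffers per operand: one buffer is consumed while the other is refilled with the next tile. The names `kTile`, `load_tile`, and `dot_ping_pong` are hypothetical and do not come from the PR. On a GPU the refill and the compute would overlap in time; here they simply run in sequence.

```cpp
#include <algorithm>
#include <array>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical tile length: stands in for the K-slice staged in one LDS buffer.
constexpr std::size_t kTile = 4;

// Copy one K-tile from "global memory" into a staging buffer (a stand-in for LDS).
inline void load_tile(const std::vector<float>& src,
                      std::size_t tile_idx,
                      std::array<float, kTile>& dst)
{
    std::copy_n(src.begin() + tile_idx * kTile, kTile, dst.begin());
}

// Dot product of a and b, computed tile by tile with two alternating buffers
// per operand (ping = index 0, pong = index 1).
inline float dot_ping_pong(const std::vector<float>& a, const std::vector<float>& b)
{
    assert(a.size() == b.size() && a.size() % kTile == 0);
    const std::size_t num_tiles = a.size() / kTile;
    if(num_tiles == 0)
        return 0.0f;

    std::array<std::array<float, kTile>, 2> a_buf{};
    std::array<std::array<float, kTile>, 2> b_buf{};

    // Prologue: fill the ping buffers with tile 0.
    load_tile(a, 0, a_buf[0]);
    load_tile(b, 0, b_buf[0]);

    float acc = 0.0f;
    for(std::size_t t = 0; t < num_tiles; ++t)
    {
        const std::size_t cur = t % 2; // buffer holding the tile to consume
        const std::size_t nxt = 1 - cur; // idle buffer, free to be refilled

        // Prefetch the next tile into the idle buffer, if there is one.
        if(t + 1 < num_tiles)
        {
            load_tile(a, t + 1, a_buf[nxt]);
            load_tile(b, t + 1, b_buf[nxt]);
        }

        // Consume the current tile.
        for(std::size_t k = 0; k < kTile; ++k)
            acc += a_buf[cur][k] * b_buf[cur][k];
    }
    return acc;
}
```

Calling `dot_ping_pong` on two equal-length vectors whose size is a multiple of `kTile` returns the same value as a plain loop over all elements, since the buffers only change where the data is staged and not the order of accumulation.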