Skip to content

[MIOpen Downstream] Dynamic Reduction#26

Closed
asroy wants to merge 12 commits into
developfrom
miopen_downstream-dynamic_reduction_pr
Closed

[MIOpen Downstream] Dynamic Reduction#26
asroy wants to merge 12 commits into
developfrom
miopen_downstream-dynamic_reduction_pr

Conversation

@asroy
Copy link
Copy Markdown
Contributor

@asroy asroy commented Aug 26, 2021

For this MIOpen PR ROCm/MIOpen#1108

Chao Liu and others added 12 commits August 18, 2021 11:22
* Squashed 'src/composable_kernel/' content from commit f6edda6

git-subtree-dir: src/composable_kernel
git-subtree-split: f6edda6

* add solver ConvIgemmFwdV6r1DlopsNchwKcyxNkhw; rename static ck source files

* Squashed 'src/composable_kernel/' changes from f6edda6..5781adf

5781adf Update develop (#5) (#6)
97e6d51 Merge pull request #4 from ROCmSoftwarePlatform/separate_online_compile
7b1ec41 refactor
49c33aa refactor
54b3e73 rename

git-subtree-dir: src/composable_kernel
git-subtree-split: 5781adf

* fix

* refactor

* remove online compilation from CK

* refactor

* fix

* add ctest

* add c-style pointer cast

* vector/scalar pointer cast use c-style pointer cast instead of reinterpret_cast

* fix clang warning suppression

* tidy

* suppress cppcheck

* fix enum issue

* revert chagnes to hip build

* fix kernel filename

* update CK build script

* rename

* rename

* make innner product compatiable on gfx900

* Update src/include/miopen/solver/ck_utility_common.hpp

Co-authored-by: JD <Jehandad.Khan@amd.com>

* compiler parameter use stream

* use int instead of index_t in kernel wrapper

* DynamicBuffer, StaticBuffer, amd_buffer_load support customized value for invalid element

* refactor

* refactor

* change cmakelist

* change ck common utility

* fix

Co-authored-by: JD <Jehandad.Khan@amd.com>
@asroy asroy closed this Aug 26, 2021
@asroy asroy deleted the miopen_downstream-dynamic_reduction_pr branch September 21, 2021 01:41
asroy pushed a commit that referenced this pull request Oct 6, 2021
asroy pushed a commit that referenced this pull request Dec 1, 2023
* support hdim=64/128 in same example code

* support v transpose

* revert gemm.cpp, not intent to modify it

* remove useless code

* fix a bug for swizzle C encoding, no perf change

* optimize LDS encoding

* update LDS layout

* clean up code
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants