Skip to content

Update to gemm_reduce and batched_gemm_reduce#213

Merged
asroy merged 4 commits into
developfrom
gemm_reduce_improve
Apr 29, 2022
Merged

Update to gemm_reduce and batched_gemm_reduce#213
asroy merged 4 commits into
developfrom
gemm_reduce_improve

Conversation

@qianfengz
Copy link
Copy Markdown
Contributor

This P.R picked two commit from PR.211 5c60 and PR.211 8857 and provides the following

  1. Use Threadwise-reduce function to do thread local reduction in the gemm_reduce_xdl_cshuffle kernel
  2. Removed the ReduceSum and ReduceSquareSumdefined in element_wise_reduction_operation.hpp by using Add operation defined in reduction_operator.hpp (ReduceSquareSumis replaced byAddoperation andSquare` Elementwise operation)
  3. Does not pass ReduceOperation object in MakeArgumentPointer() (no need since ReduceOperation type is enough)

@qianfengz qianfengz requested a review from asroy April 28, 2022 15:19
false>;

using ThreadwiseReduce_D1 =
ThreadwiseReduction<FloatReduceAcc,
Copy link
Copy Markdown
Contributor

@asroy asroy Apr 29, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[Future improvement] ThreadwiseReduction could have a template param that specify pointwise operation, so that we don't need separate static_for loop for pointwise operation

@asroy asroy mentioned this pull request Apr 29, 2022
1 task
@asroy asroy merged commit c77ae65 into develop Apr 29, 2022
@asroy asroy mentioned this pull request Apr 30, 2022
@qianfengz qianfengz deleted the gemm_reduce_improve branch June 13, 2022 09:03
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants