Added host_conv_wrw for verification #15
Conversation
void host_direct_convolution_backward_weights(
    const Tensor<TIn>& in,
    const Tensor<TOut>& out,
    Tensor<TWei>& wei,
The arguments are currently ordered (in, out, wei). I should change the order to (out, in, wei).
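For context, here is a rough sketch of what a host-side backward-weights reference typically computes, written with the (out, in, wei) argument order proposed above. It is not the code from this PR: `ConvShape`, `reference_conv_bwd_weights`, the flat `std::vector` storage, and the NCHW/KCYX layouts are all assumptions made for illustration. Each weight gradient is the sum, over batch and output positions, of the output gradient times the matching input element.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical shape descriptor for illustration; not the Tensor<T> type from the PR.
struct ConvShape
{
    int N, C, Hi, Wi;       // input: batch, channels, height, width
    int K, Y, X;            // weights: output channels, filter height, filter width
    int Ho, Wo;             // output: height, width
    int stride_h, stride_w; // convolution strides
    int pad_h, pad_w;       // top/left zero padding
    int dil_h, dil_w;       // filter dilations
};

// Assumed layouts: out is NKHoWo, in is NCHiWi, wei is KCYX, all densely packed.
// wei[k][c][y][x] = sum over n, ho, wo of out[n][k][ho][wo] * in[n][c][hi][wi],
// where hi = ho * stride_h + y * dil_h - pad_h (and likewise for wi).
template <typename TOut, typename TIn, typename TWei>
void reference_conv_bwd_weights(const std::vector<TOut>& out,
                                const std::vector<TIn>& in,
                                std::vector<TWei>& wei,
                                const ConvShape& s)
{
    using idx = std::size_t;
    assert(out.size() == idx(s.N) * s.K * s.Ho * s.Wo);
    assert(in.size() == idx(s.N) * s.C * s.Hi * s.Wi);
    assert(wei.size() == idx(s.K) * s.C * s.Y * s.X);

    for(int k = 0; k < s.K; ++k)
        for(int c = 0; c < s.C; ++c)
            for(int y = 0; y < s.Y; ++y)
                for(int x = 0; x < s.X; ++x)
                {
                    // Accumulate in double so long reductions lose less precision.
                    double acc = 0.0;
                    for(int n = 0; n < s.N; ++n)
                        for(int ho = 0; ho < s.Ho; ++ho)
                        {
                            const int hi = ho * s.stride_h + y * s.dil_h - s.pad_h;
                            if(hi < 0 || hi >= s.Hi)
                                continue; // falls into the zero padding
                            for(int wo = 0; wo < s.Wo; ++wo)
                            {
                                const int wi = wo * s.stride_w + x * s.dil_w - s.pad_w;
                                if(wi < 0 || wi >= s.Wi)
                                    continue; // falls into the zero padding
                                const idx o = ((idx(n) * s.K + k) * s.Ho + ho) * s.Wo + wo;
                                const idx i = ((idx(n) * s.C + c) * s.Hi + hi) * s.Wi + wi;
                                acc += static_cast<double>(out[o]) * static_cast<double>(in[i]);
                            }
                        }
                    wei[((idx(k) * s.C + c) * s.Y + y) * s.X + x] = static_cast<TWei>(acc);
                }
}
```

Putting the two read-only tensors first and the written tensor last keeps the signature readable as "inputs, then result", which is presumably the point of the reordering.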
@zjing14 I couldn't find an issue in the code, but the result is not right. device result: 18432
There was a bug in the function. Fixed. Please pull and check.
@ltqin Please check whether the GPU and CPU results match.
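One possible way to do that check is an element-wise comparison with a combined relative and absolute tolerance, sketched below. The helper name `results_match` and the default tolerances are assumptions for illustration, not something from this repository; the right tolerances depend on the data type and on how long the reduction is.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper: returns true when every device value is finite and lies
// within atol + rtol * |host value| of the corresponding host reference value.
template <typename T>
bool results_match(const std::vector<T>& host,
                   const std::vector<T>& device,
                   double rtol = 1e-3,
                   double atol = 1e-5)
{
    assert(rtol >= 0.0 && atol >= 0.0);

    if(host.size() != device.size())
        return false; // different element counts can never match

    for(std::size_t i = 0; i < host.size(); ++i)
    {
        const double ref = static_cast<double>(host[i]);
        const double val = static_cast<double>(device[i]);
        // Reject NaN/Inf explicitly, since comparisons with NaN are always false.
        if(!std::isfinite(val) || std::fabs(ref - val) > atol + rtol * std::fabs(ref))
            return false;
    }
    return true;
}
```

An exact equality test would also work when inputs are small integers, but a tolerance is safer once the GPU and CPU sum in a different order.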
* slice kv, and use 3d padding LDS layout
* add missing sync
* put sync to another place
* move sync place
* revert to normal