New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

[Doc][Polish] gemm optimize by vectorize #57

Merged

AndSonder merged 1 commit into PaddleJitLab:develop from muyuuuu:gemm_vec

Dec 17, 2024

Contributor

muyuuuu commented Dec 5, 2024

删除多余的 cudamalloc
转置那里，我没理解错的话，只用 4 个大小的数组缓存就可以？
block_row_thread 的计算应该是反了，只是结果恰好一致


          [Doc][Polish] gemm optimize by vectorize

558f6c0

AndSonder approved these changes

View reviewed changes

Collaborator

AndSonder left a comment

LGTM, Great Work!

AndSonder merged commit 969a8e8 into PaddleJitLab:develop

2 checks passed

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet