Skip to content

How to optimize sgemm in single-thread ARM cpu, mutli-threads ARM cpu and Nvidia gpu

Notifications You must be signed in to change notification settings

renzibei/optimize-gemm

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 

Repository files navigation

Optimize Gemm

In this repo, we will show the code about how to optimize sgemm in single-thread ARM cpu, mutli-threads ARM cpu and Nvidia gpu.

In each subdirectory, use make to compile the program. And there will be a benchmark executable program to test the gemm. You can read the makefile files for detail.

About

How to optimize sgemm in single-thread ARM cpu, mutli-threads ARM cpu and Nvidia gpu

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published