Skip to content

Deep Learning Hardware Performance Benchmarking

Notifications You must be signed in to change notification settings

junyoung-sim/net

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

33 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

This is a fast deep learning kernel in C built from scratch for hardware performance benchmarking or simple machine learning tasks requiring efficiently configurable and deployable deep neural networks.

The default program in main.c reports timing for model initialization, forward passing, and backward passing of a network with over 1 billion parameters. To compile and execute this program, simply use the terminal commands shown below. Use the same commands if modifications are made in main.c for other use.

cd build
make && ./exec

alt text

Dell XPS 15 9520 (Intel i7-12700H)

About

Deep Learning Hardware Performance Benchmarking

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published