./configure && make && make bench
TO auto-detect CC version and create config.mk
, try
`./configure`
Otherwise, create config.mk
and set the variable CUDA_CC
to the version of code you want to generate.
For example
CUDA_CC=35
Build all supported tests
make
Run all supported tests and dump outputs in $MODULE.csv
make bench
Run individual tests, with outputs to stdout:
<module>/main
https://devblogs.nvidia.com/maximizing-unified-memory-performance-cuda/
https://devblogs.nvidia.com/maximizing-unified-memory-performance-cuda/