This repository has been archived by the owner on Aug 22, 2023. It is now read-only.

nysbc/cuda

CUDA resources

Useful resources for learning CUDA and its Python variants.

Numba (CUDA-Python)

Numba provides a framework for quickly adding GPU acceleration to Python code. It does not support all CUDA functionality, but it is simpler to work with than PyCUDA. Kernels are written in Python (technically the CUDA-Python variant) and compiled by Numba to PTX code. http://numba.pydata.org/numba-doc/0.13/CUDAJit.html
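As a rough illustration of what "kernels are written in Python" looks like, here is a minimal sketch of an element-wise vector add. The names `launch_config`, `add_on_gpu`, and `add_kernel` are hypothetical and chosen only for this example. It assumes NumPy and a recent Numba are installed and a CUDA-capable GPU is present; it uses the `numba.cuda.jit` decorator and `cuda.grid`, which may differ from the API described in the older 0.13 documentation linked above.

```python
import math


def launch_config(n, threads_per_block=128):
    """Return (blocks, threads) so that one thread covers each of n elements.

    Hypothetical helper for this sketch; 128 threads per block is an
    arbitrary assumption, not a tuned value.
    """
    blocks = math.ceil(n / threads_per_block)
    return blocks, threads_per_block


def add_on_gpu(x, y):
    """Add two 1-D NumPy arrays on the GPU (requires Numba and a CUDA device)."""
    # Imports are deferred so this file still loads on machines without Numba.
    import numpy as np
    from numba import cuda

    @cuda.jit
    def add_kernel(a, b, out):
        # Absolute index of this thread in a 1-D grid.
        i = cuda.grid(1)
        # Guard: the last block may contain threads past the end of the array.
        if i < out.size:
            out[i] = a[i] + b[i]

    out = np.empty_like(x)
    blocks, threads = launch_config(x.size)
    # Numba copies NumPy arguments to the device and back automatically.
    add_kernel[blocks, threads](x, y, out)
    return out
```

On a machine with a GPU, `add_on_gpu(np.arange(1000.0), np.ones(1000))` would launch 8 blocks of 128 threads. For repeated launches, explicit device arrays are usually preferable to the automatic transfers used here.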

PyCUDA Documentation

The PyCUDA framework provides complete CUDA functionality; however, it may require writing kernels in CUDA C. https://documen.tician.de/pycuda/

Parallel for all NVIDIA blog (Excellent)

NVIDIA blog with excellent discussions of various parallel programming concepts, bottlenecks, and optimizations. https://devblogs.nvidia.com/parallelforall/efficient-matrix-transpose-cuda-cc/ https://devblogs.nvidia.com/parallelforall/using-shared-memory-cuda-cc/

Optimizing parallel reduction in CUDA (Excellent)

An excellent resource produced by NVIDIA for understanding reduction algorithms and the different sources of both acceleration and bottlenecks in a parallel processing framework. http://developer.download.nvidia.com/compute/cuda/1.1-Beta/x86_website/projects/reduction/doc/reduction.pdf
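To make the idea of a reduction concrete, the sketch below simulates a tree-style sum over one thread block in plain Python. It is an illustration only, not code from the NVIDIA document: the function name `block_reduce` is hypothetical, the input length is assumed to be a power of two, and the inner loop stands in for threads that would run in parallel on a GPU.

```python
def block_reduce(values):
    """Simulate a tree reduction (sum) over one thread block.

    Returns (total, steps), where steps is the number of parallel rounds.
    Assumes len(values) is a power of two, as in a typical block size.
    """
    n = len(values)
    if n == 0 or n & (n - 1):
        raise ValueError("length must be a non-zero power of two")

    shared = list(values)  # stands in for the block's shared memory
    stride = n // 2
    steps = 0
    while stride > 0:
        # On a GPU, every thread with tid < stride would do this at once,
        # followed by a barrier before the next round.
        for tid in range(stride):
            shared[tid] += shared[tid + stride]
        stride //= 2
        steps += 1
    return shared[0], steps


total, steps = block_reduce([1, 2, 3, 4, 5, 6, 7, 8])
print(total, steps)  # 36 3
```

Eight values are summed in three rounds rather than seven sequential additions, which is where the speedup comes from. Half of the remaining threads go idle after each round, which is one of the bottlenecks an optimized reduction has to work around.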

Dot product, matrix-vector multiplication, sparse matrix multiplication, global reduction

http://people.cs.pitt.edu/~melhem/courses/xx45p/cuda_examples.pdf

Reductions and low-level performance considerations

https://mc.stanford.edu/cgi-bin/images/5/55/Darve_cme343_cuda_4.pdf

Django/Celery/PyCuda troubleshooting

https://stackoverflow.com/questions/24744755/why-am-i-getting-cumemalloc-failed-not-initialized-even-though-i-am-initializ
