v3.8.8
Change logs
- Update README.
- Add missing
cudaDeviceSynchronize
andcudaGetErrorString
for cuda runtime api. - Fix bug of cuda runtime api.
- [Nightly] Add
nvcuda.zluda_get_nightly_flag
.
It returns1
if ZLUDA was built with--nightly
flag. Otherwise, it returns0
. - [Nightly] Unlock cuBLASLt matmul compute type.
- [Nightly] Experimental cuDNN support on Windows. (disabled by default)
cuDNN on Windows
Starting from v3.8.8, the nightly build will include cudnn.dll
.
Supported Architectures
- gfx908, gfx90a
- gfx940, gfx941, gfx942
- gfx1030
- gfx1100, gfx1101, gfx1102
In order to enable cuDNN acceleration on supported device, you should download and unpack HIP SDK extension upon your existing HIP SDK 6.2 installation.
HIP SDK extension: DOWNLOAD
(unzip and paste folders upon path/to/AMD/ROCm/6.2
)
※ HIP SDK extension does not include hipBLASLt.