Skip to content

v3.8.8

Compare
Choose a tag to compare
@github-actions github-actions released this 14 Feb 12:48
32493c9

Change logs

  • Update README.
  • Add missing cudaDeviceSynchronize and cudaGetErrorString for cuda runtime api.
  • Fix bug of cuda runtime api.
  • [Nightly] Add nvcuda.zluda_get_nightly_flag.
    It returns 1 if ZLUDA was built with --nightly flag. Otherwise, it returns 0.
  • [Nightly] Unlock cuBLASLt matmul compute type.
  • [Nightly] Experimental cuDNN support on Windows. (disabled by default)

cuDNN on Windows

Starting from v3.8.8, the nightly build will include cudnn.dll.

Supported Architectures

  • gfx908, gfx90a
  • gfx940, gfx941, gfx942
  • gfx1030
  • gfx1100, gfx1101, gfx1102

In order to enable cuDNN acceleration on supported device, you should download and unpack HIP SDK extension upon your existing HIP SDK 6.2 installation.

HIP SDK extension: DOWNLOAD
(unzip and paste folders upon path/to/AMD/ROCm/6.2)

※ HIP SDK extension does not include hipBLASLt.