Skip to content

rocBLAS-14.3.0 for ROCm1.9

Compare
Choose a tag to compare
@amcamd amcamd released this 12 Oct 03:00
· 3625 commits to master since this release

Changelist:

  • add rocblas_gemm_strided_batched_ex for mixed precision support
  • tested on ROCm1.9
  • fix chunking of A and B matrices
  • expand testing of rocblas_gemm
  • sgemm and hgemm tuning on gfx906 for Resnet50 from Tensile V4.6.0

Known failures:

  • known dgemm failures for m,n < 16