Releases: ROCm/rocFFT
Releases · ROCm/rocFFT
rocFFT 1.0.18 for ROCm 5.3.3
rocFFT code for ROCm 5.3.3 did not change. The library was rebuilt for the updated ROCm 5.3.3 stack.
rocFFT 1.0.18 for ROCm 5.3.2
rocFFT code for ROCm 5.3.2 did not change. The library was rebuilt for the updated ROCm 5.3.2 stack.
rocFFT 1.0.18 for ROCm 5.3.1
rocFFT code for ROCm 5.3.1 did not change. The library was rebuilt for the updated ROCm 5.3.1 stack.
rocFFT 1.0.18 for ROCm 5.3.0
Changed
- Runtime compilation cache now looks for environment variables XDG_CACHE_HOME (on Linux) and LOCALAPPDATA (on Windows) before falling back to HOME.
Optimizations
- Optimized 2D R2C/C2R to use 2-kernel plans where possible.
- Improved performance of the Bluestein algorithm.
- Optimized sbcc-168 and 100 by using half-lds.
Fixed
- Fixed occasional failures to parallelize runtime compilation of kernels.
Failures would be retried serially and ultimately succeed, but this would take extra time. - Fixed failures of some R2C 3D transforms that use the unsupported TILE_UNALGNED SBRC kernels.
An example is 98^3 R2C out-of-place. - Fixed bugs in SBRC_ERC type.
rocFFT 1.0.17 for ROCm 5.2.3
rocFFT code for ROCm 5.2.3 did not change. The library was rebuilt for the updated ROCm 5.2.3 stack.
rocFFT 1.0.17 for ROCm 5.2.1
rocFFT code for ROCm 5.2.1 did not change. The library was rebuilt for the updated ROCm 5.2.1 stack.
rocFFT 1.0.17 for ROCm 5.2.0
Added
- Packages for test and benchmark executables on all supported OSes using CPack.
- Added File/Folder Reorg Changes with backward compatibility support using ROCM-CMAKE wrapper functions.
Changed
- Improved reuse of twiddle memory between plans.
- Set a default load/store callback when only one callback
type is set via the API for improved performance.
Optimizations
- Introduced a new access pattern of lds (non-linear) and applied it on
sbcc kernels len 64 to get performance improvement.
Fixed
- Fixed plan creation failure in cases where SBCC kernels would need to write to non-unit-stride buffers.
rocFFT 1.0.16 for ROCm 5.1.3
rocFFT code for ROCm 5.1.3 did not change. The library was rebuilt for the updated ROCm 5.1.3 stack.
rocFFT 1.0.16 for ROCm 5.1.1
rocFFT code for ROCm 5.1.1 did not change. The library was rebuilt for the updated ROCm 5.1.1 stack.
rocFFT 1.0.16 for ROCm 5.1.0
Changed
- Supported unaligned tile dimension for SBRC_2D kernels.
- Improved (more RAII) test and benchmark infrastructure.
- Enabled runtime compilation of length-2304 FFT kernel during plan creation.
Optimizations
- Optimized more large 1D cases by using L1D_CC plan.
- Optimized 3D 200^3 C2R case.
- Optimized 1D 2^30 double precision on MI200.
Fixed
- Fixed correctness of some R2C transforms with unusual strides.
Removed
- The hipFFT API (header) has been removed from after a long deprecation period. Please use the hipFFT package/repository to obtain the hipFFT API.