rocFFT 1.0.28 for ROCm 6.2.0
Optimizations
- Implemented multi-device transforms for 3D pencil decomposition. Dimensions that are contiguous on the input and
output bricks are transformed locally; global transposes then make the remaining dimensions contiguous.
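
The pencil approach described above can be illustrated with a minimal single-process sketch. This is not rocFFT code and does not use the rocFFT API: the helper names (dft_1d, transform_last_axis, rotate_axes, pencil_fft_3d, direct_dft_3d) are hypothetical, a naive DFT stands in for the library's kernels, and an in-memory axis rotation stands in for the global transpose that would move data between devices. It only shows that transforming the contiguous axis, transposing, and repeating reproduces a full 3D transform.

```python
import cmath
import random

def dft_1d(row):
    # Naive O(n^2) forward DFT of one contiguous row (stand-in for a local FFT).
    n = len(row)
    return [sum(row[j] * cmath.exp(-2j * cmath.pi * j * k / n) for j in range(n))
            for k in range(n)]

def transform_last_axis(a):
    # Transform every row along the last (contiguous) axis; no data movement needed.
    return [[dft_1d(row) for row in plane] for plane in a]

def rotate_axes(a):
    # Stand-in for a global transpose: b[k][i][j] = a[i][j][k],
    # so the former middle axis becomes the last (contiguous) axis.
    n0, n1, n2 = len(a), len(a[0]), len(a[0][0])
    return [[[a[i][j][k] for j in range(n1)] for i in range(n0)] for k in range(n2)]

def pencil_fft_3d(a):
    # Layout (x, y, z): transform z, rotate to (z, x, y), transform y,
    # rotate to (y, z, x), transform x, rotate back to (x, y, z).
    for _ in range(3):
        a = rotate_axes(transform_last_axis(a))
    return a

def direct_dft_3d(a):
    # Reference: 3D DFT evaluated directly from its definition.
    nx, ny, nz = len(a), len(a[0]), len(a[0][0])
    def term(kx, ky, kz):
        return sum(a[x][y][z] * cmath.exp(-2j * cmath.pi * (x * kx / nx + y * ky / ny + z * kz / nz))
                   for x in range(nx) for y in range(ny) for z in range(nz))
    return [[[term(kx, ky, kz) for kz in range(nz)] for ky in range(ny)] for kx in range(nx)]

random.seed(0)
data = [[[complex(random.random(), random.random()) for _ in range(4)]
         for _ in range(3)] for _ in range(2)]

result = pencil_fft_3d(data)
reference = direct_dft_3d(data)

# Largest element-wise deviation between the pencil result and the reference.
max_err = max(abs(result[x][y][z] - reference[x][y][z])
              for x in range(2) for y in range(3) for z in range(4))
```

In this sketch max_err should sit at floating-point rounding level, since the two routines compute the same sums in a different order. In a real multi-device run each device would hold only a brick of the data, and the rotation step is where inter-device communication would occur.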
Changes
- Randomly generated accuracy tests are now disabled by default. To enable them, set
the --nrand option (default: 0) to a nonzero value.