Skip to content

Releases: ROCm/MIOpen

MIOpen v1.2.0

20 Dec 17:11
a9949e3
Compare
Choose a tag to compare

Notes:

  • This release adds the support for recurrent neural networks (RNNs) for three flavors - Vanilla, LSTMs, and GRU
  • Users can now themselves update the perf-db file, which hosts the tuning parameters for convolutions, by setting appropriate environment variables

Changes:

  • Over 50% improvement in ResNet performance since the last release
  • Multiple padding modes like Same and Valid added
  • Winograd convolution kernels added for strided bwd-data convolutions
  • Tensor Ops allow for beta and alpha scaling values and support up to 5 dimensions with strides and offsets
  • Tensor Copy supports up to 5 dimesnional copies with strides and offsets
  • Unit-tests for LRN are added
  • Several bug fixes for all the layers of the library

Known issues:

  • RNNs may give incorrect result due to a known compiler bug; issue may particulary arise during some RNNs configs with GEMM of size power of 4
  • Potential issue where OpenCL resources will be exhausted for large RNN

MIOpen v.1.1.4

31 Oct 18:10
Compare
Choose a tag to compare
Merge branch '1.1.x' of github.com:AMDComputeLibraries/MLOpen into 1.1.x

MIOpen v.1.1.1

13 Sep 22:34
Compare
Choose a tag to compare
  • Performance improvements for the HIP backend
  • Robust error-checking

MIOpen v1.1

11 Sep 15:25
Compare
Choose a tag to compare

Notes:

  • The scaling parameter alpha and shift parameter beta for layers kernels are only supported for alpha = 1 and beta = 0. The exceptions to this are for miopenOptTensor, miopenConvolutionForwardBias, and miopenConvolutionBackwardBias.
  • Currently, only 32-bit floats are supported in MIOpen.
  • MIOpen only supports tensor layout NCHW.

Changes:

  • Added persistent cache for compiled GPU kernels
  • Performance improvements for batch normalization kernels
  • Performance improvements for all types of convolutions for 1x1 filters
  • Performance improvements for all types of convolutions with non-unit strides
  • Performance improvements for backward-weights convolutions for 3x3 filters
  • Performance improvements for the AddTensor operation
  • Various bug fixes for Winograd convolutions

1.0.2

11 Sep 15:26
Compare
Choose a tag to compare
Bump version