Skip to content

v0.11

Compare
Choose a tag to compare
@vpirogov vpirogov released this 30 Oct 15:49

Performance optimizations

  • Improved convolution performance on future Intel(R) Xeon Phi(TM) processors with AVX512_4FMAPS and AVX512_4VNNIW instruction groups support
  • Improved convolution performance on Intel(R) Xeon processors with Intel(R) AVX512 instruction set support
  • Improved performance of GEMM-based convolutions for small minibatches
  • Improved performance of Winograd convolution algorithm on Intel Xeon Phi processors.

New functionality

  • Added backpropagation support for dilated convolution.
  • Eltwise primitive is extended with support for square, abs, square root, linear, bounded ReLU, soft ReLU and logistic.

Usability improvements

  • Added macOS* support.

Breaking changes to the API

  • All real-value op descriptors' parameters now have float data type (previously double). The change breaks C-API backward compatibility for sum primitive. Please refer to 0bbb22e for details. C++ API maintains backward compatibility.

Thanks to the contributors

This release contains contributions from many Intel(R) Performance Libraries developers as well as Yu Yang @reyoung, Vladimir Mironov @vamironov, Nishant Patel @nbpatel, Leona Cook @indie, Jayaram Bobba @jbobba, Elena Gvozdeva. We would also like to thank everyone who asked questions and reported issues.

* Other names and brands may be claimed as the property of others.