Releases: oneapi-src/oneDNN
v0.9
Performance optimizations
- Improved performance on processors with Intel(R) AVX2 instruction set support
- Improved performance on processors with Intel(R) AVX512 instruction set support
- Added optimizations for Intel(R) Xeon processors with Intel AVX512 instruction set support
- Added inference optimizations for Intel(R) Atom processors with Intel(R) SSE4.2 support
- Added JIT implementation of SGEMM for Intel(R) Xeon Phi(TM) processors
New functionality
- Average pooling supports 'exclude padding' mode
- LRN supports arbitrary local size
- Feature preview: Added int8 support in convolution, ReLU, pooling, and inner product. Added an optimized u8s8u8 convolution flavor for Intel Xeon processors with Intel AVX512 instruction set support
- Feature preview: Added int16 support in convolution, ReLU, pooling, and inner product. Added an optimized s16s16s32 convolution flavor for future Intel Xeon Phi processors
Usability improvements
- Improved the build system to enable integration into other projects
- Intel(R) OpenMP runtime is used when the library is built with the binary dependency
- Added a feature-based dispatcher to support a wide range of Intel(R) processors and compatible processors
Thanks to the contributors
This release contains contributions from many Intel(R) Performance Libraries developers as well as Ismo Puustinen @ipuustin, Dmitry Gorokhov, Vladimir Dudnik @vladimir-dudnik, @pruthviIntel, and Chris Olivier @cjolivier01. We would also like to thank everyone who asked questions and reported issues.
v0.7
Changes:
- Improved performance on processors with Intel(R) AVX2 instruction set support
- Improved performance on processors with Intel(R) AVX512 instruction set support
- Extended backward propagation optimizations for Intel(R) AVX2 and Intel AVX512 instruction sets
- Added an SGEMM-based reference convolution implementation, significantly improving performance for cases not covered by JIT convolution
- Added a JIT version of the SGEMM function for the Intel(R) AVX2 instruction set. This change makes it possible to build an optimized Intel(R) MKL-DNN without the binary component.
- Added backward propagation examples
v0.5
Changes:
- Added runtime CPUID dispatching mechanism
- Added initial Intel(R) AVX512 optimizations
- Improved performance on processors with Intel(R) AVX2 instruction set support
- Added initial backward propagation optimizations
- Extended batch normalization primitive API with scale/shift and mean/variance parameters
- Updated XByak to version 5.40