
Added softplus FP32 FWD OneDNN kernel #36382

Merged: 8 commits into PaddlePaddle:develop on Oct 18, 2021
Conversation

jakpiase
Contributor

PR types

New features

PR changes

OPs

Describe

Added softplus FP32 FWD OneDNN kernel. It improves the performance of the ppyolov2_r50vd_365e_coco model by 14%.
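For context, softplus with a beta parameter computes y = (1/beta) * ln(1 + exp(beta * x)). A minimal scalar reference sketch (illustration only, not the kernel code; the operator's threshold handling for large inputs is omitted):

#include <cmath>

// Scalar reference for softplus with a beta parameter. The oneDNN kernel
// discussed in the review below expresses the same formula as a binary_mul
// primitive with soft_relu and linear post-ops instead of computing it
// element by element like this.
float softplus_ref(float x, float beta) {
  return std::log1p(std::exp(beta * x)) / beta;
}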

@paddle-bot-old

Thanks for your contribution!
Please wait for the CI results first. See the Paddle CI Manual for details.

@jakpiase jakpiase added the Intel label Oct 12, 2021
@jakpiase jakpiase requested a review from jczaja October 12, 2021 23:14
@jakpiase
Contributor Author

@piotrekobiIntel please review this PR


@ghost ghost left a comment


Apart from the import error, this looks great :)

jczaja previously approved these changes Oct 13, 2021
Contributor

@jczaja jczaja left a comment


LGTM

@jakpiase jakpiase requested a review from jczaja October 13, 2021 18:21
Contributor

@jczaja jczaja left a comment


LGTM

@jakpiase
Contributor Author

@piotrekobiIntel Could you please continue your review?

Comment on lines +37 to +49
dnnl::post_ops post_ops;
post_ops.append_eltwise(1.0f, dnnl::algorithm::eltwise_soft_relu, 0.0f,
                        0.0f);
if (beta != 1.0f) {
  post_ops.append_eltwise(1.0f, dnnl::algorithm::eltwise_linear,
                          1.0f / beta, 0.0f);
}

dnnl::primitive_attr attrs;
attrs.set_post_ops(post_ops);

this->AcquireForwardPrimitiveDescriptor(attrs, dnnl::algorithm::binary_mul,
                                        x_md, beta_md, x_md);
@ghost ghost left a comment

Something like this would allow skipping the multiplication by 1 when beta == 1. This exact code probably won't work as written, but I hope it gets the idea across. I'm not sure whether it's worth the extra work, though; it depends on how often beta equals 1 in practice.

Suggested change (the snippet above would become):

if (beta == 1.0f) {
  // beta == 1: a plain soft_relu primitive would be enough, so the
  // multiplication by 1 is skipped (attrs would have to be a
  // default-constructed dnnl::primitive_attr here)
  this->AcquireForwardPrimitiveDescriptor(
      attrs, dnnl::algorithm::eltwise_soft_relu, x_md, x_md);
} else {
  dnnl::post_ops post_ops;
  post_ops.append_eltwise(1.0f, dnnl::algorithm::eltwise_soft_relu, 0.0f,
                          0.0f);
  post_ops.append_eltwise(1.0f, dnnl::algorithm::eltwise_linear,
                          1.0f / beta, 0.0f);

  dnnl::primitive_attr attrs;
  attrs.set_post_ops(post_ops);

  this->AcquireForwardPrimitiveDescriptor(attrs, dnnl::algorithm::binary_mul,
                                          x_md, beta_md, x_md);
}

@jakpiase
Contributor Author

It was done like that in earlier commits of this PR, but I agreed with Jacek that the overall performance difference was negligible, and this way the code is unified and much clearer. Moreover, this operator will be fused with a tanh activation (for the ppyolov2_r50vd_365e model), and in that case the binary formulation is required anyway, because an eltwise primitive does not support fusing with another eltwise primitive. But you definitely have a point that execution would be faster with just soft_relu and no binary_mul at the start.
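For illustration, a rough sketch (assumed, not code from this PR) of how the planned tanh fusion would fit the binary_mul formulation, reusing the same append_eltwise signature as the reviewed snippet above:

// Hypothetical fragment in the style of the handler code above: because the
// primitive is binary_mul, a fused activation is just one more eltwise post-op.
dnnl::post_ops post_ops;
post_ops.append_eltwise(1.0f, dnnl::algorithm::eltwise_soft_relu, 0.0f, 0.0f);
if (beta != 1.0f) {
  post_ops.append_eltwise(1.0f, dnnl::algorithm::eltwise_linear, 1.0f / beta,
                          0.0f);
}
// The fused tanh activation from the follow-up work would be appended here:
post_ops.append_eltwise(1.0f, dnnl::algorithm::eltwise_tanh, 0.0f, 0.0f);

dnnl::primitive_attr attrs;
attrs.set_post_ops(post_ops);

With a plain eltwise_soft_relu primitive instead, there would be no way to chain that activation, since an eltwise primitive does not support fusing with another eltwise primitive.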


@ghost ghost left a comment


Great, LGTM then :)

@jczaja jczaja merged commit bdac9ff into PaddlePaddle:develop Oct 18, 2021