OneDNN md-in-tensor refactoring part 5: Memory descriptor enabled for elementwises, reductions and expand_v2 ops #43036
Conversation
Sorry, after our repeated discussions, your PR has not yet met the merge criteria. Please read the PaddlePaddle native operator development specification; you may submit a new PR. We are closing this PR for now. Thank you for your contribution.
Reopened due to license/cla being stuck
LGTM
@tsocha please review this PR
platform::Place cpu_place, const Tensor* x,
Tensor* out, float scale_x, float scale_y,
const std::vector<int64_t>& extended_x_dims)
[CONCERN]
The types are almost the same, but the order has changed.
I hope that our tests will detect potential problems with it.
Was this change of order necessary?
It was not necessary, but for me the order input, output is more intuitive than output, input. I have checked all the tests for reduce_grad and expand_v2 and they all work correctly after my changes.
paddle/fluid/platform/mkldnn_reuse.h (Outdated)
dnnl::primitive_attr attrs;
attrs.set_scales(DNNL_ARG_SRC_0, 0, {scale_x});
attrs.set_scales(DNNL_ARG_SRC_1, 0, {scale_y});
[OPINION]
The old name was good.
const Tensor* y, std::vector<int64_t> y_tz,
const dnnl::primitive_attr& attr = NULL)
const Tensor* out, std::vector<int64_t> out_tz,
const dnnl::primitive_attr& attrs = NULL)
[COMMENT]
If dnnl::primitive_attr can hold multiple attributes, its name should be changed in oneDNN; but if it can't, then our parameter should keep the old name.
It can hold scales, post-ops and zero-points, so it definitely supports multiple attributes, but it probably won't be renamed inside oneDNN because of backward compatibility.
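For reference, a minimal standalone sketch of how a single dnnl::primitive_attr object can bundle scales, post-ops and zero points at once, using the same oneDNN 2.x C++ API calls as the snippet above; the scale, algorithm and zero-point values are illustrative only.

#include "oneapi/dnnl/dnnl.hpp"

dnnl::primitive_attr MakeAttrsSketch() {
  dnnl::primitive_attr attrs;
  // Per-argument scales for a binary primitive (the same call used in this PR).
  attrs.set_scales(DNNL_ARG_SRC_0, 0, {0.5f});
  attrs.set_scales(DNNL_ARG_SRC_1, 0, {2.0f});
  // A post-op chain attached to the same attribute object.
  dnnl::post_ops ops;
  ops.append_eltwise(1.0f, dnnl::algorithm::eltwise_relu, 0.0f, 0.0f);
  attrs.set_post_ops(ops);
  // A zero point for the destination, as used by quantized kernels.
  attrs.set_zero_points(DNNL_ARG_DST, 0, {0});
  return attrs;
}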
std::vector<int64_t> output_dims(phi::vectorize(input->dims()));
std::vector<int64_t> output_dims = phi::vectorize(input->dims());
[QUESTION]
Was this change necessary?
No, it will produce the exact same assembly. I will revert it then.
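As a small illustration of that point (plain STL, with Paddle's phi::vectorize replaced by a hypothetical make_dims helper): with a temporary on the right-hand side, direct-initialization and copy-initialization of the vector construct it from the same returned value, so neither spelling does extra work.

#include <cstdint>
#include <vector>

// Stand-in for phi::vectorize(input->dims()); returns a temporary vector.
std::vector<int64_t> make_dims() { return {2, 3, 4}; }

int main() {
  // Direct-initialization from the temporary...
  std::vector<int64_t> a(make_dims());
  // ...and copy-initialization from the same kind of temporary.
  std::vector<int64_t> b = make_dims();
  // Since C++17 both elide the temporary; before that, both move.
  return a == b ? 0 : 1;
}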
for (size_t i = 0; i < reduce_dims.size(); ++i) {
  // handle negative dims, f.e. -1 means last dimension
[CONCERN]
I think "last dimension" is not the best description. In (1, 3, -1, 512, 512), is the last dimension -1 or 512?
You're right, I'll change it to "rightmost"
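For context, a minimal sketch (not the actual kernel code; the helper name is hypothetical) of the usual normalization of negative reduce axes, where -1 maps to the rightmost dimension:

#include <cstdint>
#include <vector>

// Map negative axes to their non-negative counterparts, e.g. for a
// 5-D tensor, axis -1 becomes 4 (the rightmost dimension).
std::vector<int64_t> NormalizeAxes(std::vector<int64_t> axes, int64_t rank) {
  for (auto& axis : axes) {
    if (axis < 0) axis += rank;
  }
  return axes;
}

// Usage: NormalizeAxes({-1, 0}, 5) yields {4, 0}.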
const auto* x = ctx.Input<LoDTensor>("X"); | ||
auto* out = ctx.Output<Tensor>("Out"); | ||
|
||
auto reduce_dims = ctx.Attr<std::vector<int>>("dim"); | ||
bool reduce_all = ctx.Attr<bool>("reduce_all"); | ||
bool keep_dim = ctx.Attr<bool>("keep_dim"); | ||
|
||
auto output_dims = | ||
CalculateReducedDims(input, output, reduce_dims, reduce_all, keep_dim); | ||
auto input_dims = phi::vectorize(input->dims()); | ||
auto x_tz = phi::vectorize(x->dims()); | ||
auto out_tz = | ||
CalculateReducedDims(x, out, reduce_dims, reduce_all, keep_dim); |
[OPINION]
I see why you changed names here, but input > x IMO.
I have changed them to unify all of our ops, since all new ops follow this pattern, i.e. inputs are described as 'X' and 'Y' and the output is described as 'Out'. Some legacy operators have different names because of backwards compatibility, but I thought that unifying them to get rid of the additional noise would be nice.
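As a rough, hypothetical sketch only (the real CalculateReducedDims in Paddle may differ in details), the reduced dims such a helper typically computes keep the input rank and set the reduced axes to 1, matching oneDNN's requirement that src and dst reduction descriptors have the same number of dimensions:

#include <algorithm>
#include <cstdint>
#include <vector>

// Hypothetical sketch of the reduced-dims calculation.
std::vector<int64_t> ReducedDimsSketch(const std::vector<int64_t>& x_tz,
                                       const std::vector<int64_t>& reduce_dims,
                                       bool reduce_all) {
  std::vector<int64_t> out_tz = x_tz;
  if (reduce_all) {
    std::fill(out_tz.begin(), out_tz.end(), 1);  // every axis is reduced
    return out_tz;
  }
  for (auto dim : reduce_dims) {
    if (dim < 0) dim += static_cast<int64_t>(x_tz.size());  // negative axes
    out_tz[dim] = 1;  // reduced axes collapse to 1 for the oneDNN descriptor
  }
  return out_tz;
}

// e.g. ReducedDimsSketch({2, 3, 4}, {1}, false) yields {2, 1, 4}.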
auto reorder_dst_memory_p = reorder_handler.AcquireDstMemory(
    output, input->mem_desc(), ctx.GetPlace());
// reuse same mem desc since it is a simple copy
Suggested change:
// reuse same mem desc since it is a simple copy
// reuse mem desc since it is a simple copy
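To illustrate why reusing one memory descriptor is fine for a plain copy, here is a minimal sketch using the raw oneDNN API (not the Paddle ReorderMKLDNNHandler; dims and values are illustrative): when src and dst share the same descriptor, the reorder degenerates to a straight copy.

#include <vector>
#include "oneapi/dnnl/dnnl.hpp"

void CopyViaReorder() {
  dnnl::engine eng(dnnl::engine::kind::cpu, 0);
  dnnl::stream strm(eng);

  // One descriptor describes both sides: same dims, dtype and layout,
  // so the reorder performs a plain element-wise copy.
  dnnl::memory::desc md({2, 3, 4}, dnnl::memory::data_type::f32,
                        dnnl::memory::format_tag::abc);
  std::vector<float> src_buf(24, 1.0f), dst_buf(24, 0.0f);
  dnnl::memory src(md, eng, src_buf.data());
  dnnl::memory dst(md, eng, dst_buf.data());

  dnnl::reorder(src, dst).execute(strm, src, dst);
  strm.wait();
}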
@@ -124,25 +126,18 @@ class ReduceGradMKLDNNKernel : public framework::OpKernel<T> {
bool keep_dim = ctx.Attr<bool>("keep_dim");
bool reduce_all = ctx.Attr<bool>("reduce_all");
auto dims = ctx.Attr<std::vector<int>>("dim");
const auto* dout = ctx.Input<Tensor>(framework::GradVarName("Out"));
auto* dout = ctx.Input<Tensor>(framework::GradVarName("Out"));
[CONCERN]
Do you plan to change this pointer?
Nope, nice catch, that was a mistake, thanks
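For clarity, a tiny generic C++ illustration (not Paddle code; the struct is a hypothetical stand-in) of what the dropped const would have given up: with a pointer-to-const the kernel can only read the gradient input, which is exactly the intent here.

struct Tensor { float value = 0.f; };  // hypothetical stand-in

void Kernel(const Tensor* dout, Tensor* dx) {
  // dout->value = 1.f;     // would not compile: the input is read-only here
  dx->value = dout->value;  // fine: the output is meant to be written
}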
auto dout_tz = CalculateReducedDims(dx, dout, dims, reduce_all, keep_dim);
auto dx_tz = phi::vectorize(dx->dims());
[CONCERN]
The logic changed here. The input has been swapped with the output; was that intentional?
What do tz and d mean in this context?
Input and Output were incorrectly swapped here earlier. The reduce kernel was one of the first ops that I made and there were some inconsistencies and bugs inside it, so that change is intentional.
d means that it is a grad tensor, so dout is the input and dx is the output.
I have no idea what tz means, but it is commonly used as a synonym for dims inside PaddlePaddle, oneDNN and other frameworks.
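A concrete, illustrative example of that naming for a reduction over axis 1 with keep_dim: the grad kernel consumes dout with the reduced shape and produces dx with the original input shape.

#include <cstdint>
#include <vector>

// Forward: x {2, 3, 4} --reduce over axis 1--> out {2, 1, 4}.
// Backward: dout (grad of out) is the input, dx (grad of x) is the output;
// "tz" is used throughout these kernels as shorthand for the dims vector.
std::vector<int64_t> dout_tz = {2, 1, 4};
std::vector<int64_t> dx_tz = {2, 3, 4};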
LGTM
… elementwises, reductions and expand_v2 ops (PaddlePaddle#43036)
* enabled md in elementwises, reductions and expand_v2
* CI fix for invalid numpy copy
* fixed formatting
* CI rerun
* changes after review
PR types: Others
PR changes: OPs
Describe: Memory descriptor enabled for elementwises, reductions and expand_v2 ops