This repository has been archived by the owner on Nov 17, 2023. It is now read-only.

in-place reshape ops #14053

Merged
merged 3 commits into apache:master from szha:inplace_reshape
Jul 28, 2019

Conversation

szha
Member

@szha szha commented Feb 2, 2019

Description

Make the NDArray fluent methods for expand_dims/flatten/squeeze perform in-place reshapes.

Checklist

Essentials

Please feel free to remove inapplicable items for your PR.

Changes

  • Make the NDArray fluent methods for expand_dims/flatten/squeeze perform in-place reshapes (see the sketch below).
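
For context, a minimal illustration of the three fluent methods named above (the snippet and the shapes in the comments are illustrative only, assuming a (1, 3, 4) input; it is not part of the PR):

import mxnet as mx

x = mx.nd.ones((1, 3, 4))

# The three fluent reshape-style methods this PR touches. Whether they copy
# data or merely create a view over the same memory is what the PR changes.
a = x.expand_dims(axis=1)   # shape (1, 1, 3, 4)
b = x.flatten()             # shape (1, 12)
c = x.squeeze()             # shape (3, 4)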

Comments

@szha szha force-pushed the inplace_reshape branch 2 times, most recently from 58e9144 to 0f36d95, on February 2, 2019 06:52
@ZhennanQin
Contributor

Have you tried this with mkldnn enabled? E.g., reshaping the output of an mkldnn convolution?

@szha
Member Author

szha commented Feb 2, 2019

@ZhennanQin I didn't test that specifically; I'm relying on the assumption that the backend implementation should not break existing APIs such as NDArray.reshape or NDArray.shape.

@stephenrawls
Contributor

@szha -- Thanks for putting in this patch!

I have a couple of questions about the expand_dims operator. I understand this change makes the Python code stop using that operator, but I think the questions are still relevant.

(1) I originally discovered this because I was calling MXImperativeInvoke() through the C API with the expand_dims operator, and I noticed it was causing a copy. This fix only affects the Python version when operating on NDArrays, not users of the expand_dims operator.

I see that there is a path in the operator that checks if it is an in-place operation. Presumably it uses that path if I pass an output array that is the same NDArrayHandle as the input array? But what if I still need the original input array handle, and I want to create a new output array handle with the expanded dim but still not make a copy?

(2) In the issue I created you commented that: "For symbol (and thus the hybridized version), since in-place identity is possible it should not matter".

Can you talk a little more about that? I assume you mean that in this case:

x_expanded = x.expand_dims(1)
y = x_expanded + foo

The engine can figure out that x is not needed again, and can thus turn the expand_dims(1) into an in-place operation that doesn't make a copy?

I'm not very familiar with how this part of the code works, so what happens if you had code that looked like this?

x_expanded = x.expand_dims(1)
y = x_expanded + foo
z = 2 * x

i.e. the code still makes a reference to the original x, and thus presumably the engine can't decide to use the in-place version of expand_dims in that case, right? So I guess my question is -- does the ability of the symbolic / hybridized engine to elide the copy depend on the code not referencing the un-expanded version of the array after calling expand_dims()? If so, it seems like there will still be some use cases where an unexpected copy happens.

@szha
Member Author

szha commented Feb 3, 2019

@stephenrawls

This fix only affects the Python version when operating on NDArrays, not users of the expand_dims operator.

Right. The reshape operators always return a copy. This is because in imperative mode there is no complete computation graph to analyze in order to determine whether a copy is needed.
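
(A minimal sketch of that copy behavior in the imperative operator form, for illustration only:

import mxnet as mx

x = mx.nd.zeros((3, 4))
y = mx.nd.expand_dims(x, axis=0)   # operator form: allocates a new array
y[:] = 1
print(x.sum().asscalar())          # prints 0.0 -- writes to y do not show up in x
)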

I see that there is a path in the operator that checks if it is an in-place operation. Presumably it uses that path if I pass an output array that is the same NDArrayHandle as the input array?

Right, it is done through the output=original_array argument to the operators. It does not work with autograd (i.e. Gluon's training mode), as autograd doesn't support in-place operations.

But what if I still need the original input array handle, and I want to create a new output array handle with the expanded dim but still not make a copy?

As long as it's not used in the context of autograd, you can save the original handle before the in-place operation. Though the output of an in-place op shares the space with the input, it's still a separate handle.
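
(For example, a small sketch of keeping the original handle around while creating a reshaped view; this relies on NDArray.reshape returning a view over the same storage in the Python frontend:

import mxnet as mx

x = mx.nd.zeros((3, 4))
orig = x                        # keep the original handle
v = x.reshape((1, 3, 4))        # new handle over the same underlying storage
v[:] = 1
print(orig.sum().asscalar())    # prints 12.0 -- orig and v share the buffer
)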

For symbol (and thus the hybridized version), since in-place identity is possible it should not matter.

To elaborate, in symbolic mode the backend can see the complete computation graph, so it can analyze the graph (through node coloring) and figure out which nodes can share the same space. FInplaceIdentity is the graph node attribute that expresses whether this should be allowed, so that the compiler can plan memory accordingly. For details see 3rdparty/tvm/nnvm/src/pass/plan_memory.cc.

Note that this memory planning happens before any engine/scheduler work, and it applies to both the symbolic executor and hybridized Gluon HybridBlocks.

@stephenrawls
Contributor

@szha -- Thanks for the detailed explanation, that helps a lot.

So just for clarity, I see that the operator definition for expand_dims sets the FInplaceIdentity attribute to true:
https://github.com/apache/incubator-mxnet/blob/45d1a1e6ff9c58cfc75f72651c1bf671ac7f1885/src/operator/tensor/matrix_op.cc#L400-L419

I didn't see anything in the plan_memory.cc file that depended on autograd / training.

So should I take it that the code is smart enough to not allocate new memory for the expand_dims operator in symbolic mode, i.e. not to make a copy, even during training when using autograd?

If so, then I think I understand it, and I see what you mean about only needing to fix this for the Python ndarray use case.

Thanks again!
Stephen

@vandanavk
Contributor

@mxnet-label-bot add [pr-awaiting-review, NDArray]

@marcoabreu marcoabreu added NDArray pr-awaiting-review PR is waiting for code review labels Feb 4, 2019
@szha
Member Author

szha commented Feb 5, 2019

@stephenrawls actually, autograd only comes into play in pure imperative mode. MXNet's autograd is trace-based and does not support in-place operations such as input = mx.nd.expand_dims(input, ..., output=input). You're right that this doesn't help when you write imperative code in C++. To make use of the graph pass for saving memory, you need to use the Symbol API to express the model and run it with symbolic execution (or its counterpart for Gluon, which is CachedOp).
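
(A rough Gluon sketch of that symbolic path; ExpandNet is a made-up block for illustration, not part of this PR. Once hybridized, the forward pass is built into a CachedOp graph where the memory planner can apply FInplaceIdentity:

import mxnet as mx
from mxnet.gluon import HybridBlock

class ExpandNet(HybridBlock):
    def hybrid_forward(self, F, x):
        # When hybridized, F is mxnet.symbol and the whole graph is visible,
        # so the memory planner may let expand_dims reuse its input's storage.
        return F.expand_dims(x, axis=1) + 1

net = ExpandNet()
net.hybridize()                   # builds a CachedOp; graph-level memory planning applies
out = net(mx.nd.ones((2, 3)))
)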

@szha szha requested a review from reminisce February 5, 2019 06:56
@TaoLv
Member

TaoLv commented Feb 5, 2019

Does it mean that with these changes the Python API will behave differently from the other frontend languages?

@szha
Member Author

szha commented Feb 5, 2019

@TaoLv Other language bindings don't seem to have the fluent methods for reshaping NDArray.

@szha
Member Author

szha commented Feb 6, 2019

The current implementation would break backward compatibility for in-place update.

@szha
Member Author

szha commented Feb 11, 2019

I plan on adding an inplace argument to these fluent methods to allow users to specify whether in-place operation should be used.
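
(A hypothetical sketch of what such an inplace-aware fluent method could look like for expand_dims; the helper name, signature, and default here are assumptions for illustration and may differ from the final change:

import mxnet as mx

def expand_dims_fluent(arr, axis, inplace=False):   # hypothetical helper, for illustration
    if inplace:
        new_shape = list(arr.shape)
        new_shape.insert(axis, 1)
        return arr.reshape(new_shape)               # view over the same storage, no copy
    return mx.nd.expand_dims(arr, axis=axis)        # operator form, allocates a copy
)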

@ankkhedia
Contributor

@reminisce Could you please help review the PR?

@szha
Member Author

szha commented Feb 15, 2019

I haven't finished this PR yet, as I haven't gotten around to adding the in-place option.

@ankkhedia
Contributor

@mxnet-label-bot add [pr-work-in-progress]

@marcoabreu marcoabreu added the pr-work-in-progress PR is still work in progress label Feb 15, 2019
@ankkhedia
Contributor

@mxnet-label-bot remove [pr-awaiting-review]

@marcoabreu marcoabreu removed the pr-awaiting-review PR is waiting for code review label Feb 15, 2019
@szha szha removed the pr-work-in-progress PR is still work in progress label Feb 19, 2019
Contributor

@reminisce reminisce left a comment

Please add unit tests.

python/mxnet/ndarray/ndarray.py (review thread, outdated and resolved)
@szha
Member Author

szha commented Feb 22, 2019

@reminisce good catch. I was relying on existing tests before adding the inplace option but forgot to add tests for them afterwards.

@szha
Member Author

szha commented Mar 3, 2019

Added tests

Member

@wkcn wkcn left a comment

LGTM : )

I have a question.

It seems that the in-place flag is not supported for Symbol.
In Gluon, F.expand_dims(a, inplace=True) will raise an "inplace not found" error when the model is hybridized.

Could we add a warning in mxnet/symbol.py?

@szha
Member Author

szha commented Mar 6, 2019

@wkcn good point on symbol and HybridBlock. I'm not sure if a warning should be added as it can get very verbose.

@szha
Member Author

szha commented Mar 6, 2019

Nonetheless, it should not throw an error when inplace is used with hybridize. Let me see how best to deal with it.

@karan6181
Contributor

@szha Thank you for your contributions! Is this PR still a work in progress?

@abhinavs95
Contributor

@szha is this PR ready to merge?

@piyushghai
Contributor

@szha Is this good to merge?

@Roshrini Roshrini added the pr-awaiting-response PR is reviewed and waiting for contributor to respond label Apr 16, 2019
Member

@Roshrini Roshrini left a comment

LGTM

@szha
Member Author

szha commented Apr 17, 2019

Thanks for all the reviews. I plan to add a dummy option to the Symbol version of the reshape ops so that it won't throw an error when people use this option in the context of Gluon.
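
(A hypothetical sketch of that dummy-argument idea for the Symbol side; the helper name and structure here are assumptions for illustration and the merged code may be organized differently:

import mxnet as mx

def expand_dims_symbol(sym, axis, inplace=False):   # hypothetical helper, for illustration
    # `inplace` is accepted purely for API parity with the NDArray fluent method,
    # so hybridized Gluon code doesn't fail; for symbols, memory reuse is decided
    # by the graph memory planner (FInplaceIdentity), not by this flag.
    return mx.sym.expand_dims(sym, axis=axis)
)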

@roywei
Member

roywei commented Apr 30, 2019

@mxnet-label-bot add[pr-work-in-progress]

@marcoabreu marcoabreu added the pr-work-in-progress PR is still work in progress label Apr 30, 2019
@roywei
Member

roywei commented Apr 30, 2019

@mxnet-label-bot remove [pr-awaiting-response]

@marcoabreu marcoabreu removed the pr-awaiting-response PR is reviewed and waiting for contributor to respond label Apr 30, 2019
@karan6181
Contributor

@szha Did you get a chance to add a dummy option in the symbol version? Thanks!

@szha szha force-pushed the inplace_reshape branch 3 times, most recently from eb1da49 to d58bd77, on May 27, 2019 06:05
@piyushghai
Contributor

@szha @wkcn This PR is ready to be merged, I guess?

@mxnet-label-bot Update [NDArray, pr-awaiting-merge]

@marcoabreu marcoabreu added pr-awaiting-merge Review and CI is complete. Ready to Merge and removed pr-work-in-progress PR is still work in progress labels Jun 7, 2019
@szha
Member Author

szha commented Jun 8, 2019

@piyushghai thanks for the ping. I wanted to hold this during the 1.5 release code freeze.

@abhinavs95
Contributor

@szha is this good to go now?

@szha szha merged commit 3ececb3 into apache:master Jul 28, 2019
@szha szha deleted the inplace_reshape branch July 28, 2019 02:33
anirudhacharya pushed a commit to anirudhacharya/mxnet that referenced this pull request Aug 20, 2019
* in-place reshape ops

* add inplace option

* add dummy arguments to symbol
Labels
NDArray, pr-awaiting-merge (Review and CI is complete. Ready to Merge)
Development

Successfully merging this pull request may close these issues.

expand_dims() makes copy instead of simply reshaping