split_v2 operator #13687

HyperZealot · 2018-12-19T08:20:41Z

Description

New version of split operator to match the behavior of numpy.split

Checklist

Essentials

Please feel free to remove inapplicable items for your PR.

The PR title starts with [MXNET-$JIRA_ID], where $JIRA_ID refers to the relevant JIRA issue created (except PRs with tiny changes)
Changes are complete (i.e. I finished coding on this PR)
All changes have test coverage:
Unit tests are added for small changes to verify correctness (e.g. adding a new operator)
Nightly tests are added for complicated/long-running ones (e.g. changing distributed kvstore)
Build tests will be added for build configuration changes (e.g. adding a new build option with NCCL)
Code is well-documented:
For user-facing API changes, API doc string has been updated.
For new C++ functions in header files, their functionalities and arguments are documented.
For new examples, README.md is added to explain the what the example does, the source of the dataset, expected performance on test set and reference to the original paper if applicable
Check the API doc at http://mxnet-ci-doc.s3-accelerate.dualstack.amazonaws.com/PR-$PR_ID/$BUILD_ID/index.html
To the my best knowledge, examples are either not affected by this change, or have been fixed to be compatible with this change

Changes

New Split_v2 operator
Unit tests

Comments

Roshrini · 2018-12-19T18:59:18Z

@HyperZealot Thanks for working on this.
@mxnet-label-bot Add [pr-awaiting-review, Operator]

rongzha1 · 2018-12-20T02:23:47Z

Could you explain the difference between split(sliceChannel) and split_v2 , and maybe show some performance compare between these two OP? Thanks

python/mxnet/ndarray/ndarray.py

zheng-da · 2018-12-20T16:42:15Z

python/mxnet/symbol/symbol.py

+        indices = [0] + list(indices_or_sections)
+    else:
+        raise ValueError('indices_or_sections must either int or tuple of ints')
+    return _internal._split_v2(ary, indices, axis, squeeze_axis, sections)


why the symbol implementation and ndarray implementation are different? the ndarray impl always computes indices and here it may pass the number of sections directly.

You do not know the shape of a symbol so you cannot pre-compute the indices in symbol mode.

zheng-da · 2018-12-20T16:43:30Z

src/operator/tensor/matrix_op-inl.h

+  }
+};  // struct SplitParam
+
+inline TShape GetSplitIndices(const TShape& ishape, int axis, int sections) {


maybe return Tuple may make more sense.

TShape is also a Tuple: https://github.com/dmlc/tvm/blob/master/nnvm/include/nnvm/tuple.h#L325. But maybe I can also switch to nnvm::Tuple.

i know TShape uses Tuple. I think it's better to use Tuple for the return result. TShape makes the return result look like a shape.

apeforest

Could you explain the difference between split(sliceChannel) and split_v2? Is it possible to update UI of split? We should avoid creating v2 of an operator as much as possible.

HyperZealot · 2018-12-20T21:06:51Z

@apeforest Please also avoid v2 of the same question as well, I think ur question was asked by @rongzha1 already.
This version is to support split of an array at arbitrary points along as axis while the previous split only supported equal splits along an axis, to match numpy behavior of the same operator.
On the other hand, there was some API-breaking changes to cpp-package in my first attempt to modify the original split operator here so that's why I'm moving to creation of a new operator.

Roshrini · 2019-01-02T22:18:23Z

@zheng-da Can you take a look again to see if your comments are addressed? Thanks

zheng-da · 2018-12-21T15:50:57Z

src/operator/tensor/matrix_op-inl.h

+  }
+};  // struct SplitParam
+
+inline TShape GetSplitIndices(const TShape& ishape, int axis, int sections) {


i know TShape uses Tuple. I think it's better to use Tuple for the return result. TShape makes the return result look like a shape.

src/operator/tensor/matrix_op-inl.h

zheng-da · 2019-01-03T03:10:35Z

src/operator/tensor/matrix_op-inl.h

+    const size_t section_size = indices[target + 1] - indices[target];
+    const size_t target_idx =
+      head_idx * trailing_size * section_size + mid_idx * trailing_size + tail_idx;
+    target_data[target_idx] = in_data[i];


I'm a little concerned about this kernel. It takes a lot of computation to copy an element from the original array to a destination array. At least we should copy the entire row unless we split in the last dimension.

I doubt if memcpy is optimal for all cases actually, and splitting the kernel into 2 versions would also make the code harder to maintain and manage, what do you think if we treat that as a further optimization and do it in a follow-up PR so that the users of DGL can enjoy this new op ASAP?

src/operator/tensor/matrix_op-inl.h

HyperZealot requested a review from szha as a code owner December 19, 2018 08:20

marcoabreu added Operator pr-awaiting-review PR is waiting for code review labels Dec 19, 2018

HyperZealot force-pushed the split_v2 branch from 35447ff to fc35119 Compare December 20, 2018 10:17

zheng-da reviewed Dec 20, 2018

View reviewed changes

HyperZealot mentioned this pull request Dec 20, 2018

CI Problem: Build status not reflected on PR #11654

Open

apeforest reviewed Dec 20, 2018

View reviewed changes

zheng-da reviewed Jan 3, 2019

View reviewed changes

HyperZealot force-pushed the split_v2 branch from fc35119 to b8f67ef Compare January 8, 2019 03:49

split_v2

fe0df23

HyperZealot force-pushed the split_v2 branch from b8f67ef to fe0df23 Compare January 8, 2019 06:05

BullDemonKing approved these changes Jan 8, 2019

View reviewed changes

zheng-da approved these changes Jan 14, 2019

View reviewed changes

szha merged commit 45d1a1e into apache:master Jan 23, 2019

HyperZealot deleted the split_v2 branch January 23, 2019 21:10

jessr92 pushed a commit to jessr92/incubator-mxnet that referenced this pull request Jan 27, 2019

split_v2 (apache#13687)

8db75f7

stephenrawls pushed a commit to stephenrawls/incubator-mxnet that referenced this pull request Feb 16, 2019

split_v2 (apache#13687)

5cdff36

haohuanw pushed a commit to haohuanw/incubator-mxnet that referenced this pull request Jun 23, 2019

split_v2 (apache#13687)

6a4a095

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

split_v2 operator #13687

split_v2 operator #13687

HyperZealot commented Dec 19, 2018

Roshrini commented Dec 19, 2018

rongzha1 commented Dec 20, 2018

zheng-da Dec 20, 2018

HyperZealot Dec 20, 2018

zheng-da Dec 20, 2018

HyperZealot Dec 20, 2018

zheng-da Dec 21, 2018

apeforest left a comment

HyperZealot commented Dec 20, 2018

Roshrini commented Jan 2, 2019

zheng-da Dec 21, 2018

zheng-da Jan 3, 2019

HyperZealot Jan 8, 2019

split_v2 operator #13687

split_v2 operator #13687

Conversation

HyperZealot commented Dec 19, 2018

Description

Checklist

Essentials

Changes

Comments

Roshrini commented Dec 19, 2018

rongzha1 commented Dec 20, 2018

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

apeforest left a comment

Choose a reason for hiding this comment

HyperZealot commented Dec 20, 2018

Roshrini commented Jan 2, 2019

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment