[v1.7.x] Backport some numpy features + fixes #18648

sxjscience · 2020-07-01T05:06:46Z

Description

(Brief description on what this PR is about)

Checklist

Essentials

Please feel free to remove inapplicable items for your PR.

The PR title starts with [MXNET-$JIRA_ID], where $JIRA_ID refers to the relevant JIRA issue created (except PRs with tiny changes)
Changes are complete (i.e. I finished coding on this PR)
All changes have test coverage:
Unit tests are added for small changes to verify correctness (e.g. adding a new operator)
Nightly tests are added for complicated/long-running ones (e.g. changing distributed kvstore)
Build tests will be added for build configuration changes (e.g. adding a new build option with NCCL)
Code is well-documented:
For user-facing API changes, API doc string has been updated.
For new C++ functions in header files, their functionalities and arguments are documented.
For new examples, README.md is added to explain the what the example does, the source of the dataset, expected performance on test set and reference to the original paper if applicable
Check the API doc at https://mxnet-ci-doc.s3-accelerate.dualstack.amazonaws.com/PR-$PR_ID/$BUILD_ID/index.html
To the best of my knowledge, examples are either not affected by this change, or have been fixed to be compatible with this change

Changes

Feature1, tests, (and when applicable, API doc)
Feature2, tests, (and when applicable, API doc)

Comments

If this change is a backward incompatible change, why must this change be made.
Interesting edge cases to note here

mxnet-bot · 2020-07-01T05:06:51Z

Hey @sxjscience , Thanks for submitting the PR
All tests are already queued to run once. If tests fail, you can trigger one or more tests again with the following commands:

To trigger all jobs: @mxnet-bot run ci [all]
To trigger specific jobs: @mxnet-bot run ci [job1, job2]

CI supported jobs: [windows-gpu, centos-cpu, edge, windows-cpu, unix-gpu, miscellaneous, unix-cpu, website, clang, sanity, centos-gpu]

Note:
Only following 3 categories can trigger CI :PR Author, MXNet Committer, Jenkins Admin.
All CI tests must pass before the PR can be merged.

* FFI cumsum * Dispatch ufunc * Add PythonArg * Remove unused data type * Seperate op_utils and utils

* ffi_bincount percentile/quantile all/any * new ffi

Co-authored-by: Haozheng Fan <[email protected]>

Introduced in apache#17510

* Support ADT as FFI return value * Special operator= for NDArrayHandle * SVD * Support cython * Clear * Add split * Refine * Fix ci * Fix typo * Clear * Resolve sanity issues Co-authored-by: Haozheng Fan <[email protected]>

* impl - FFI for np_may_share_memory * impl - FFI benchmark Co-authored-by: Ubuntu <[email protected]>

* resolution * fix sanity error * remove func 'is_integer'

* impl - FFI for np_indices * fix - use MXNetTypeWithBool2String Co-authored-by: Ubuntu <[email protected]>

sxjscience · 2020-07-01T07:08:53Z

@ciyongch I traced the commits that are before 743bbcb and these are the initial list of commits.

There are two remaining commits like:

~~[numpy] fix mixed type backward #18250~~ (backported)
New set default dtype #18251

Also, there is one commit that I'm not sure whether it should be backported or not:

[CI] Test CMake builds instead of Makefile builds #17645

* ffi_diag/diagonal/diag_indices_from * sanity && benchmark

* impl - FFI for np dstack * impl - benchmark np_einsum np_dstack * impl - FFI for np_unique * impl - benchmark np_unique Co-authored-by: Ubuntu <[email protected]>

* F * f

* NumPy Laplace Distribution partly Frontend and Backend Signed-off-by: AntiZpvoh <[email protected]> * NumPy Laplace Distribution Backend style rectified Signed-off-by: AntiZpvoh <[email protected]> * NumPy Laplace Distribution Frontend modified Signed-off-by: AntiZpvoh <[email protected]> * Laplece op nightly test and normal op test correction Signed-off-by: AntiZpvoh <[email protected]> * NumPy Laplace Distribution unit test and code style Signed-off-by: AntiZpvoh <[email protected]> * Register uniform_n in CUDA Signed-off-by: AntiZpvoh <[email protected]> * Delete the registering of Laplace_n Signed-off-by: AntiZpvoh <[email protected]> * fix some alignment and indentation problems Signed-off-by: AntiZpvoh <[email protected]> * fix some sanity problems such as too long lines * fix some sanity problems again * laplace parmeters form change * implement basic laplace function * add frontend implement and ndarray loc case * complete the frontend * fix some sanity problems * fix some sanity problems * fix some typos * fix some problems * fix a typo * add size==() condition handling * fix some typos * remove unused code Co-authored-by: Ubuntu <[email protected]>

* fix - cpplint * impl - benchmark ffi for ops * rm - FFI for ops with param * fix - makefile * fix - not include unordered_map and use num_inputs * ci - compiler error * fix - change cholesky interface Co-authored-by: Ubuntu <[email protected]>

* fix - python interface * impl - ffi for matrix_rank * impl - ffi benchmark Co-authored-by: Ubuntu <[email protected]>

* [Numpy]Add kron * Implement the forward of Kron op * Implement the Backward of a * Implement the Backward of b * Fix 3rd party * Fix cpp sanity * Finish grad check * address comments: fix test_np_op and reduce req to req[0] * * Fix ndim = 0 * * Fix uninitialize bugs * * Impl FFI

* interp * fix_uninitialized_issue

* triu * rebase * fix ci * merge * triu new ffi * cpplint * cpplint * ffi benchmark * fix style * merge * fix conflict Co-authored-by: Ubuntu <[email protected]> Co-authored-by: Hao Jin <[email protected]>

…am (apache#17866) * add ffi for sum, var and std * add ffi wrapper for np.average * add ffi wrapper for np.histogram

* ffi_bitwise binary * retrigger ci

* change the header file of np.random.choice * add np_choice_op.cc file * add including header file * implement the basic function of random.choice * try to use take op in backend * try to use take op in backend * add take invoking function * fix some syntax problems * fix some problems * complete numpy.random.choice ffi * first commit of ffi indexing_op.cc * add random.choice ffi benchmark * complete take ffi * change the implementation of random.choice * add take op benchmark * complete clip op ffi and fix a problem * add clip op benchmark * fix some sanity problems * add space before ( and fix reimport * fix a typo * remove dead code and remove new operator Co-authored-by: Ubuntu <[email protected]>

sxjscience · 2020-07-01T08:15:57Z

@ciyongch I find that there are lots of numpy stuffs here. This is the initial attempt for backporting some commits. One issue is #17645, in which I'm not sure whether to port or not.

ciyongch · 2020-07-01T08:24:57Z

Thanks @sxjscience for the effort.
It seems that this backport introduce huge code changes which will be a big concern to the current stable code base especially after code freeze, then I'm more preferred NOT to include this at 1.7 release, what do you think? Copy @szha as well.

sxjscience · 2020-07-01T09:04:48Z

I think we should try to include as many numpy fixes as possible. The numpy feature was introduced in 1.6.0 and we should have a rather stable version in 1.7.0. Get Outlook for iOS<https://aka.ms/o0ukef>

…

________________________________ From: ciyong <[email protected]> Sent: Wednesday, July 1, 2020 1:25:12 AM To: apache/incubator-mxnet <[email protected]> Cc: Xingjian SHI <[email protected]>; Mention <[email protected]> Subject: Re: [apache/incubator-mxnet] [v1.7.x] Backport some numpy features + fixes (#18648) Thanks @sxjscience<https://github.com/sxjscience> for the effort. It seems that this backport introduce huge code changes which will be a big concern to the current stable code base especially after code freeze, then I'm more preferred NOT to include this at 1.7 release, what do you think? Copy @szha<https://github.com/szha> as well. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub<#18648 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/ABHQH3SZPF2V7KKMFANCJE3RZLXGRANCNFSM4ONBA5YQ>.

sxjscience · 2020-07-01T09:09:18Z

In addition, we do have the ability to test whether the numpy APIs are stable or not by checking the tests in GluonNLP: https://github.com/dmlc/gluon-nlp/tree/numpy/tests Get Outlook for iOS<https://aka.ms/o0ukef>

…

________________________________ From: Xingjian SHI <[email protected]> Sent: Wednesday, July 1, 2020 2:04:42 AM To: apache/incubator-mxnet <[email protected]>; apache/incubator-mxnet <[email protected]> Cc: Mention <[email protected]> Subject: Re: [apache/incubator-mxnet] [v1.7.x] Backport some numpy features + fixes (#18648) I think we should try to include as many numpy fixes as possible. The numpy feature was introduced in 1.6.0 and we should have a rather stable version in 1.7.0. Get Outlook for iOS<https://aka.ms/o0ukef>

________________________________ From: ciyong <[email protected]> Sent: Wednesday, July 1, 2020 1:25:12 AM To: apache/incubator-mxnet <[email protected]> Cc: Xingjian SHI <[email protected]>; Mention <[email protected]> Subject: Re: [apache/incubator-mxnet] [v1.7.x] Backport some numpy features + fixes (#18648) Thanks @sxjscience<https://github.com/sxjscience> for the effort. It seems that this backport introduce huge code changes which will be a big concern to the current stable code base especially after code freeze, then I'm more preferred NOT to include this at 1.7 release, what do you think? Copy @szha<https://github.com/szha> as well. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub<#18648 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/ABHQH3SZPF2V7KKMFANCJE3RZLXGRANCNFSM4ONBA5YQ>.

ciyongch · 2020-07-01T11:11:37Z

As numpy operator feature is in a quite active development in current master branch and mainly targeting on 2.0 release. It'll be fine if the backport patch is small and controllable at this time frame, but given the current one is introducing substantial modifications (+16,079 −1,160 code changes in 192 files), which is really a big concern whether this patch can help to make it solid enough while not introducing any other numpy issue. I agree to include existing fixes as much as possible, but this one is kind of complicated and need further confirmation. What do you think @szha @sandeep-krishnamurthy ?
Quote from @sandeep-krishnamurthy

+1 to mark this as experimental as it was not stable in 1.6 or advertised as will be stable in v1.7. We are mainly looking at numpy ops in 2.0.

Thanks.

sandeep-krishnamurthy · 2020-07-01T15:17:04Z

First of all thank you very much @sxjscience for this tremendous effort in backporting PRs.

I am really concerned with addition of new experimental features in the last minute. 1.7 was stable with many required functionality for users and no known breaking new changes compared to v1.6.

I still believe all numpy related changes in these commits should go in a separate release and not block v1.7 as I believe this is not a stable feature set expected by users or advertised 2 months ago before code freeze as v1.7 feature.

sxjscience · 2020-07-01T16:09:40Z

@sandeep-krishnamurthy @ciyongch I created this PR to track how many numpy commits would potentially go into v1.7.0 (not all of them should go into 1.7.0):

We may

Ignore all the FFI commits
Ignore commits that add new operators

However, we should add all the bug fixes

Apart from the following two commits

We will also need

sandeep-krishnamurthy · 2020-07-01T16:49:41Z

@sandeep-krishnamurthy @ciyongch I created this PR to track how many numpy commits would potentially go into v1.7.0 (not all of them should go into 1.7.0):

We may

Ignore all the FFI commits

Ignore commits that add new operators

However, we should add all the bug fixes

Apart from the following two commits

[numpy] unify impl of mixed type binary op between linux and windows #18523

Add zero grad for npi_unique #18080

We will also need

[numpy] fix mixed type backward #18250

[Numpy] Fix np.clip in scalar case #17788

Thank you @sxjscience - Can you please help with back port of these 4 bug fix PRs?

ciyongch · 2020-07-02T02:05:13Z

Thanks a lot @sxjscience for the great help to filter the necessary PRs targeting at 1.7 release and @sandeep-krishnamurthy for the valuable comments :)
When you're deciding to backport these PRs, are there any additional cases to make sure they're working expected at your side?

sandeep-krishnamurthy · 2020-07-02T04:00:49Z

@sxjscience when will it be possible to have a Backport PRs? @ciyongch what is the timeline you had planned for rc0?

ciyongch · 2020-07-02T04:25:58Z

Hi @sandeep-krishnamurthy , actually I was planning to drop rc0 before numpy related issues were raised.
@sxjscience has already submitted partial PRs listed as above in #18653. The remaining PRs are (#18250 and #18523), it would be great if the rest of PRs can be backported within 48h. @sxjscience can you please help to give an estimate for the remaining effort, or we might need to cut off some fixes/PRs if it's uncertain or taking too much time. What do you think? Thanks!

sxjscience · 2020-07-02T05:08:05Z

@ciyongch I'm going to test v1.7 after this backporting PR (#18653) is merged. WOuld you merge that one?

sxjscience · 2020-07-02T05:08:16Z

Close this PR now

ciyongch · 2020-07-02T05:31:31Z

Sure, that one should be fine, how about the rest of two PRs? May I know how much effort and time you'll need to backport them? Thanks!

ciyongch · 2020-07-02T05:43:51Z

@TaoLv helped to merge that PR, please help to check if the current code base works expected or not :)

add zero grad for npi_unique (apache#18080)

a7e7fa4

Yiyan66 and others added 11 commits June 30, 2020 22:18

flatnonzero (apache#17690)

b9c6a4a

[Numpy] FFI for cumsum and add (apache#17747)

8b5cb5e

* FFI cumsum * Dispatch ufunc * Add PythonArg * Remove unused data type * Seperate op_utils and utils

[Numpy] FFI: Bincount, Percentile/Quantile, All/Any (apache#17717)

e8e320e

* ffi_bincount percentile/quantile all/any * new ffi

Add ffi benchmark (apache#17780)

b88092e

Co-authored-by: Haozheng Fan <[email protected]>

ffi wrappers for polyval, ediff1d, nan_to_num (apache#17832)

2a3dbda

Fix compiler warnings in new FFI (apache#17718)

8750350

Introduced in apache#17510

ffi invocation: expand_dims, tril, diff, broadcast_to (apache#17738)

8c90040

[Numpy] FFI for split and svd (apache#17816)

e406293

* Support ADT as FFI return value * Special operator= for NDArrayHandle * SVD * Support cython * Clear * Add split * Refine * Fix ci * Fix typo * Clear * Resolve sanity issues Co-authored-by: Haozheng Fan <[email protected]>

add ffi for full_like, binary (apache#17811)

7ed6bf7

* impl - FFI for np_where_op (apache#17817)

a81d5f4

* impl - FFI for np_may_share_memory * impl - FFI benchmark Co-authored-by: Ubuntu <[email protected]>

ffi for roll/rot90 (apache#17861)

b816d43

sxjscience requested review from anirudh2290, eric-haibin-lin and szha as code owners July 1, 2020 06:02

JiangZhaoh and others added 3 commits June 30, 2020 23:40

[Numpy] allow mix integer dtypes for power/add/multiply (apache#17921)

34b4708

* resolution * fix sanity error * remove func 'is_integer'

fix true_divide (apache#18393)

aedf66b

* FFI for np.argmax and np.argmin (apache#17843)

f862579

* impl - FFI for np_indices * fix - use MXNetTypeWithBool2String Co-authored-by: Ubuntu <[email protected]>

haojin2 and others added 9 commits July 1, 2020 00:11

add alias for np.__version__, np._NoValue and np.dtype (apache#17777)

f00c3b5

[Numpy] FFI for diag/diagonal/diag_indices_from (apache#17789)

27d5008

* ffi_diag/diagonal/diag_indices_from * sanity && benchmark

ffi_atleast_1/2/3d (apache#17897)

905eb27

add: numpy rollaxis (apache#17865)

acb535b

* impl - FFI for np einsum (apache#17869)

ce37bef

* impl - FFI for np dstack * impl - benchmark np_einsum np_dstack * impl - FFI for np_unique * impl - benchmark np_unique Co-authored-by: Ubuntu <[email protected]>

[numpy] add op random.f (apache#17586)

28dbfda

* F * f

[Numpy] FFI: split and svd apache#17816

5a9bf61

sxjscience changed the title ~~[v1.7.x] [Backport]add zero grad for npi_unique (#18080)~~ [v1.7.x] Backport some numpy features + fixes Jul 1, 2020

sjtuWangDing and others added 12 commits July 1, 2020 00:49

* impl - linalg matrix_rank for cpu and gpu implemented (apache#18020)

ff2dbab

* fix - python interface * impl - ffi for matrix_rank * impl - ffi benchmark Co-authored-by: Ubuntu <[email protected]>

[Numpy] OP_interp (apache#17793)

2f69252

* interp * fix_uninitialized_issue

[numpy] add op median apache#17084

6fe903b

[Numpy] Add op fmax, fmin, fmod (apache#17567)

3f195fb

add: numpy op tril_indices (apache#17904)

673d3b3

[NumPy] Add NumPy support for triu (apache#17614)

5ba7a77

* triu * rebase * fix ci * merge * triu new ffi * cpplint * cpplint * ffi benchmark * fix style * merge * fix conflict Co-authored-by: Ubuntu <[email protected]> Co-authored-by: Hao Jin <[email protected]>

fix mixed type backward (apache#18250)

9ec1c4b

[Numpy] Add ffi for np.sum, np.std, np.var, np.average and np.histogr…

05551d0

…am (apache#17866) * add ffi for sum, var and std * add ffi wrapper for np.average * add ffi wrapper for np.histogram

[numpy] FFI binary bitwise ops (apache#17812)

2e57a6b

* ffi_bitwise binary * retrigger ci

fix np.clip scalar input case (apache#17788)

327f7ad

sxjscience mentioned this pull request Jul 1, 2020

Backporting recent mx.np changes to 1.7 branch #18641

Open

sxjscience closed this Jul 2, 2020

yijunc mentioned this pull request Jul 2, 2020

[v1.7.x] backport mixed type binary ops to v1.7.x #18649

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[v1.7.x] Backport some numpy features + fixes #18648

[v1.7.x] Backport some numpy features + fixes #18648

sxjscience commented Jul 1, 2020

mxnet-bot commented Jul 1, 2020

sxjscience commented Jul 1, 2020 •

edited

Loading

sxjscience commented Jul 1, 2020

ciyongch commented Jul 1, 2020

sxjscience commented Jul 1, 2020 via email

sxjscience commented Jul 1, 2020 via email

ciyongch commented Jul 1, 2020

sandeep-krishnamurthy commented Jul 1, 2020

sxjscience commented Jul 1, 2020

sandeep-krishnamurthy commented Jul 1, 2020

ciyongch commented Jul 2, 2020

sandeep-krishnamurthy commented Jul 2, 2020

ciyongch commented Jul 2, 2020

sxjscience commented Jul 2, 2020

sxjscience commented Jul 2, 2020

ciyongch commented Jul 2, 2020

ciyongch commented Jul 2, 2020

[v1.7.x] Backport some numpy features + fixes #18648

[v1.7.x] Backport some numpy features + fixes #18648

Conversation

sxjscience commented Jul 1, 2020

Description

Checklist

Essentials

Changes

Comments

mxnet-bot commented Jul 1, 2020

sxjscience commented Jul 1, 2020 • edited Loading

sxjscience commented Jul 1, 2020

ciyongch commented Jul 1, 2020

sxjscience commented Jul 1, 2020 via email

sxjscience commented Jul 1, 2020 via email

ciyongch commented Jul 1, 2020

sandeep-krishnamurthy commented Jul 1, 2020

sxjscience commented Jul 1, 2020

sandeep-krishnamurthy commented Jul 1, 2020

ciyongch commented Jul 2, 2020

sandeep-krishnamurthy commented Jul 2, 2020

ciyongch commented Jul 2, 2020

sxjscience commented Jul 2, 2020

sxjscience commented Jul 2, 2020

ciyongch commented Jul 2, 2020

ciyongch commented Jul 2, 2020

sxjscience commented Jul 1, 2020 •

edited

Loading