[RFC] Apache MXNet 2.0 Roadmap #16167

szha · 2019-09-13T21:46:25Z

Overview

Status: https://github.com/apache/incubator-mxnet/projects/18
The status for each project will be updated by the contributor who's driving it. If you have more projects that you intend to drive please first discuss here.

The purpose of this RFC is to organize and present the roadmap towards 2.0. As 2.0 will be a major release, changes that would break backward compatibility are permissible.

The proposed changes in this RFC are either collected from past roadmap discussions such as #9686, or are based on various common issues from the past. This RFC organizes these changes into self-contained projects to facilitate clear definition of project, captures the risks and status quo to the best of our knowledge. To help navigate, the projects are further divided into several high-level areas. Some of the listed projects are already in progress, and are included to provide a clear overview.

The objectives of Apache MXNet 2.0 include:

Improve expressiveness and usability of user-facing API.
Improve expressiveness and usability of the technical stack for lower development cost and maintainability.

In terms of frontend, this roadmap focuses mostly on Python-frontend since MXNet has been taking a Python-first approach. The expectation with respect to other language bindings is that they would evolve along with the backend evolution and make use of the improvements. Given that breaking changes can occur, maintainers of different language bindings are expected to participate in related interface definition discussions.

1. MXNet NP Module

NumPy has long been established as the standard math library in Python, the most prevalent language for the deep learning community. With this library as the cornerstone, there are now the largest ecosystem and community for scientific computing. The popularity of NumPy comes from its flexibility and generality.

In #14253, the MXNet community reached consensus on moving towards a NumPy-compatible programing experience and committed to a major endeavor on providing NumPy compatible operators.

The primary goal of the projects below is to provide the equivalent usability and expressiveness of NumPy in MXNet to facilitate Deep Learning model development, which not only helps existing deep learning practitioners but also provides people in the existing NumPy community with a shortcut for getting started in Deep Learning. The efforts towards this goal would also help a secondary goal, which is to enable the existing NumPy ecosystem to utilize GPUs and accelerators to speed up large scale computation.

1.1. NumPy Operator Testing

Scope:

adopt array_function and numpy existing tests.
extend testing to GPU
investigate numpy testing strategies
decide correctness criteria for acceptance

1.2. NumPy Operator performance profiling

Scope:

Automatically profile the performance of NumPy operators

1.3. NumPy operator coverage

Scope:

improve operator until full NumPy coverage, with prioritization towards operators used in the ecosystem and deep learning in general

Operator coverage as of 07/03/2019

|    module |     NumPy | deepNumPy |       jax |      cupy |
|-----------|-----------|-----------|-----------|-----------|
|        np |       603 |        89 |       445 |       321 |
|   ndarray |        71 |        32 |        71 |        56 |
|    random |        63 |         5 |        15 |        49 |
|    linalg |        31 |         2 |         8 |        15 |

1.4. NumPy Extension Operator Reorganization and Renaming

Scope:

consistent type usage for index input and return values from sort, topk Use dtype=int for the indices returned by TopK #11031 [MXNET-507] Set dtype=int32 for ret_indices in ordering ops #11134, topk regression #12197
array creation operators with flexible dtype definition [MXNET-798] Fix the dtype cast from non float32 in Gradient computation #12290. (dtype=None)
moving_mean/moving_var in batchnorm
consistent usage of axis vs dim
promote or deprecate contrib operators

1.5. NumPy ndarray type extension

Scope:

bfloat16 support (not in NumPy yet but useful for deep learning) (low priority — Intel)
boolean type support
complex (for FFT)

1.6. NumPy ndarray boolean indexing

Scope:

allow boolean masks in NumPy ndarray indexing by adding the operator, potentially through extending op.where

1.7. Hybridizable basic (and advanced) indexing

Scope:

Allow operations such as y = x[1:3, 2, ...] to be hybridizable

Note: Preliminary work: #15663

2. Graph Enhancement and 3rdparty support

The objective of the following projects is to enable easier development of third-party extensions without requiring changes to be checked in the MXNet project. Examples of such extensions include third-party operator library and accelerators.

2.1. Graph Partitioning for Dynamic Shape Operators

Scope:

partition inside control flow operators (and all cached ops)
partition on operators with dynamic shapes for partial memory planning and caching.

2.2. Improved Third-party Operator Support

Scope:

allow registering custom operators by exposing C API (and frontend API) to register NNVM op at runtime.
verify serialization, deserialization, and graph passes for graphs with these operators are working properly.

2.3. Improved Third-party Backend Support (subgraph property)

Scope:

expose a graph pass for standard graph partitioning with back-end-specific criteria as a C API and frontend API.

2.4. Large tensor support by default

Scope:

enable default support for tensor with int64 dimension sizes
make sure there’s no significant performance regression in operators

Risks:

performance regression may happen in a subset of operators, which can disproportionally affect certain models.
compatibility and silent behavior change.

Notes: in progress (RFC: https://lists.apache.org/thread.html/df53b8c26e9e0433378dd803baba9fec4dd922728a5ce9135dc164b3@%3Cdev.mxnet.apache.org%3E)

3. API Changes

The objective of the following projects is to address the technical debts accumulated during the development of MXNet 0.x and 1.x with respect to the API definition.

3.1. C-API Clean-up

C-API is the foundational API in MXNet that all language bindings depend on.

Scope:

use packed function for flexibility (and potentially efficiency through avoiding string parsing)
do not expose backend accelerator-specific types such as mkldnn::memory in C-API
do not rely on topological ordering for argument passing (Reliance on topological ordering for graph inputs #15362).
verification of thread-safety and performance for C API

Risks:

backend integration may require refactoring or even redesign
existing use cases such as other frontend may be broken without substitute
feedback is scattered and we may miss the opportunity to change some APIs in 2.0

3.2. Unify Executor

Scope:

SymbolBlock equivalent in C/C++, unify the executor implementation for symbol/module and the one for gluon blocks
migrate other versions of inference API
Support mirror option in the unified executor

3.3. Gradient of Gradient support

Scope:

higher order gradient support for a subset of operators

Risks:

large number of backward operators could introduce significant technical debt if not properly verified.
ill-informed prioritization may result in usability issue (e.g. common GAN not supported)

3.4. Autograd Extension

Scope:

improve interface to support specifying intermediate output grad nodes
improve interface for better usability. (retain_graph → something not involving graph)
update graph pass for correctness

3.5. NNVM-backend Operator Interface Changes

Scope:

support more than one temporary spaces
split forward shape/type inference and reverse shape/type inference for better error messaging.
deferred initialization removal (or improve error/info message)
accompanying operator implementation changes

Risks:

some changes may make operator implementation less error-prone while less flexible, and thus require some reworking.

4. Gluon 2.0

Since the introduction of the Gluon API, it has superceded other API for model development such as symbolic API and model API. Conceptually, Gluon is the first attempt in the deep learning community to unify the flexibility of imperative programming with the performance benefits of symbolic programming, through trace-based just-in-time compilation.

The objectives of the following projects are:

address usability issue as a result of the divergence in the behavior of NDArray and Symbol.
extend the JIT to improve the coverage of hybridization.
introduce new functionality to facilitate more areas of research such as Baysian methods and AutoML.
improve the usability and performance of the utility in Gluon.

4.1. Unifying symbolic and imperative mode for tensor library

Scope:

unify the operator implementation and behaviors of symbolic and imperative execution modes (How to debug hybridize() failures? #10875)
allow naming for ndarray similar to symbol
address the necessary changes in shape/type inference.

4.2. Unifying Block and HybridBlock

Scope:

move hybridization logic to a JIT decorator
extend parameter management to Block
user-friendly warning for native control flow in JIT code.

4.3. Gluon Block Enhancement

Scope:

inspection of graph internals similar to monitor for Module (PR 15839)
support additional types in argument such as dict, kwargs, None
fused parameters and gradients respectively
register custom parameter

4.4. Enable Symbolic Shape (& Dtype) for Array Creation in NNVM-backend

Scope:

allow flexible creation of array based on shapes of other arrays that are only known at runtime
add constant symbol type as the return value of symbol.shape (?)
support constant symbol as operator arguments (?)
constant folding for constant symbols

4.5. Gluon Distributions Module

Scope:

sampling and pdf definition for distributions. Distribution https://github.com/amzn/MXFusion. PDF operators for the random samplers, and also the Dirichlet #14617.
wrap operators into more usable classes.
reproducible global seed

4.6. Gluon Metrics Module

Scope:

address usability and performance issues in mxnet.metric using hybridizable NumPy op

4.7. Gluon Optimizer Module

Scope:

API changes such as consistent weight decay (Inconsistent weight decay logics in multiple optimizers #9881), change default value to not apply wd on bias terms (do not regularize beta and bias #11953)
hybridizable optimizers
new optimizers (Optimizer wish list #9182)

4.8. Gluon Data API Extension and Fixes

Scope:

address diverging interfaces and remove transform= constructor arg (Transforms are not compatible with DownloadedDatasets #11141).
reorganize io/image modules and provide data loader instead.
lowering dataloader to backend for efficiency (Low CPU usage of MXNet in subprocesses #13593)
shared memory propagation?

4.9. Gluon Estimator Extension for Experimenting Utilities

Scope:

logging of configuration (DeepNLU), state, and performance for checkpointing for easier resume
pre-defined estimators for common problems

4.10. Gluon Estimator Refactoring for Examples and Tutorials

Scope:

modularize and refactor unstructured scripts and examples into estimator class utilities

4.11. Gluon Distributed Training Usability Enhancement

Scope:

more flexibility for communication with kvstore UDFs
add distribution strategies to estimator
plugin for communication backends (horovod, byteps, parameter server) for data parallel training
data sharding/sampling/streaming enhancement for distributed training

5. Documentation

Documentation is the most important factor for new adoption of a library. The following projects aim to:

address the usability and discoverability issues in the current MXNet website
improve the quality of documentation to make it correct, clear, and concise.
help adoption of the changes in MXNet 2.0 from existing users.

5.1. MXNet 2.0 Migration Guide

Scope:

document high-level mapping from old functionality to new API for data pipeline, modeling, optimization, training loop, metric, inspection and logging, debugging.

Risks:

parallel development of the doc may result in outdated doc.
auto doc verification is needed.

5.2. MXNet 2.0 Developer Guide

Scope:

carefully document the design and contribution guide for features with low entry bar such as operator, gluon block, doc, optimizer, metric, examples and tutorials.
clear and up-to-date system design overview.
clear roadmap

5.3. Adopt beta.mxnet.io as official website

Scope:

infrastructure change for new doc build
merge into master with NumPy.mxnet.io
improve load time and browsing experience
CDN in popular region such as China, with automated validation and testing.

Note: https://github.com/ThomasDelteil/mxnet.io-v2

6. Profiling and Debugging

Profiling and debugging is a common step in the development of deep learning models, and proper tools can help significantly improve developer's productivity. The objective of these projects is to provide such tools to make it easier to discover issues in correctedness and performance of models.

6.1. Memory Profiler

Scope:

memory profiler logging support in backend
automatic array naming tool based on scope
tree-map visualization tool for inspecting profiler dump

6.2. Enhanced Debugging Tool

Scope:

Enable user-specified error handling
Improve error message
Stacktrace inspection in debug API
Automatic error reporting tool
Runtime API for turning off asynchronous execution

7. Advanced Operators

The objective of these projects are to extend the tensor library and operators for better performance and for advanced use.

7.1. Strided ndarray support

Scope:

support strided array in a subset of operators
support auto-transpose of strided array in graph pass and executor

7.2. Ragged ndarray and operators

Scope:

introduce ragged (variable length) tensor as 1st class tensor. Support zero-copy from RaggedNDArray to NDArray when no dimension is ragged.
Load balancing strategy for operators that take RaggedNDArray as input
cover operators for NLP applications (RNN, transformer)

7.3. Improved Sparse Support

Scope:

sparse format and operator support
scipy coverage
operators for graph neural-networks (e.g. ops in minigun)

Minimum support:

format: csr,
zerocopy to DLPack
integration with minigun kernels

Next-level support:

format: coo and block sparse.

8. Building and Configuration

8.1. CMake improvement and Makefile deprecation

Scope:

reimplement CMakeLists for DMLC dependencies
reimplement CMakeLists for MXNet to support 1) building best performing binary in any platform 2) building portable binary distribution for pip

8.2. MXNet Configurator

Scope:

drop environment variables and centralize them as config.
define functionalities that support runtime-switch (candidates: memory pool, engine, worker thread pools) and expose frontend API
allow saving and loading of mxnet system config

9. Advanced training and deployment

9.1. Automatic Quantization and Quantized Training for NumPy

Scope:

automatic quantization based on heuristic (or learning)
BMXNet

9.2. Mobile and edge-device deployment

Scope:

replace amalgamation with more user-friendly function (TF-lite equivalent).
tutorial and example
metal support

10. Performance

10.1. MXNet Execution Overhead

Scope:

[Discussion] Overhead in MXNet Execution #14883

The text was updated successfully, but these errors were encountered:

pengzhao-intel · 2019-09-15T07:36:00Z

@szha Really great proposal and we may want to add some items in 2.0 too.
Is there a timeline of 2.0?

mxnet-label-bot · 2019-09-16T20:18:26Z

Hey, this is the MXNet Label Bot.
Thank you for submitting the issue! I will try and suggest some labels so that the appropriate MXNet community members can help resolve it.
Here are my recommended label(s): Feature

zachgk · 2019-09-16T22:56:42Z

Is there a plan to create a branch either for the 1.x version and have master reflect 2.0 or to create a branch for the 2.0 version and keep master on 1.x for now?

szha · 2019-09-17T05:36:11Z

@pengzhao-intel a tentative target date is by end of Q1 2020.

@zachgk we will create a branch for 2.0. Initially we will keep master to be 1.x and have 2.0 in a new branch. After 1.6 release we will revisit how to make the 2.0 branch the master.

braindotai · 2019-09-25T17:54:10Z

Just a quick cheer up for a new website of MXNet... its way more awesome and beautiful than I expected.
Though minor bugs are still there, for ex- most of the link in the tutorials are broken and not working.
Anyways great work so far.

stereomatchingkiss · 2019-12-10T06:18:53Z

Any plan to simplify the build of c and c++ api for mxnet2.0?It is hard(or very hard) to build a working version of mxnet with cpp api on different platforms(windows, linux, mac), every new release of the mxnet may or may not break something and we need to spend many hours to figure out how to make it work.

I am happy with python api, but not all of the tasks suitable for python. Almost every deep learning tools are based on c and c++, but almost everyone of them are difficult to or partially work with c and c++.

szha · 2019-12-10T06:20:27Z

@stereomatchingkiss good point. What are you using c/c++ api for?

stereomatchingkiss · 2019-12-10T06:29:10Z

@stereomatchingkiss good point. What are you using c/c++ api for?

Develop stand alone app on desktop and mobile(maybe on another devices like rpi4 or jetson nano in the future)
Wrapper of another language(ex : php)
Run the inference task on aws lambda, we do not want to prune the libs of python manually if we could build a slim library of mxnet/tensorflow/pytorch.

Maybe you could open a post to ask the users what are they expect for c or c++ api, I guess most of them only need to use the api to perform inference task but not training(python do a great job about this), this should help you shrink the size of the libs and made the codes less complicated.

edmBernard · 2019-12-11T10:12:16Z

@stereomatchingkiss That's a bit what amalgamation part was for ? a simplified inference interface. The last time I use amalgamation (some years ago) it was often break by update and not really maintain.

szha · 2019-12-15T22:02:29Z

The status of MXNet 2.0 project is tracked at: https://github.com/apache/incubator-mxnet/projects/18. The status for each project will be updated by the contributor who's driving it. If you have more projects that you intend to drive please first discuss here.

szha · 2019-12-15T22:03:58Z

Once 1.6 release is complete, we will create a branch for MXNet 1.x for future releases and start using master branch for 2.0 development.

sxjscience · 2019-12-26T21:59:54Z

Should we create a new branch for 2.0? I think we are also planing for 1.7.0 #16864

leezu · 2019-12-27T12:40:49Z

In the past we always kept development on the master branch, thus how about branching out 1.7.0 release branch and keeping development on master?

TaoLv · 2019-12-27T13:40:02Z

+1 for using master branch for 2.0 development. I think we need 3 branches at least:

master branch: for 2.0 development
v1.x: for 1.x development and maintenance
v1.7.x: for 1.7.x release

szha · 2019-12-28T03:43:55Z

That's what I had in mind. The v1.7.x branch doesn't have to be created until code freeze for 1.7.0

TaoLv · 2019-12-31T02:28:19Z

3.1. C-API Clean-up
C-API is the foundational API in MXNet that all language bindings depend on.

@szha I'm looking at the item 3.1.2. Could you please explain the scope of C-API? Do you mean those APIs sit in the src/c_api/ folder?

szha · 2019-12-31T03:40:32Z

@TaoLv one promising direction that the community is converging to is the interface based on packed function (motivation as described by @tqchen in #17097 (comment)). What this means to the project is that the existing c API will be updated to follow the packed function interface.

apeforest · 2020-02-19T23:36:35Z

Is there a plan to remove the cudnn_off argument from the neural network operators such as Dropout, Convolution, Pool etc. It creates a few usability issues:
(1) Once a model is exported. It requires users to change this flag in all the layers manually if they want to enable/disable cuDNN.
(2) When the cudnn_off is set to true in some layers, the global env variable MXNET_CUDNN_AUTOTUNE_DEFAULT becomes don't care. It's very confusing to users to see an error message like "Please turn off MXNET_CUDNN_AUTOTUNE_DEFAULT" by indeed it does not do anything.
(3) Why did we expose such implementation detail to users at the first place? In the worst case, we should just provide a global variable to turn on/off cuDNN in all layers instead of at operator level.

kalcohol · 2020-02-25T12:03:05Z

Thanks for this awesome work, it has benefited me a great deal.

Here are some disadvantages(may be) listed blow:

it seems that c and c++ interface both could work, but can not finish single task only by one;
low bit training or inference is not available via c/c++(ver. 1.6.0 fix fp16 training);
static linking lib is (very) far away from easy to use, cmake configuration file(like MxNetConfig.cmake, etc.) generated by cmake will enough for end users to integrate libmxnet.a and other large bunch of static third party libs(it's not easy to maintain gentlemanly demeanor all a day when manually linking these day by day). people could easy to hack loading interface of a dynamic library.
smaller size of lib will more friendly to edge devices.
more c++ training demo, including how to use kvstore(multiple cards and multiple servers), it's really not easy to understand.

Good day everyone.

leezu · 2020-02-25T17:41:50Z

@kalcohol please create a new issue about "static linking lib is (very) far away from easy to use", describing your setup in more detail and potentially suggestions how to improve the user experience.

kalcohol · 2020-02-26T07:29:27Z

@kalcohol please create a new issue about "static linking lib is (very) far away from easy to use", describing your setup in more detail and potentially suggestions how to improve the user experience.

#17692 add this tiny requist.

* refactor optimizer * refactor optimizer * fix svrg test * fix rmsprop param naming * fix signum test * fix pylint and perl test * fix perl test and signsgd test * fix * retrigger ci * reduce ci overheads

timespaceuniverse · 2020-03-28T05:20:22Z

@szha
i checked some docs and projects about distributed training ,
'Horovod' is project from uber team , 'Gloo' is project from facebook team.
The basic idea is to use trick from HPC computing field which is more efficient then traditional param-server:
http://andrew.gibiansky.com/blog/machine-learning/baidu-allreduce/?from=timeline
There is a tool called openmpi on which the 'Horvod' project is based ,but i found openmpi is too difficult to configure and use .
I also check the 'Gloo' which seems to use 'redis' to replace 'openmpi' .
I strongly suggest not to use Horovod directly which is based on openmpi that is too complex and old.

I also find bytedance has a good project solving the same problem not using MPI ,
https://github.com/bytedance/byteps

maybe we cant better integrate bytedance solution in roadmap 2.0 .
or we can have a mxnet internal solution similar to bytedance solution.

eric-haibin-lin · 2020-03-28T17:42:42Z

@lilongyue the integration of bytePS to mxnet is in this PR #17555

timespaceuniverse · 2020-03-29T14:41:01Z

@lilongyue the integration of bytePS to mxnet is in this PR #17555
that's great !

* refactor optimizer * refactor optimizer * fix svrg test * fix rmsprop param naming * fix signum test * fix pylint and perl test * fix perl test and signsgd test * fix * retrigger ci * reduce ci overheads

zheng-da · 2020-04-14T00:23:05Z

A quick comment: DGL contains all sampling implementation and no longer relies on the implementation in MXNet. I think we should deprecate the graph sampling implementation in MXNet.

* refactor optimizer * refactor optimizer * fix svrg test * fix rmsprop param naming * fix signum test * fix pylint and perl test * fix perl test and signsgd test * fix * retrigger ci * reduce ci overheads

Replaced by cmake buildsystem as per #16167

fhieber · 2020-07-22T09:23:29Z

@szha is there a recent estimate on the timeline for MXNet 2.0? Would you recommend to develop downstream toolkits (e.g. Sockeye) against the master branch now or rather wait a little bit longer?
Is there already documentation on how to transition MXNet 1.x projects to 2.x?

szha · 2020-07-22T18:42:31Z

@fhieber we are planning to release the first public beta on this somewhere in August. At the moment we are finalizing some API changes and also validating them in GluonNLP. We will publish a transition doc as part of the public beta.

TristonC · 2020-08-07T18:16:31Z

@szha We need to add moving AMP package from contrib to core? We will file RFC for this task.

Neutron3529 · 2020-08-18T15:00:41Z

@szha I found an inconvenient thing that there is no concat layer for gluon. Is it possible to add a concat layer for gluon?

davisliang · 2020-08-19T00:32:13Z

Making MXNET_SAFE_ACCUMULATION=1 default when running on float16 would be very convenient!

szha · 2020-08-19T00:36:37Z

+1 for turning it on by default. Get Outlook for iOS<https://aka.ms/o0ukef>

…

________________________________ From: Davis Liang <[email protected]> Sent: Tuesday, August 18, 2020 5:32:29 PM To: apache/incubator-mxnet <[email protected]> Cc: Sheng Zha <[email protected]>; Mention <[email protected]> Subject: Re: [apache/incubator-mxnet] [RFC] Apache MXNet 2.0 Roadmap (#16167) Making `MXNET_SAFE_ACCUMULATION=1` default when running on float16 would be very convenient!

-- You are receiving this because you were mentioned. Reply to this email directly or view it on GitHub: #16167 (comment)

Replaced by cmake buildsystem as per apache#16167

deepakkumar1984 · 2021-04-12T11:16:58Z

I made some good progress with the C# version for v2 changes. I have implemented most of the numpy operators in v2 till date and in phase of updating Gluon interface as per latest python version and to use numpy api's. Can we include/promote this project from the main website to attact more contributors.

https://github.com/deepakkumar1984/MxNet.Sharp

szha · 2021-04-12T14:12:10Z

@deepakkumar1984 awesome work, thanks for contributing to the ecosystem! I think we can definitely highlight it in the ecosystem page as a community project. Feel free to send a pull request to add it there. If you are interested, once it gets close to completion, we could also publish a blog to attract more attention.

How do you envision the codebase to be maintained and hosted going forward?

deepakkumar1984 · 2021-04-12T22:39:12Z

Thanks @szha, I will start working on the PR to highlight in the ecosystem page. I did started on writing some tutorials eg. https://mxnet.tech-quantum.com/docs-2/getting-started/create-a-neural-network/, but prefer in future to maintain these blogs similar to other bindings like https://mxnet.apache.org/versions/2.0/api/csharp. MxNet Sharp is more than just binding of the api's, I have implemented the Gluon package in version 1.5 itself and now in process of upgrading them. Also the gluon.probability will be implemented after completion of the gluon interface.

I am happy if the core project MxNet.Sharp can be merged with the main branch something like: https://github.com/apache/incubator-mxnet/csharp-package

I have other projects which are making small steps like GluonCV, GluonNLP, GluonTS, AutoGluon and SciKit learn (MxNet version). I can seperate them from my branch and keep them with me for now and probably start linking them in future in the ecosystem page when they are completing one by one.

barry-jin · 2021-05-12T22:21:20Z

Cpp-package will be added back in #20131. As this language binding will still rely on symbolic programming, some of the module like APIs removed in #18531 will also be added back. So, we may need to support these module APIs for some languange bindings, especially for cpp-package. @szha @leezu

szha pinned this issue Sep 13, 2019

szha added the Roadmap label Sep 13, 2019

xidulu mentioned this issue Sep 18, 2019

[RFC] [WIP] Making sampling methods differentiable. #16196

Open

sxjscience mentioned this issue Nov 7, 2019

[Numpy] Fix collect_params().zero_grad() in gluon numpy interface #16716

Merged

6 tasks

szha added the RFC Post requesting for comments label Dec 15, 2019

leezu mentioned this issue Feb 1, 2020

Add Scala 2.12 and 2.13 cross-compilation (#16438) #17503

Open

8 tasks

szha mentioned this issue Feb 24, 2020

[RFC] MXNet 2.0 API Deprecation #17676

Open

szha mentioned this issue Feb 27, 2020

[RFC] New Branches for MXNet 1.x, 1.7.x, and 2.x #17701

Closed

This was referenced Mar 20, 2020

MXNet Nightly Builds Moved to S3 apache/tvm#5114

Closed

MXNet Nightly Builds Moved to S3 amzn/xfer#70

Closed

zheng-da mentioned this issue Apr 14, 2020

Raise toolchain requirements for MXNet 2 #17984

Merged

szhengac mentioned this issue Apr 22, 2020

Inconsistent weight decay logics in multiple optimizers #9881

Closed

15 tasks

ciyongch mentioned this issue Apr 30, 2020

[Discussion] 1.7.0 Roadmap #16864

Open

apeforest mentioned this issue May 5, 2020

[MXNET-1450] Improve the backward mirroring implementation #18228

Merged

6 tasks

xidulu mentioned this issue May 27, 2020

Gluon.probability #18403

Merged

leezu mentioned this issue Jul 15, 2020

Remove Makefile #18721

Merged

leezu added a commit that referenced this issue Jul 20, 2020

Remove Makefile build support (#18721)

a7c6606

Replaced by cmake buildsystem as per #16167

leezu mentioned this issue Jul 24, 2020

remove other language bindings section from website api page #18783

Merged

4 tasks

szha unpinned this issue Aug 1, 2020

chinakook pushed a commit to chinakook/mxnet that referenced this issue Nov 23, 2020

Remove Makefile build support (apache#18721)

91471c3

Replaced by cmake buildsystem as per apache#16167

deepakkumar1984 mentioned this issue Apr 13, 2021

Adding MxNet.Sharp package to the ecosystem page #20162

Merged

leezu mentioned this issue May 11, 2021

[2.0] Add cpp-package #20131

Merged

11 tasks

[RFC] Apache MXNet 2.0 Roadmap #16167

[RFC] Apache MXNet 2.0 Roadmap #16167

Comments

szha commented Sep 13, 2019 • edited Loading

Overview

1. MXNet NP Module

1.1. NumPy Operator Testing

1.2. NumPy Operator performance profiling

1.3. NumPy operator coverage

1.4. NumPy Extension Operator Reorganization and Renaming

1.5. NumPy ndarray type extension

1.6. NumPy ndarray boolean indexing

1.7. Hybridizable basic (and advanced) indexing

2. Graph Enhancement and 3rdparty support

2.1. Graph Partitioning for Dynamic Shape Operators

2.2. Improved Third-party Operator Support

2.3. Improved Third-party Backend Support (subgraph property)

2.4. Large tensor support by default

3. API Changes

3.1. C-API Clean-up

3.2. Unify Executor

3.3. Gradient of Gradient support

3.4. Autograd Extension

3.5. NNVM-backend Operator Interface Changes

4. Gluon 2.0

4.1. Unifying symbolic and imperative mode for tensor library

4.2. Unifying Block and HybridBlock

4.3. Gluon Block Enhancement

4.4. Enable Symbolic Shape (& Dtype) for Array Creation in NNVM-backend

4.5. Gluon Distributions Module

4.6. Gluon Metrics Module

4.7. Gluon Optimizer Module

4.8. Gluon Data API Extension and Fixes

4.9. Gluon Estimator Extension for Experimenting Utilities

4.10. Gluon Estimator Refactoring for Examples and Tutorials

4.11. Gluon Distributed Training Usability Enhancement

5. Documentation

5.1. MXNet 2.0 Migration Guide

5.2. MXNet 2.0 Developer Guide

5.3. Adopt beta.mxnet.io as official website

6. Profiling and Debugging

6.1. Memory Profiler

6.2. Enhanced Debugging Tool

7. Advanced Operators

7.1. Strided ndarray support

7.2. Ragged ndarray and operators

7.3. Improved Sparse Support

8. Building and Configuration

8.1. CMake improvement and Makefile deprecation

8.2. MXNet Configurator

9. Advanced training and deployment

9.1. Automatic Quantization and Quantized Training for NumPy

9.2. Mobile and edge-device deployment

10. Performance

10.1. MXNet Execution Overhead

pengzhao-intel commented Sep 15, 2019

mxnet-label-bot commented Sep 16, 2019

zachgk commented Sep 16, 2019

szha commented Sep 17, 2019

braindotai commented Sep 25, 2019

stereomatchingkiss commented Dec 10, 2019 • edited Loading

szha commented Dec 10, 2019

stereomatchingkiss commented Dec 10, 2019 • edited Loading

edmBernard commented Dec 11, 2019 • edited Loading

szha commented Dec 15, 2019

szha commented Dec 15, 2019

sxjscience commented Dec 26, 2019

leezu commented Dec 27, 2019

TaoLv commented Dec 27, 2019

szha commented Dec 28, 2019

TaoLv commented Dec 31, 2019

szha commented Dec 31, 2019

apeforest commented Feb 19, 2020 • edited Loading

kalcohol commented Feb 25, 2020 • edited Loading

leezu commented Feb 25, 2020

kalcohol commented Feb 26, 2020

timespaceuniverse commented Mar 28, 2020 • edited Loading

eric-haibin-lin commented Mar 28, 2020

timespaceuniverse commented Mar 29, 2020

zheng-da commented Apr 14, 2020

szha commented Sep 13, 2019 •

edited

Loading

stereomatchingkiss commented Dec 10, 2019 •

edited

Loading

stereomatchingkiss commented Dec 10, 2019 •

edited

Loading

edmBernard commented Dec 11, 2019 •

edited

Loading

apeforest commented Feb 19, 2020 •

edited

Loading

kalcohol commented Feb 25, 2020 •

edited

Loading

timespaceuniverse commented Mar 28, 2020 •

edited

Loading

deepakkumar1984 commented Apr 12, 2021 •

edited

Loading