
Feature request - einsum #10840

Closed

jaanli opened this issue May 7, 2018 · 13 comments

Comments

jaanli commented May 7, 2018

Useful for writing models: https://rockt.github.io/2018/04/30/einsum
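For context, the requested operator follows numpy.einsum semantics; below is a small NumPy-only sketch (not from this thread) of the kind of expressions the linked post describes:

import numpy as np

A = np.random.rand(3, 4)
B = np.random.rand(4, 5)

# Matrix multiplication written as an Einstein summation: sum over the shared index j.
C = np.einsum('ij,jk->ik', A, B)
assert np.allclose(C, A @ B)

# A batched bilinear pattern common in models: contract the feature axis z,
# keep the batch axis a and the two position axes i and j.
x = np.random.rand(2, 8, 16)
y = np.random.rand(2, 8, 16)
scores = np.einsum('aiz,ajz->aij', x, y)  # shape (2, 8, 8)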

roywei (Member) commented May 9, 2018

@sandeep-krishnamurthy could you help add the "Feature Request" label? Thanks!

@jasonyu1996 (Contributor)

I am interested in implementing this. Has anyone already started working on this?

@sandeep-krishnamurthy (Contributor)

@jasonyu1996 - Thanks for looking into this issue. As far as I know, nobody is working on this feature; contributions are welcome. Let us know if you need any help.

@jasonyu1996 (Contributor)

@sandeep-krishnamurthy It seems somewhat complicated to implement einsum in the backend, but I would like to give it a try first. Thanks!

jasonyu1996 (Contributor) commented Sep 4, 2018

@sandeep-krishnamurthy I am not sure whether this should be implemented in the backend. It would need to be built on several existing operators, since implementing the forward and backward passes directly as a single monolithic operator would be complicated and inefficient (especially the backward pass). In my opinion a HybridBlock would be a good place to hold the implementation, but there are two problems:

  1. Firstly, implementing it as a HybridBlock would make the code language-dependent, so supporting all of MXNet's language bindings would be error-prone and labour-intensive.
  2. Secondly, this might stretch the definition of a Block, because the docs describe Block as the 'base class for neural network layers and models', and its subclasses live mostly in the gluon.nn package. Whether einsum fits that description is debatable: it is ambiguous what a 'neural network layer' actually means, and there is hardly any difference between a 'neural network layer' and an operator, especially a complex one. (Whether it is parameterized does not settle the question, because many layers in gluon.nn have no parameters, and some are simply thin wrappers around the corresponding operators.)

Could you point me to where the implementation should live if it needs to compose other operators? Alternatively, is there a way of storing temporary data in the forward pass for reuse in the backward pass? Thanks!
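For illustration (not from the thread), here is a minimal sketch of the HybridBlock-composition approach being debated, assuming the MXNet 1.x Gluon API; it covers only the single pattern 'bij,bjk->bik' via the existing batch_dot operator, not general einsum:

import mxnet as mx
from mxnet import gluon

class BatchedMatmul(gluon.HybridBlock):
    """Covers only the einsum pattern 'bij,bjk->bik', built from an existing operator."""

    def hybrid_forward(self, F, lhs, rhs):
        # F is mx.nd in imperative mode and mx.sym after hybridize(),
        # so one implementation serves both NDArray and Symbol inputs.
        return F.batch_dot(lhs, rhs)

block = BatchedMatmul()
block.hybridize()
out = block(mx.nd.ones((2, 3, 4)), mx.nd.ones((2, 4, 5)))  # shape (2, 3, 5)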

@sandeep-krishnamurthy (Contributor)

Operators are stateless, but I recall there is an optimization switch that enables saving data from the forward pass to be reused in the backward pass to make the computation faster.
@azai91 - Can you please help here?

@jasonyu1996 (Contributor)

Would it be good then to implement it as a HybridBlock?

jasonyu1996 (Contributor) commented Sep 6, 2018

I have almost finished the implementation, apart from some polishing and testing. The current solution is not in the backend but lives alongside Block. To let one piece of code support both ndarray and symbol, it uses the same idea as HybridBlock. The difference is that a HybridBlock may hold parameters and must be instantiated (and possibly hybridized) before use, whereas an operator exposes its interface in mxnet.ndarray and mxnet.symbol and should look the same as the other operators 'imported' from the backend.

I think this is really a design decision, as it amounts to a new interface for developing operators that are relatively complex and high-level and should be implemented in terms of existing ones.
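For illustration, a sketch of the operator-style alternative described above, assuming the MXNet 1.x namespaces; einsum_bij_bjk is a hypothetical helper name, and only one contraction pattern is covered:

import mxnet as mx

def einsum_bij_bjk(lhs, rhs):
    # Hypothetical helper covering only the 'bij,bjk->bik' contraction.
    # Pick the namespace from the input type, mirroring the F argument a
    # HybridBlock receives, but exposed as an ordinary function so it can
    # sit next to the operators in mxnet.ndarray and mxnet.symbol.
    F = mx.sym if isinstance(lhs, mx.sym.Symbol) else mx.nd
    return F.batch_dot(lhs, rhs)

# Imperative (NDArray) use:
y = einsum_bij_bjk(mx.nd.ones((2, 3, 4)), mx.nd.ones((2, 4, 5)))

# Symbolic (Symbol) use:
a, b = mx.sym.Variable('a'), mx.sym.Variable('b')
s = einsum_bij_bjk(a, b)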

yifeim (Contributor) commented Nov 29, 2018

May I follow up on the current state of einsum? It is convenient at times, and perhaps most useful for its autograd support.

@jasonyu1996 (Contributor)

@yifeim I have actually implemented one based on the high-level interfaces. However, it supports only the Gluon interface. I guess that to add support for Symbol I would have to move to a lower abstraction layer, or rely on some high-level interfaces that are still missing (#12484, for example).

@altosaar @roywei @sandeep-krishnamurthy I would be grateful if you could help.

yifeim (Contributor) commented Nov 30, 2018

@jasonyu1996 Could you point me to the Gluon interface? A hacky way to get a symbol is to export the Gluon model and load it back as a symbol.

yifeim (Contributor) commented Nov 30, 2018

See #13244 for an example of how to export Gluon models.
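For illustration, a sketch of that export-and-reload workaround, assuming the MXNet 1.x Gluon API; 'mymodel' is an arbitrary prefix, and the file names are those HybridBlock.export writes by default:

import mxnet as mx
from mxnet import gluon

net = gluon.nn.HybridSequential()
net.add(gluon.nn.Dense(10))
net.initialize()
net.hybridize()
net(mx.nd.ones((1, 4)))  # run one forward pass so the graph gets traced

# Writes mymodel-symbol.json and mymodel-0000.params to disk.
net.export('mymodel', epoch=0)

# Load the traced graph back as a Symbol...
sym = mx.sym.load('mymodel-symbol.json')

# ...or re-import it as a Gluon block bound to that symbol.
net2 = gluon.SymbolBlock.imports('mymodel-symbol.json', ['data'], 'mymodel-0000.params')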

sxjscience (Member) commented Apr 13, 2020

We now have einsum in the NumPy interface of MXNet, so I am closing this issue. You may try:

import mxnet as mx
import numpy as np

# Switch MXNet to NumPy-compatible semantics.
mx.npx.set_np()

# Two random (64, 8, 128, 512) tensors on the GPU (requires a GPU context).
lhs = mx.np.array(np.random.normal(0, 1, (64, 8, 128, 512)), dtype=np.float32, ctx=mx.gpu())
rhs = mx.np.array(np.random.normal(0, 1, (64, 8, 128, 512)), dtype=np.float32, ctx=mx.gpu())
mx.npx.waitall()

# Contract the last axis (z), keeping the batch axes a and b: result is (64, 8, 128, 128).
gt = mx.np.einsum('abiz,abjz->abij', lhs, rhs)
gt_np = gt.asnumpy()
