
Probability Distributions Support #12932

thomelane opened this issue Oct 23, 2018 · 11 comments

@thomelane (Contributor)

It would be great to have out-of-the-box support for probability distributions, similar to the functionality provided by TensorFlow Probability and PyTorch Distributions. My current use case is Reinforcement Learning algorithms that learn stochastic action policies (i.e. they learn the parameters of a distribution from which actions are sampled), and I update these parameters using the likelihood.

MXNet would ideally have methods on each type of distribution for calculating (a concrete sketch of this interface follows at the end of this comment):

  • probability density,
  • log probability of a data sample given the distribution,
  • entropy of the distribution,
  • KL divergence between the distribution and another distribution,
  • sampling from the distribution (mostly already implemented).

And would support a variety of distributions including:

  • Categorical
  • MultivariateNormal
  • Bernoulli
  • Beta
  • Dirichlet
  • Exponential
  • Gamma
  • Poisson
  • Uniform

MXFusion is a related project, but it doesn't have the functionality mentioned above. Ideally this would live as a submodule of the MXNet package itself.
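
For concreteness, this is what the requested surface looks like in PyTorch's torch.distributions (one of the libraries cited above); the same five operations are what I'd like MXNet to expose:

```python
import torch
from torch.distributions import Normal, kl_divergence

p = Normal(loc=torch.tensor(0.0), scale=torch.tensor(1.0))
q = Normal(loc=torch.tensor(1.0), scale=torch.tensor(2.0))

x = p.sample()              # sampling from the distribution
log_p = p.log_prob(x)       # log probability of a data sample
density = log_p.exp()       # probability density at x
entropy = p.entropy()       # analytic entropy of the distribution
kl = kl_divergence(p, q)    # analytic KL(p || q) to another distribution
```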

@thomelane (Contributor, Author)

@mxnet-label-bot [Feature Request, Gluon, Operators]

@thomelane (Contributor, Author)

Cheers @anirudhacharya, but the functionality I'm referring to is not covered by those references.

I'm talking about functionality beyond just sampling from distributions: calculating the probability density, the log probability of a data sample given the distribution, the entropy of a distribution, etc. And as far as I'm aware, the KL loss in MXNet only works on samples rather than computing the analytical KL divergence between two distributions, so it is also insufficient for certain use cases.

You can't back-propagate gradients through samples, which is why it's important to have such formulas (e.g. log probability) implemented: the log probability gives autograd a differentiable path to the distribution's parameters (see the sketch below).

I can see a single case of a probability being returned, by mxnet.ndarray.random.multinomial, but it is only the probability of the sampled data point; it can't be evaluated at an arbitrary data point, which is what's required.
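
To make that concrete, here is a minimal REINFORCE-style sketch (mine, not from this issue) using only existing MXNet 1.x NDArray ops and a hand-written Gaussian log-pdf; the toy reward and shapes are illustrative:

```python
import math
from mxnet import nd, autograd

mu = nd.array([0.0])      # learnable mean of a Gaussian action policy
sigma = nd.array([1.0])   # fixed standard deviation, for simplicity
mu.attach_grad()

action = nd.random.normal(loc=0.0, scale=1.0, shape=(1,))  # a draw from N(0, 1); not differentiable
reward = -(action - 2.0) ** 2                               # toy reward signal

with autograd.record():
    # Hand-coded Gaussian log-pdf: MXNet 1.x offers no log_prob method.
    log_prob = (-0.5 * ((action - mu) / sigma) ** 2
                - nd.log(sigma) - 0.5 * math.log(2 * math.pi))
    loss = -log_prob * reward   # score-function (REINFORCE) objective
loss.backward()

print(mu.grad)  # gradient reaches the policy parameter only via log_prob
```

Without the log-pdf there is nothing for `loss.backward()` to differentiate, since the sample itself carries no gradient history.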

@eric-haibin-lin (Member)

+1 on this feature

@asmushetzel (Contributor) commented Nov 1, 2018

We also need this as part of our project. I have a local version that computes the PDF and log-PDF, including the forward/backward pass (i.e. gradients for all parameters and samples), for the following distributions: uniform, normal, exponential, gamma, Poisson, negative binomial, and Dirichlet. All are coded as C++ operators and work on CPU and GPU. But it would take some more effort to make them clean enough to commit; I'll have to see when I find the time. Naturally, we could extend this to support the CDF etc.
Regarding more complex distributions like multivariate Gaussians, all the necessary basic functionality already exists in MXNet's linalg namespace; I have plugged it together in Python in a couple of lines (sketched below). On top of that we have built quite a bit more that allows constructing more complex things (Gaussian mixtures etc.).
We are using all of this in a specific project and will likely not have the time in the near future to polish it to the point where we can contribute it back. But if other interested parties are willing to join the effort, we can collaborate.
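
As a rough illustration of the "couple of lines" mentioned above, a multivariate-Gaussian log-density can be assembled from the existing linalg operators like this (the function name and shape handling are illustrative, not our actual project code):

```python
import math
from mxnet import nd

def mvn_log_pdf(x, mu, sigma):
    """log N(x | mu, sigma) for one k-dim sample x (k,), mean mu (k,), covariance sigma (k, k)."""
    k = x.shape[0]
    L = nd.linalg.potrf(sigma)               # Cholesky factorization: sigma = L L^T
    diff = (x - mu).reshape((k, 1))
    z = nd.linalg.trsm(L, diff)              # solve L z = (x - mu)
    maha = (z * z).sum()                     # Mahalanobis term (x-mu)^T sigma^{-1} (x-mu)
    log_det = 2.0 * nd.linalg.sumlogdiag(L)  # log|sigma| from the Cholesky diagonal
    return -0.5 * (maha + log_det + k * math.log(2.0 * math.pi))
```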


@eric-haibin-lin (Member)

@asmushetzel this is awesome stuff. Really looking forward to seeing the contribution back in the future. Do you have a distribution like https://www.tensorflow.org/api_docs/python/tf/initializers/truncated_normal?

@asmushetzel (Contributor)

We are talking with the MXFusion people. They haven't implemented the PDFs mentioned here and are not planning to do so.
Concerning truncated_normal above: I think that request is primarily about a sampler (though we should provide a PDF/PMF for every sampler that we support in MXNet anyway). Building such a sampler should be little work when slotted in alongside the already existing ones (normal, uniform, gamma, etc.); a naive version is sketched below. I will see whether I can get some resources.
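
For illustration, the naive version of such a sampler, rejection sampling on top of the existing normal sampler, could look like this (illustrative sketch only; a proper implementation inside the C++ sampler framework would need to handle extreme truncation bounds more efficiently):

```python
from mxnet import nd

def truncated_normal(loc, scale, low, high, shape):
    out = nd.random.normal(loc=loc, scale=scale, shape=shape)
    oob = (out < low) + (out > high)      # 1.0 wherever a draw fell outside [low, high]
    while oob.sum().asscalar() > 0:
        redraw = nd.random.normal(loc=loc, scale=scale, shape=shape)
        out = nd.where(oob, redraw, out)  # replace rejected draws, keep accepted ones
        oob = (out < low) + (out > high)
    return out

samples = truncated_normal(0.0, 1.0, low=-2.0, high=2.0, shape=(1000,))
```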

@asmushetzel (Contributor)

PR #14579 will bring in log-pdf/pdf of almost all distributions mentioned above.

@asmushetzel (Contributor)

For technical reasons, PR #14579 has been moved to a new PR, #14617.

@yulinliu101

It seems that the solution from PR #14617 is not sufficient to implement fully functional Gaussian policies in continuous-action RL tasks.

However, in MXNet 2.0 Alpha we have probability-distribution support similar to the TensorFlow and PyTorch distribution modules. It would be great if we could merge these implementations from 2.0 into 1.x, since there is still a large group of users on MXNet 1.x.
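
For anyone landing here, usage in 2.0 looks roughly like this (the module is mxnet.gluon.probability; the exact signatures below are from memory of the 2.0 Alpha docs and should be treated as illustrative):

```python
from mxnet import np
from mxnet.gluon.probability import Normal

policy = Normal(loc=np.array([0.0]), scale=np.array([1.0]))
action = policy.sample()           # draw an action
log_p = policy.log_prob(action)    # differentiable log probability
entropy = policy.entropy()         # analytic entropy
```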
