Implement random method for LKJCorr#2443

Merged
junpenglao merged 13 commits into pymc-devs:master from junpenglao:lkj_random
Aug 3, 2017

Conversation


@junpenglao junpenglao commented Jul 25, 2017

Using the algorithm in LKJ 2009 (vine method based on a C-vine).

  • Implement test.
P = np.zeros((n, n)) # partial correlations
r_triu = []

for k in range(n-1):
Member

Is it possible to vectorize it?

Member Author

I cannot see an easy way to do it; the step below, converting the partial correlations to raw correlations, is especially tricky.
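For intuition, in the n = 3 case that conversion reduces to the standard partial-correlation identity, which can be checked directly (the values below are illustrative, not from the PR):

```python
import numpy as np

# Illustrative n = 3 check (hypothetical values): the recursion converts
# the partial correlation p23_1 (rho_23 given variable 1) to the raw
# correlation r23 via
#   r23 = p23_1 * sqrt((1 - r12**2) * (1 - r13**2)) + r12 * r13
r12, r13, p23_1 = 0.3, 0.5, 0.2
r23 = p23_1 * np.sqrt((1 - r12**2) * (1 - r13**2)) + r12 * r13

# Inverting with the standard partial-correlation formula recovers p23_1:
p_back = (r23 - r12 * r13) / np.sqrt((1 - r12**2) * (1 - r13**2))
print(round(p_back, 10))  # 0.2
```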

@junpenglao
Member Author

Any suggestions for implementing a test? I am stuck. @ferrine @aseyboldt

@ferrine
Member

ferrine commented Jul 31, 2017

What about comparing a KDE with the theoretical density?
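A minimal sketch of that idea, assuming the known LKJ marginal: for LKJ(eta) in dimension n, each off-diagonal element is marginally Beta(a, a) on [0, 1] with a = eta - 1 + n/2, rescaled to [-1, 1]. The direct beta draw below stands in for the PR's random method:

```python
import numpy as np
from scipy import stats

# Stand-in for samples of one off-diagonal element of an LKJ(eta) matrix:
# marginally Beta(a, a) rescaled from [0, 1] to [-1, 1].
n, eta = 3, 1.0
a = eta - 1 + n / 2
samples = (stats.beta.rvs(a, a, size=5000, random_state=42) - 0.5) * 2

# Compare a KDE of the samples against the theoretical density.
kde = stats.gaussian_kde(samples)
grid = np.linspace(-0.9, 0.9, 50)
theoretical = stats.beta.pdf((grid + 1) / 2, a, a) / 2  # change of variables

max_err = np.max(np.abs(kde(grid) - theoretical))
print(max_err)  # should be small for a correct sampler
```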

for k in range(n-1):
    beta -= 1/2
    for i in range(k+1, n):
        P[k, i] = stats.beta.rvs(a=eta, b=eta) # sampling from beta
Member

You'd better create these samples beforehand

@twiecki
Member

twiecki commented Jul 31, 2017

Or you could run a KS-test between true and sampled density.
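A sketch of that KS-test idea, again using the Beta(a, a) marginal rescaled to [-1, 1] as the reference distribution (a = eta - 1 + n/2; the direct beta draw stands in for the PR's random output):

```python
import numpy as np
from scipy import stats

n, eta = 3, 1.0
a = eta - 1 + n / 2
# Stand-in sample of one off-diagonal element, rescaled to [-1, 1].
samples = (stats.beta.rvs(a, a, size=2000, random_state=0) - 0.5) * 2

# Theoretical CDF of x = 2u - 1 with u ~ Beta(a, a).
def scaled_beta_cdf(x):
    return stats.beta.cdf((x + 1) / 2, a, a)

stat, pvalue = stats.kstest(samples, scaled_beta_cdf)
print(stat, pvalue)  # a correct sampler should give a non-tiny p-value
```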

@junpenglao
Member Author

I see - is there a similar test I can reference in our test suite? @ferrine @twiecki

@aseyboldt
Member

@junpenglao junpenglao changed the title [WIP] implement random method for LKJCorr Implement random method for LKJCorr Aug 1, 2017
@junpenglao
Member Author

Thanks everyone! I think I finally figured it out ;-)

for k, i in zip(triu_ind[0], triu_ind[1]):
    p = P[k, i]
    for l in range(k-1, -1, -1): # convert partial correlation to raw correlation
        p = p * np.sqrt((1-P[l, i]**2)*(1-P[l, k]**2)) + P[l, i]*P[l, k]
Member

PEP 8 wants whitespace around math operators.

samples = generate_samples(stats.beta.rvs, eta, eta,
                           dist_shape=self.shape,
                           size=size)
samples = (samples-0.5)*2
Member

white-spaces

for k, i in zip(triu_ind[0], triu_ind[1]):
    p = P[k, i]
    for l in range(k-1, -1, -1): # convert partial correlation to raw correlation
        p = p * np.sqrt((1 - P[l, i]**2) *
Member Author

What I am doing here is slow in a for loop; should I change it to a reduce, @ColCarroll?

Member

For stuff like this we could also consider using numba if it is available (check whether it is available and create a no-op replacement if it is not).
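A sketch of that optional-numba pattern (the decorated function below is a hypothetical example, not PyMC3 code): try to import numba, and fall back to a no-op decorator when it is missing, so the same source runs with or without the dependency.

```python
import numpy as np

try:
    from numba import jit
except ImportError:
    # No-op replacement: supports both @jit and @jit(nopython=True) usage.
    def jit(*args, **kwargs):
        if len(args) == 1 and callable(args[0]) and not kwargs:
            return args[0]
        def decorator(func):
            return func
        return decorator

@jit(nopython=True)
def partial_to_raw_step(p, a, b):
    # One step of the partial-to-raw correlation recursion.
    return p * np.sqrt((1 - a**2) * (1 - b**2)) + a * b

print(round(partial_to_raw_step(0.2, 0.3, 0.5), 4))  # 0.3152
```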

self.tri_index[np.triu_indices(n, k=1)] = np.arange(shape)
self.tri_index[np.triu_indices(n, k=1)[::-1]] = np.arange(shape)

def _random(self, n, eta, size=None):
Member

Should be static, I suppose.

Member

Seems like the size argument is ignored.

Member Author

Yep, I ignored the size, as the _random method here can only generate one slice of the random matrix.

Member Author

@junpenglao junpenglao Aug 2, 2017

Thanks for the suggestion - I managed to use the size and vectorize the _random method.

@junpenglao
Member Author

junpenglao commented Aug 2, 2017

OK, the implementation should be correct, as the distribution of the matrix elements matches the beta distribution. However, I am getting non-positive-definite matrices when the dimension n > 3:

import numpy as np
from scipy import stats

# cherry picked LKJ random function
def lkj_random(n, eta, size=None):
    beta0 = eta - 1 + n/2
    shape = n * (n-1) // 2
    triu_ind = np.triu_indices(n, 1)
    beta = np.array([beta0 - k/2 for k in triu_ind[0]])
    # partial correlations sampled from beta dist.
    P = np.ones((n, n) + (size,))
    P[triu_ind] = stats.beta.rvs(a=beta, b=beta, size=(size,) + (shape,)).T
    # scale partial correlation matrix to [-1, 1]
    P = (P - .5) * 2
    
    for k, i in zip(triu_ind[0], triu_ind[1]):
        p = P[k, i]
        for l in range(k-1, -1, -1):  # convert partial correlation to raw correlation
            p = p * np.sqrt((1 - P[l, i]**2) *
                            (1 - P[l, k]**2)) + P[l, i] * P[l, k]
        P[k, i] = p
        P[i, k] = p

    return np.transpose(P, (2, 0, 1))

def is_pos_def(A):
    if np.array_equal(A, A.T):
        try:
            np.linalg.cholesky(A)
            return 1
        except np.linalg.linalg.LinAlgError:
            return 0
    else:
        return 0

P = lkj_random(4, 1., 1000)
k = 0
for i, p in enumerate(P):
    k += is_pos_def(p)
print(k)

Thoughts? @aseyboldt

@aseyboldt
Member

@junpenglao I haven't looked at it in detail, but should there really be that many 1s in the partial correlations P? Maybe something like P = np.ones((n, n) + (size,)) / 2?

@junpenglao
Member Author

junpenglao commented Aug 2, 2017

@aseyboldt at the end only the diagonal is 1. Currently the LKJ random method returns the upper triangular elements (same as the distribution), but I was just doing some tests in the code above.

@aseyboldt
Member

Sorry, you are right.
The partial correlations are just a scaled version of the precision matrix, right? So maybe you could use that to check the result?

@junpenglao
Member Author

junpenglao commented Aug 2, 2017

@aseyboldt not sure I understand what you mean.

I also compared with a Julia implementation; both have the same output. So it might be a numerical stability issue.

[EDIT] Trying to figure out whether I have the same output as Julia or Stan.

@aseyboldt
Member

Hm. But shouldn't A_{ij} / np.sqrt(A_{ii} * A_{jj}) equal the original partial correlations, where A = Σ^{-1}? It doesn't. And sometimes the eigenvalues are definitely negative, much more so than could be explained by some simple rounding.

@aseyboldt
Member

I'm using this to compute the partial correlations:

from scipy import linalg

tau = linalg.inv(P[0])

partial = tau.copy()
partial /= np.sqrt(np.diag(tau))[None, :]
partial /= np.sqrt(np.diag(tau))[:, None]

(And just added a print statement in the function to get the original values)

@junpenglao
Member Author

junpenglao commented Aug 2, 2017

I will also compare with the implementation in R from @rmcelreath (https://github.com/rmcelreath/rethinking/blob/master/R/distributions.r#L165-L184); I cannot really figure it out in Stan and Julia.

@aseyboldt
Member

Sorry, my last comment was wrong, I looked at the wrong array. I just deleted it.

generated Corr matrix is now positive definite.
@junpenglao
Member Author

Turns out the R implementation from @rmcelreath is the most stable - tests pass locally and samples are mostly positive definite (it fails sometimes when n is large and eta << 1, but it is much better than the previous implementation nonetheless).

@junpenglao
Member Author

@aseyboldt The current random method could potentially be extended for generating LKJCholeskyCov, as it produces a triangular matrix as an intermediate step.

@junpenglao junpenglao merged commit f845575 into pymc-devs:master Aug 3, 2017
@junpenglao junpenglao deleted the lkj_random branch August 3, 2017 13:48
