Integration with ArviZ #35
Conversation
force-pushed from 8d48e5a to 9b0c1e7
I'll try to keep an eye on this and help wherever possible; I should have time to review and answer questions about `InferenceData`.
Thanks! I'm doing a lot of code reading and trial and error at the moment; I'll ping you when I have something tangible!
force-pushed from d319196 to a320664
@OriolAbril Several of the metrics in `sample_stats` require extra computation. It can be problematic when users are first iterating on a model and only need the posterior samples. So I was wondering if you thought overriding some of the class' attributes with a `@property` could work. I'll first finish this draft precomputing everything and will implement it when I think I have a good design.
> Not sure I understand, you mean so that `lp` and other sample stats are available but only computed whenever needed?

Yes, I could do that by defining a `sample_stats` method in my `Trace` class that overrides the corresponding attribute in `InferenceData`. Do I make sense?

> I think we have not considered it (could be wrong though). I don't necessarily think you would need to do that on your end. What we currently have for the `log_likelihood` group (it is useful for loo/waic but otherwise requires extra memory and extra computation) is an argument to `from_pymc3`, `from_pyro`, ... so that users can choose whether or not the group is to be included in the resulting `InferenceData`. The same could be done for `sample_stats` and its variables.

I did not see that, I will check it out!
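For illustration, the deferred computation being discussed here could be sketched in plain Python with a `@property`; all names below are hypothetical, not the actual MCX or ArviZ API:

```python
class Trace:
    """Sketch of a trace whose sample stats are computed on first access
    only. Illustrative names; not the actual MCX implementation."""

    def __init__(self, samples, raw_sampler_info):
        self.samples = samples
        self._raw_info = raw_sampler_info
        self._sample_stats = None  # nothing is computed up front

    @property
    def sample_stats(self):
        # Deferred: users iterating on a model who only look at the
        # posterior never pay for this conversion.
        if self._sample_stats is None:
            self._sample_stats = self._compute_stats()
        return self._sample_stats

    def _compute_stats(self):
        # Hypothetical conversion of raw sampler output into named stats.
        return {"num_draws": len(self.samples), **self._raw_info}
```

With this pattern, building a `Trace` costs nothing extra, and the stats are converted exactly once, on first access.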
Never mind, I ran some simple benchmarks and it looks like the running time is only marginally affected by all the conversions. I believe I also found a way to get the `log_likelihood` for free.

Edit: It is free in terms of computation, but it does add complexity in the inference core that I am not willing to add if I don't have to. However, it is possible and might make sense to get it when I compute the values of deterministic variables. I guess that's one issue with having too much freedom 🤷‍♂️
I can confirm that all plots / stats that take an `InferenceData` as input work.
We add the warmup samples, the warmup sampling info, and the warmup tuning information (such as the inverse mass matrix and step size) to the trace.
Just implemented concatenation. I will now add the `append` method.
@OriolAbril if you want to have a look, the magic happens in trace.py. Everything that should work with ArviZ does, as does the (in-place) addition. I just need to add the `append` method.
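The (in-place) addition mentioned here could, in spirit, be sketched like this (illustrative names; draws are stored as a plain list rather than JAX arrays to keep the snippet self-contained):

```python
class Trace:
    """Sketch of in-place trace addition: `trace += other` appends the
    other trace's draws. Not the actual MCX implementation."""

    def __init__(self, samples):
        self.samples = list(samples)

    def __iadd__(self, other):
        # Concatenating along the draw axis lets iterative sampling keep
        # extending the same trace object.
        self.samples.extend(other.samples)
        return self
```

Using `__iadd__` means `trace += new_trace` mutates the existing trace instead of allocating a fresh copy on every round of sampling.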
force-pushed from e56c92c to a5b5b76
force-pushed from 8973cb4 to 6eee6fc
Found a performance issue linked to the fact that […]
ArviZ is the best package for the exploratory analysis of models, so it makes sense to provide a seamless integration with the library. I chose to implement the `Trace` object as a subclass of ArviZ's `InferenceData`, so MCX traces can be used directly in ArviZ. There is a loss of information in that translation, since the sampler returns pretty much all there is to know about the process, but we can get that information (say, for debugging) using iterative sampling.
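To make the design concrete, here is a self-contained sketch of the subclassing idea; `InferenceDataStub` stands in for `arviz.InferenceData` so the snippet runs without ArviZ, and all names are illustrative, not the actual MCX API:

```python
class InferenceDataStub:
    """Stand-in for arviz.InferenceData: stores named groups as attributes."""

    def __init__(self, **groups):
        for name, dataset in groups.items():
            setattr(self, name, dataset)


class Trace(InferenceDataStub):
    """Sketch of an MCX trace that *is* an InferenceData, so ArviZ
    functions that read groups such as `posterior` can use it directly."""

    def __init__(self, samples, raw_sampler_output=None):
        super().__init__(posterior=samples)
        # Keep the sampler's full output around, e.g. for debugging.
        self.raw_sampler_output = raw_sampler_output
```

The design choice is that anything accepting an `InferenceData` accepts a `Trace` for free, at the cost of discarding whatever raw sampler output does not fit the schema.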
- `lax.scan` loop;
- `io_numpyro.py`;
- `InferenceData` instance with chain positions. Test plotting.
- `log_likelihood` as a `@property` (it requires extra computation).
- `HMCInfo`
- `append` method to add a single sample to the trace. Should be fast.
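The `append` item above (adding a single sample to the trace) might look like the following sketch; to keep it fast, draws are buffered in a list and only materialized when read. Names are hypothetical:

```python
class Trace:
    """Sketch of fast per-sample appending for iterative sampling.
    Illustrative only; the real trace would hold JAX arrays."""

    def __init__(self):
        self._draws = []  # amortized O(1) appends

    def append(self, sample):
        # Store the raw draw; avoid copying or stacking arrays per call.
        self._draws.append(sample)

    def materialize(self):
        # Build the final draw sequence once, when actually needed.
        return list(self._draws)
```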