Pareto k diagnostics and Pareto smoothing #283

n-kall · 2023-04-26T11:27:59Z

Summary

This PR adds functionality for Pareto smoothing, following on from work by @ozanstats and @avehtari to address #237. It adds two primary user-facing functions: pareto_smooth and pareto_khat. pareto_smooth performs smoothing on the tail(s) of the input and returns the smoothed draws and diagnostic values, whereas pareto_khat calls pareto_smooth but only returns the diagnostic values.

pareto_khat is designed for use in similar ways to the convergence functions rhat and ess_bulk. It is made to work in summarise_draws. On the other hand, pareto_smooth is probably most useful if one wants to do Pareto smoothing but not use the loo package for this (for example to implement moment-matching + psis in the iwmm package).

I believe the only functionality it does not implement that is mentioned in #237 is allowing r_eff to be a vector of different values (one for each variable). If r_eff is specified, it must be just a single value, as I'm not sure how to allow different r_eff values for each variable when using summarise_draws. However, when it is automatically calculated (default), then it works as intended and is calculated separately for each variable.

Current status

The functions should be working as intended, and the documentation is more or less complete (ready for comments). There are no tests yet, so any suggestions for recommended tests would be helpful.

Example functionality

mu <- extract_variable_matrix(example_draws(), "mu")
pareto_khat(mu)
# > 0.270711

ex <- example_draws()
summarise_draws(ex, pareto_khat, .args = list(extra_diags = TRUE))
# # A tibble: 10 × 5
#   variable      k min_ss khat_threshold convergence_rate
#   <chr>     <dbl>  <dbl>          <dbl>            <dbl>
# 1 mu       0.271    23.5          0.616            0.971
# 2 tau      0.0151   10.4          0.616            1.00
# 3 theta[1] 0.123    13.8          0.616            0.994
# 4 theta[2] 0.0681   11.8          0.616            0.998
# 5 theta[3] 0.362    37.0          0.616            0.937
# 6 theta[4] 0.211    18.5          0.616            0.984
# 7 theta[5] 0.145    14.8          0.616            0.992
# 8 theta[6] 0.181    16.6          0.616            0.988
# 9 theta[7] 0.265    22.9          0.616            0.973
#10 theta[8] 0.0888   12.5          0.616            0.997

Copyright and Licensing

By submitting this pull request, the copyright holder is agreeing to
license the submitted work under the following licenses:

Code: BSD 3-clause (https://opensource.org/licenses/BSD-3-Clause)
Documentation: CC-BY 4.0 (https://creativecommons.org/licenses/by/4.0/)

paul-buerkner · 2023-04-27T11:54:51Z

Please let me know once this PR is ready for review.

n-kall · 2023-05-03T11:33:19Z

I've now added tests for the pareto_khat and pareto_smooth functions. So I think now it is ready for review @paul-buerkner @avehtari.

I suppose tests for gpd functions could be imported from loo or the PR from @ozanstats here

paul-buerkner

This PR looks quite good already. I hope my comments help to further improve it. Thank you for working on this!

paul-buerkner · 2023-05-05T07:00:01Z

R/gpd.R

+
+#' @rdname GPD
+#' @export
+dgpd <- function(x, mu = 0, sigma = 1, k = 0, log = FALSE) {


Would it be sensible to make all the functions of the GPD vectorized across mu, sigma, and k?

Als the names are pretty short. How about (d|p...)generalized_pareto?

After thinking about these functions, I realised that only dgpd is used internally for the calculation of the marginal posterior in gpdpost. extraDistr already provides GPD (using a C++ implementation), so I'm not sure posterior needs to have reimplementations of them. @avehtari can you see a clear use for having the GPD functions (q*, p*, r*) directly available via posterior or could we just have dgpd for internal use and only export gpdpost?

For now, I have removed most of these functions from the PR, leaving only the necessary ones for point estimate diagnostics and pareto smoothing. I have also changed the remaining to internal functions, not exported.

R/gpd.R

paul-buerkner · 2023-05-05T07:03:28Z

R/pareto_smooth.R

+
+#' @rdname pareto_khat
+#' @export
+pareto_khat.rvar <- function(x, ...) {


@mjskay could you have a look and check if this pattern for rvars is the most sensible approach (same for pareto_smooth below)?

I took a look, and I'm not sure the resulting output format is quite what I'd expect for rvars given how we've tended to define other diagnostics and transformations on rvars, which tend to return rvars (the above returns the draws and the diagnostics of the smoothed rvar as a flattened list of arrays).

If you do something like this for pareto_smooth:

pareto_smooth.rvar <- function(x, ...) { draws_diags <- summarise_rvar_by_element_with_chains(x, pareto_smooth.default, ...) dim(draws_diags) <- dim(draws_diags) %||% length(draws_diags) margins <- seq_along(dim(draws_diags)) list( x = rvar(apply(draws_diags, margins, function(x) x[[1]]$x), nchains = nchains(x)), khat = apply(draws_diags, margins, function(x) x[[1]]$diagnostics$khat) ) }

Then the smoothed sample is returned as an rvar with the same structure as the input, and the khat diagnostic is returned as an array with the same dimensions as the input; e.g.:

set.seed(1234) x = rvar_rng(rnorm, 4, 1:4) dim(x) = c(2,2) pareto_smooth(x) # $x # rvar<4000>[2,2] mean ± sd: # [,1] [,2] # [1,] 1 ± 1.00 3 ± 0.99 # [2,] 2 ± 0.99 4 ± 1.00 # # $khat # [,1] [,2] # [1,] -0.12174198 -0.07504085 # [2,] -0.05649535 -0.01960201

I think something along those lines would make the most sense to me, unless I've misunderstood something about the intent here.

Thanks for this suggestion. I have modified the code accordingly, and also made it handle the extra diagnostics. The output is now like this for rvars:

pareto_smooth(x, extra_diags = TRUE) $x rvar<4000>[2,2] mean ± sd: [,1] [,2] [1,] 1 ± 1.00 3 ± 0.99 [2,] 2 ± 0.99 4 ± 1.00 $diagnostics $diagnostics$khat [,1] [,2] [1,] -0.12174198 -0.07504085 [2,] -0.05649535 -0.01960201 $diagnostics$min_ss [,1] [,2] [1,] 10 10 [2,] 10 10 $diagnostics$khat_threshold [,1] [,2] [1,] 0.7223811 0.7223811 [2,] 0.7223811 0.7223811 $diagnostics$convergence_rage [,1] [,2] [1,] 1 1 [2,] 1 1

@mjskay can you check if this is what you had in mind?

yeah, looks good to me!

R/pareto_smooth.R

n-kall · 2023-05-07T05:30:13Z

Thanks for the review and comments @paul-buerkner and @mjskay! I'll have an updated version in the coming days with these suggestions addressed.

n-kall · 2023-05-09T08:31:26Z

After thinking about this PR more, I am going to split it into two separate PRs:

This PR: the basic Pareto smoothing functions and (point estimate) diagnostics (pretty much complete)
Another PR: the functions for the marginal posterior of the Pareto k (needs more work).

As I still need to think a bit more about 2, I will remove the parts pertaining to it from this PR, so they do not hold it up.

n-kall · 2023-05-12T12:48:43Z

Note, a fix is needed for the extra_diags:

if khat > 1, min_ss should be Inf

paul-buerkner · 2023-05-12T13:01:35Z

Please let me know, once the PR is ready to merge from your side.

n-kall · 2023-05-13T06:47:11Z

There may be a slight delay with finishing this PR as the repository of the fork seems to be corrupted on GitHub and is not able to accept new commits or even be cloned. I've filed a ticket with GitHub, hoping for a quick response.

n-kall · 2023-05-17T08:28:34Z

After discussion with @avehtari we decided to make some further adjustments:

separate the extra_diags into another function pareto_diags
allow pareto_smooth to only return the smoothed draws, and not diagnostics (for use in mutate_variables for example)
add interpretation details into the documentation

github-actions · 2023-05-25T09:56:47Z

This is how benchmark results would change (along with a 95% confidence interval in relative change) if c7101a2 is merged into master:

:ballot_box_with_check:as_draws_array: 158ms -> 158ms [-1.11%, +0.64%]
:ballot_box_with_check:as_draws_df: 128ms -> 127ms [-3.07%, +1.89%]
:ballot_box_with_check:as_draws_list: 272ms -> 275ms [-0.71%, +2.47%]
:ballot_box_with_check:as_draws_matrix: 49.1ms -> 48.9ms [-1.94%, +1.28%]
:ballot_box_with_check:as_draws_rvars: 247ms -> 250ms [-0.27%, +2.74%]
:ballot_box_with_check:summarise_draws_100_variables: 1.08s -> 1.08s [-0.76%, +0.81%]
:rocket:summarise_draws_10_variables: 193ms -> 121ms [-37.75%, -36.63%]
Further explanation regarding interpretation and methodology can be found in the documentation.

github-actions · 2023-05-25T11:09:12Z

This is how benchmark results would change (along with a 95% confidence interval in relative change) if ada117a is merged into master:

:ballot_box_with_check:as_draws_array: 131ms -> 130ms [-4.8%, +3.55%]
:ballot_box_with_check:as_draws_df: 111ms -> 112ms [-3.44%, +4.67%]
:ballot_box_with_check:as_draws_list: 230ms -> 236ms [-0.31%, +5.25%]
:ballot_box_with_check:as_draws_matrix: 40.2ms -> 40.8ms [-2.79%, +5.94%]
:ballot_box_with_check:as_draws_rvars: 211ms -> 212ms [-4.11%, +4.5%]
:ballot_box_with_check:summarise_draws_100_variables: 871ms -> 888ms [-2.98%, +6.69%]
:rocket:summarise_draws_10_variables: 157ms -> 98.3ms [-39.8%, -34.82%]
Further explanation regarding interpretation and methodology can be found in the documentation.

github-actions · 2023-08-07T13:38:15Z

This is how benchmark results would change (along with a 95% confidence interval in relative change) if a344f01 is merged into master:

:ballot_box_with_check:as_draws_array: 165ms -> 166ms [-0.76%, +1.2%]
:rocket:as_draws_df: 142ms -> 57.6ms [-60.72%, -58.26%]
:ballot_box_with_check:as_draws_list: 301ms -> 303ms [-0.68%, +1.7%]
:ballot_box_with_check:as_draws_matrix: 51.4ms -> 51.1ms [-2.54%, +1.62%]
:ballot_box_with_check:as_draws_rvars: 276ms -> 277ms [-1.05%, +1.79%]
:rocket:summarise_draws_100_variables: 1.16s -> 1.15s [-2.02%, -0.64%]
:ballot_box_with_check:summarise_draws_10_variables: 128ms -> 126ms [-3.74%, +1.17%]
Further explanation regarding interpretation and methodology can be found in the documentation.

avehtari

Went through the doc parts. Found things to improve and some typos

R/gpd.R

tests/testthat/test-pareto_smooth.R

R/pareto_smooth.R

avehtari · 2023-08-30T09:45:42Z

R/pareto_smooth.R

+
+  # automatically calculate relative efficiency
+  if (is.null(r_eff)) {
+    r_eff <- ess_basic(x) / S


Ah, now that I rehink this, ess_tail would be more appropriate when r_eff is used to determine tail_length

R/pareto_smooth.R

man-roxygen/args-pareto.R

github-actions · 2023-08-30T13:26:15Z

This is how benchmark results would change (along with a 95% confidence interval in relative change) if 4962965 is merged into master:

:ballot_box_with_check:as_draws_array: 163ms -> 162ms [-0.77%, +0.09%]
:ballot_box_with_check:as_draws_df: 55.6ms -> 55.6ms [-1.03%, +0.98%]
:ballot_box_with_check:as_draws_list: 288ms -> 288ms [-1.01%, +0.41%]
:ballot_box_with_check:as_draws_matrix: 50.2ms -> 50ms [-2.41%, +1.5%]
:ballot_box_with_check:as_draws_rvars: 264ms -> 266ms [-0.64%, +2.06%]
:rocket:summarise_draws_100_variables: 1.12s -> 1.11s [-1.55%, -0.38%]
:ballot_box_with_check:summarise_draws_10_variables: 124ms -> 124ms [-0.32%, +0.64%]
Further explanation regarding interpretation and methodology can be found in the documentation.

avehtari

docs ok now

paul-buerkner · 2023-08-30T18:53:44Z

Great! @avehtari can I merge the PR now?

avehtari · 2023-08-31T06:53:09Z

I approve merging

paul-buerkner · 2023-08-31T07:00:39Z

Perfect, thank you both so much for adding this functionality to posterior!

jgabry · 2023-08-31T19:08:12Z

Perfect, thank you both so much for adding this functionality to posterior!

Indeed, thank you! I've been preoccupied with a bunch of other Stan R package issues, so I hadn't looked through this yet. Very cool!

Ozan147 and others added 15 commits August 22, 2022 08:54

added pareto-k diagnostics

5f38bee

Begin cleanup of Pareto-smoothing functions

9ca1c1a

Minor changes to pareto-smooth functions

73380a9

add left tail pareto-k calculation

11c4068

fix pareto_khat.default to not convert to draws

9290c21

add pareto-khat calculation for "both" tails

c51bb53

improve pareto-k diagnostic message handling

4f95834

calculate ndraws_tail for pareto_k in accordance with psis paper

a1d1486

refactor pareto-smoothing functions

500fc60

Merge branch 'stan-dev:master' into pareto_k

b2d6e96

Improve documentation for pareto smoothing functions

76cfe41

Merge branch 'pareto_k' of github.com:n-kall/posterior into pareto_k

7ec263f

fix documentation for ndraws_tail in pareto smoothing functions

cae2893

update documentation for pareto smoothing

30dff2e

return pareto diagnostics as list

0d1acf9

andrjohns assigned avehtari and paul-buerkner Apr 28, 2023

n-kall and others added 2 commits May 3, 2023 14:01

Add tests and cleanup pareto diagnostics

049d8fe

Merge branch 'stan-dev:master' into pareto_k

75d9c67

paul-buerkner requested changes May 5, 2023

View reviewed changes

n-kall added 6 commits May 9, 2023 11:36

improvements to pareto smooth functions (fix rvar method, simplify)

3ae4af4

cleanup and documentation

6e232c3

Merge branch 'pareto_k' of github.com:n-kall/posterior into pareto_k

9778a3d

remove currently unused genralized pareto distribution functions

ef9a2b3

handle extra diagnostics in pareto_khat.rvar

077c975

improve documentation and comments for pareto smoothing functions

ca58e6d

n-kall added 2 commits May 12, 2023 18:14

set minimum ss when pareto-k >=1 to infinity

982d408

Merge branch 'pareto_k' of github.com:n-kall/posterior into pareto_k

7049233

n-kall and others added 3 commits May 13, 2023 09:50

set min_ss based on pareto_k to inf if k >= 1

ad68f27

Merge branch 'pareto_k' of github.com:n-kall/posterior into pareto_k

7ff235b

add pareto smoothing reference in docs

3e823e7

n-kall added 3 commits May 17, 2023 12:24

add new function pareto_diags and adjust existing functions

4c9aa75

expand pareto diagnostic documentation

c383f50

add further tests for pareto functions

4af4ede

add documentation file fo pareto_diags

54011ec

This was referenced Jun 23, 2023

Replace loo::psis with posterior::pareto_khat (when available) topipa/iwmm#2

Closed

Use pareto smoothing functions from posterior package topipa/iwmm#3

Merged

n-kall added 2 commits July 31, 2023 12:10

Merge branch 'stan-dev:master' into pareto_k

bb64c2a

Merge branch 'stan-dev:master' into pareto_k

f107908

avehtari requested changes Aug 30, 2023

View reviewed changes

use ess_tail for r_eff, documentation fixes, typofixes

a85b9b5

avehtari approved these changes Aug 30, 2023

View reviewed changes

paul-buerkner merged commit df1ab19 into stan-dev:master Aug 31, 2023

avehtari mentioned this pull request Aug 31, 2023

Add Pareto-khat diagnostic #237

Closed

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pareto k diagnostics and Pareto smoothing #283

Pareto k diagnostics and Pareto smoothing #283

n-kall commented Apr 26, 2023

paul-buerkner commented Apr 27, 2023

n-kall commented May 3, 2023

paul-buerkner left a comment

paul-buerkner May 5, 2023

n-kall May 7, 2023 •

edited

Loading

n-kall May 9, 2023

paul-buerkner May 5, 2023

mjskay May 7, 2023

n-kall May 9, 2023

paul-buerkner May 9, 2023

mjskay May 12, 2023

n-kall commented May 7, 2023

n-kall commented May 9, 2023

n-kall commented May 12, 2023

paul-buerkner commented May 12, 2023

n-kall commented May 13, 2023

n-kall commented May 17, 2023

github-actions bot commented May 25, 2023

github-actions bot commented May 25, 2023

github-actions bot commented Aug 7, 2023

avehtari left a comment

avehtari Aug 30, 2023

github-actions bot commented Aug 30, 2023

avehtari left a comment

paul-buerkner commented Aug 30, 2023

avehtari commented Aug 31, 2023

paul-buerkner commented Aug 31, 2023

jgabry commented Aug 31, 2023

Pareto k diagnostics and Pareto smoothing #283

Pareto k diagnostics and Pareto smoothing #283

Conversation

n-kall commented Apr 26, 2023

Summary

Current status

Example functionality

Copyright and Licensing

paul-buerkner commented Apr 27, 2023

n-kall commented May 3, 2023

paul-buerkner left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

n-kall May 7, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

n-kall commented May 7, 2023

n-kall commented May 9, 2023

n-kall commented May 12, 2023

paul-buerkner commented May 12, 2023

n-kall commented May 13, 2023

n-kall commented May 17, 2023

github-actions bot commented May 25, 2023

github-actions bot commented May 25, 2023

github-actions bot commented Aug 7, 2023

avehtari left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

github-actions bot commented Aug 30, 2023

avehtari left a comment

Choose a reason for hiding this comment

paul-buerkner commented Aug 30, 2023

avehtari commented Aug 31, 2023

paul-buerkner commented Aug 31, 2023

jgabry commented Aug 31, 2023

n-kall May 7, 2023 •

edited

Loading