-
Notifications
You must be signed in to change notification settings - Fork 19
Description
Description
Current Issue with Mean-Only Approach
The current implementation uses only the GP mean prediction to calculate Sobol indices, but this misses a crucial component of uncertainty. When you only use the mean, you're assuming perfect knowledge of the true function - which defeats the purpose of having a probabilistic emulator.
What's Missing: Emulator Uncertainty
From the paper attached, we should treat the computer model as an unknown function with a GP posterior distribution. This means:
- Sobol indices themselves become random variables (not fixed numbers)
- We need posterior expectations instead of point estimates:
- As a consequence there is a modification to the standard Hoeffding-Sobol decomposition:
Defining the Terms:
-
$z_i(x_i) = E(Y|x_i) - E(Y)$ is the main effect of input$X_i$ -
$E^*[\cdot]$ denotes posterior expectation with respect to the GP -
$\text{var}^*[\cdot]$ denotes posterior variance with respect to the GP -
$V_i = \text{var}{z_i(X_i)}$ is the main effect variance
Variance Decomposition Breakdown:
-
$\text{var}[E^*{z_i(X_i)}]$ : Variation of the posterior mean prediction (what current implementation captures) -
$E[\text{var}^*{z_i(X_i)}]$ : Average uncertainty within the GP posterior about what the main effect actually is (currently ignored)
The Problem with Point Estimates
When using only the GP mean, you're effectively setting
Reproduction steps
The paper provides the formalisation on how one should properly compute Sobol indices for probablistic emulators.
Version
Screenshots

OS
No response
Metadata
Metadata
Assignees
Labels
Type
Projects
Status