Return NaN for negative ModeResult variance estimates #2471

frankier · 2025-01-20T11:44:11Z

Here's a modified example that gets negative estimates for variance of some parameters (coefficients_versicolor[3]):

using Turing
using RDatasets
using StatsPlots
using MLDataUtils: shuffleobs, splitobs, rescale!
using NNlib: softmax
using FillArrays
using LinearAlgebra
using Random
Random.seed!(0);

using Optim
using StatsBase

data = RDatasets.dataset("datasets", "iris");
data[rand(1:size(data, 1), 20), :]
species = ["setosa", "versicolor", "virginica"]
data[!, :Species_index] = indexin(data[!, :Species], species)
data[rand(1:size(data, 1), 20), [:Species, :Species_index]]
trainset, testset = splitobs(shuffleobs(data), 0.5)
features = [:SepalLength, :SepalWidth, :PetalLength, :PetalWidth]
target = :Species_index

train_features = Matrix(trainset[!, features])
test_features = Matrix(testset[!, features])
train_target = trainset[!, target]
test_target = testset[!, target]

μ, σ = rescale!(train_features; obsdim=1)
rescale!(test_features, μ, σ; obsdim=1);

@model function logistic_regression(x, y, σ)
    n = size(x, 1)
    length(y) == n ||
        throw(DimensionMismatch("number of observations in `x` and `y` is not equal"))

    # Priors of intercepts and coefficients.
    intercept_versicolor ~ Normal(0, σ)
    intercept_virginica ~ Normal(0, σ)
    coefficients_versicolor ~ MvNormal(Zeros(4), σ^2 * I)
    coefficients_virginica ~ MvNormal(Zeros(4), σ^2 * I)

    # Compute the likelihood of the observations.
    values_versicolor = intercept_versicolor .+ x * coefficients_versicolor
    values_virginica = intercept_virginica .+ x * coefficients_virginica
    for i in 1:n
        # the 0 corresponds to the base category `setosa`
        v = softmax([0, values_versicolor[i], values_virginica[i]])
        y[i] ~ Categorical(v)
    end
end;

model = logistic_regression(train_features, train_target, 1)
mle_estimate = Optim.optimize(model, MLE())
println(coeftable(mle_estimate))

Without this PR, this will throw a DomainError in coeftable when calling getting the stderr of coefficients_versicolor[3].

frankier · 2025-01-20T11:50:37Z

This is related to #2048

I don't fully agree with the conclusion that there is nothing to fix in Turing.jl here.

Propagating a NaN makes it easier to inspect the coeftable and see that something has gone wrong with the optimisation process.

codecov · 2025-01-20T17:30:08Z

Codecov Report

Attention: Patch coverage is 73.33333% with 8 lines in your changes missing coverage. Please review.

Project coverage is 84.22%. Comparing base (24d5556) to head (8ad39c7).
Report is 4 commits behind head on master.

Files with missing lines	Patch %	Lines
src/optimisation/Optimisation.jl	73.33%	8 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #2471      +/-   ##
==========================================
- Coverage   85.01%   84.22%   -0.80%     
==========================================
  Files          21       21              
  Lines        1582     1597      +15     
==========================================
  Hits         1345     1345              
- Misses        237      252      +15

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

coveralls · 2025-01-20T19:01:18Z

Pull Request Test Coverage Report for Build 12867285375

Details

1 of 1 (100.0%) changed or added relevant line in 1 file are covered.
No unchanged relevant lines lost coverage.
Overall coverage increased (+0.02%) to 76.474%

Totals
Change from base Build 12786420948:	0.02%
Covered Lines:	1206
Relevant Lines:	1577

💛 - Coveralls

yebai

Thanks, @frankier. It looks like a nice improvement!

mhauru · 2025-01-21T10:20:07Z

Thanks @frankier, I agree that the current situation where coeftable fails with DomainError is not optimal, and this is an improvement. I wonder if we could be even more explicit though, and in our method for coeftable, catch the DomainError, print out a warning explaining that the solution seems to have negative variance and thus you should be very suspicious of your result, and then return a table with stderr as NaN. @frankier, as a user, do you think that would be helpful? This would also save us introducing a new dependency that we only use on a single line.

frankier · 2025-01-22T10:36:59Z

Yes, I think on balance that would be better.

I think there is also the possibility of getting a SingularException in inv, which I guess also indicates model identifiability (and thus optimization) problems. So I guess it's better to catch these, aggregate them and report how it failed alongside the table.

I'll update this PR to work this way soon.

src/optimisation/Optimisation.jl

mhauru

This is looking really nice @frankier! I had a few small proposals. Let me know once you're done making edits and I'm happy to merge. Also, I realise you're helping out on a volunteer basis, so if you don't have time to attend to my comments that's fine too, we can merge as is and I can add a couple of tests myself and call it done.

src/optimisation/Optimisation.jl

mhauru · 2025-02-12T11:45:51Z

test/optimisation/Optimisation.jl

+        @assert isnan(tab.cols[2][1])
+        @assert tab.colnms[end] == "Error notes"
+        @assert occursin("singular", tab.cols[end][1])
+    end


Could we also add a test where we check that for some harmless model where everything works out fine, coeftable returns the same result with and without numerrors_warnonly?

Also, a test case for negative variance would be great, to make sure tests cover all code paths.

Co-authored-by: Markus Hauru <[email protected]>

yebai approved these changes Jan 21, 2025

View reviewed changes

frankier force-pushed the neg-var-moderesult branch 2 times, most recently from 3ad099c to 937c1b6 Compare February 10, 2025 15:34

frankier requested a review from yebai February 10, 2025 15:34

Return NaN for negative ModeResult variance estimates

8ad39c7

frankier force-pushed the neg-var-moderesult branch from 937c1b6 to 8ad39c7 Compare February 11, 2025 12:34

yebai approved these changes Feb 11, 2025

View reviewed changes

mhauru reviewed Feb 12, 2025

View reviewed changes

src/optimisation/Optimisation.jl Show resolved Hide resolved

mhauru reviewed Feb 12, 2025

View reviewed changes

src/optimisation/Optimisation.jl Outdated Show resolved Hide resolved

mhauru reviewed Feb 12, 2025

View reviewed changes

src/optimisation/Optimisation.jl Outdated Show resolved Hide resolved

mhauru reviewed Feb 12, 2025

View reviewed changes

Apply suggestions from mhauru

124613c

Co-authored-by: Markus Hauru <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Return NaN for negative ModeResult variance estimates #2471

Return NaN for negative ModeResult variance estimates #2471

frankier commented Jan 20, 2025

frankier commented Jan 20, 2025

codecov bot commented Jan 20, 2025 •

edited

Loading

coveralls commented Jan 20, 2025

yebai left a comment

mhauru commented Jan 21, 2025

frankier commented Jan 22, 2025

mhauru left a comment

mhauru Feb 12, 2025

Return NaN for negative ModeResult variance estimates #2471

Are you sure you want to change the base?

Return NaN for negative ModeResult variance estimates #2471

Conversation

frankier commented Jan 20, 2025

frankier commented Jan 20, 2025

codecov bot commented Jan 20, 2025 • edited Loading

Codecov Report

coveralls commented Jan 20, 2025

Pull Request Test Coverage Report for Build 12867285375

Details

💛 - Coveralls

yebai left a comment

Choose a reason for hiding this comment

mhauru commented Jan 21, 2025

frankier commented Jan 22, 2025

mhauru left a comment

Choose a reason for hiding this comment

mhauru Feb 12, 2025

Choose a reason for hiding this comment

codecov bot commented Jan 20, 2025 •

edited

Loading