Refactor MSR Banzhaf valuation #605

janosg · 2024-06-19T15:31:49Z

Description

This PR implements Maximum Sample Re-use (MSR) Banzhaf valuation in the new architecture. The implementation deviates strongly from the previous implementation and fixes a bug in the variance estimation.

The new implementation uses two ValuationResult instances to keep track of the positive and negative running means. After each update, those are combined into the final result object. The update counter of the combined result is set to the minimum of the two update counters. The variance of the combined result is set to the sum of variances (assuming independence).
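For illustration, here is a minimal sketch of the bookkeeping described above for a single data point. It does not use pyDVL's ValuationResult; the `RunningMean` class and the `combine` function are hypothetical stand-ins, and the variance is the plain population variance of the utilities seen so far. The combined variance is taken as the sum of the two variances, exactly as stated in the description, which assumes independence.

```python
from dataclasses import dataclass


@dataclass
class RunningMean:
    """Hypothetical stand-in: Welford-style running mean and variance."""

    count: int = 0
    mean: float = 0.0
    m2: float = 0.0  # running sum of squared deviations from the mean

    def update(self, x: float) -> None:
        self.count += 1
        delta = x - self.mean
        self.mean += delta / self.count
        self.m2 += delta * (x - self.mean)

    @property
    def variance(self) -> float:
        # Population variance of the values seen so far (0.0 if empty).
        return self.m2 / self.count if self.count > 0 else 0.0


def combine(pos: RunningMean, neg: RunningMean) -> tuple[float, float, int]:
    """Combine the two running means as described in the PR text."""
    value = pos.mean - neg.mean             # difference of the two means
    variance = pos.variance + neg.variance  # sum, assuming independence
    count = min(pos.count, neg.count)       # minimum of the update counters
    return value, variance, count


# Utilities of sampled subsets that contain the point ...
pos = RunningMean()
for u in (0.8, 0.9, 1.0):
    pos.update(u)

# ... and of sampled subsets that do not contain it.
neg = RunningMean()
for u in (0.5, 0.7):
    neg.update(u)

value, variance, count = combine(pos, neg)
```

With three positive and two negative updates the combined counter is 2, so it only advances once both means have been updated.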

Open questions

How should update counters for MSR be defined? I chose the minimum of the positive and negative update counters because this makes the counters comparable to the counters for normal semivalues.
To which value should we set the value estimates if either the positive or the negative mean has had zero updates? The paper suggests zero, but I think nan is safer. Note that if the update counter is defined as suggested above, this situation can be avoided entirely by using MinUpdates as the stopping criterion.
To which value should we set the variance estimates if either the positive or the negative mean had zero updates? I chose inf.
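The choices proposed in these questions can be sketched in plain NumPy as follows. This is only an illustration under stated assumptions: `combine_with_guards` and `min_updates_reached` are hypothetical helpers, not pyDVL functions, and the second one merely mimics the idea of a minimum-updates criterion (stop once every index has been updated at least a given number of times).

```python
import numpy as np


def combine_with_guards(pos_mean, neg_mean, pos_var, neg_var, pos_n, neg_n):
    """Hypothetical helper: combine per-index positive and negative statistics."""
    pos_n = np.asarray(pos_n)
    neg_n = np.asarray(neg_n)
    # Update counter: minimum of the positive and negative counters.
    counts = np.minimum(pos_n, neg_n)
    # An index is undefined if either mean has never been updated.
    missing = counts == 0
    values = np.where(
        missing, np.nan, np.asarray(pos_mean, dtype=float) - np.asarray(neg_mean, dtype=float)
    )
    variances = np.where(
        missing, np.inf, np.asarray(pos_var, dtype=float) + np.asarray(neg_var, dtype=float)
    )
    return values, variances, counts


def min_updates_reached(counts, n_updates):
    """Hypothetical stand-in for a minimum-updates stopping criterion."""
    return bool(np.all(np.asarray(counts) >= n_updates))


values, variances, counts = combine_with_guards(
    pos_mean=[0.9, 0.0, 0.4],
    neg_mean=[0.6, 0.5, 0.0],
    pos_var=[0.01, 0.0, 0.02],
    neg_var=[0.01, 0.03, 0.0],
    pos_n=[3, 0, 2],  # index 1 has no positive updates
    neg_n=[2, 4, 0],  # index 2 has no negative updates
)
done = min_updates_reached(counts, n_updates=1)
```

Because the combined counter is zero whenever one side is missing, a criterion requiring at least one update per index cannot be satisfied while any value is still nan.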

Checklist

Wrote Unit tests (if necessary)
Updated Changelog

janosg · 2024-06-24T10:54:26Z

I clarified the documentation of ValuationResult.variances, but I still think it is a bit misleading that ValuationResult.variances is not just the square of ValuationResult.stderr.

I think it would be clearer if we only exposed the square root of the variances as ValuationResult.stdev; then it would be clear that the difference between stdev and stderr must be a conceptual one. Also, standard deviations are usually more interpretable than variances.
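To make the conceptual difference concrete, here is a small NumPy sketch. It assumes that the variances describe the spread of the sampled marginals and that the standard error is obtained by dividing the variance by the number of updates before taking the square root; both are assumptions made for illustration, and the names are not taken from the code in this PR.

```python
import numpy as np

rng = np.random.default_rng(0)
# Synthetic marginal utilities for one data point (illustrative data only).
marginals = rng.normal(loc=1.0, scale=2.0, size=400)

variance = marginals.var()                   # spread of the marginals
stdev = np.sqrt(variance)                    # same information, original units
stderr = np.sqrt(variance / marginals.size)  # uncertainty of the mean estimate

# stdev stays roughly constant as more samples arrive, whereas
# stderr shrinks with the square root of the number of updates.
ratio = stdev / stderr
```

Here the ratio equals the square root of the sample count, so squaring stderr does not recover the variance.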

src/pydvl/valuation/methods/msr_banzhaf.py

jakobkruse1

I just looked over the files, and everything seems good to me at first glance. I have not run any code or tests yet though, so I can't be sure. Also, the CI is not running for this branch. Are we updating the notebooks for the new valuation structure as well, or is this planned for a later stage? Looking at the notebooks may help to verify that everything works as expected.
I am not aware of the current plans about notebooks etc. so feel free to ignore me if there are any decisions that I am unaware of.

src/pydvl/valuation/methods/msr_banzhaf.py

src/pydvl/valuation/samplers/msr.py

src/pydvl/valuation/types.py

Co-authored-by: Kristof Schröder <[email protected]>

src/pydvl/valuation/methods/msr_banzhaf.py

Co-authored-by: Kristof Schröder <[email protected]>

First draft for new MSR Banzhaf.

fe23e79

janosg changed the base branch from develop to feature/refactor-value June 19, 2024 15:32

janosg added 3 commits June 20, 2024 15:20

Polishing and docstrings.

9900c60

Run mypy.

675934f

Update changelog.

e4145f8

janosg requested review from mdbenito and jakobkruse1 June 20, 2024 16:01

Clarify documentation of variances and stderr in ValuationResult.

204dc53

janosg commented Jun 26, 2024

src/pydvl/valuation/methods/msr_banzhaf.py (outdated)

jakobkruse1 reviewed Jun 27, 2024

janosg added 6 commits July 1, 2024 10:40

Fix variances for MSR.

d860268

Split history tests/test_results.py to tests/valuation/test_result.py…

5bd3ef0

… - rename file to target-name

Split history tests/test_results.py to tests/valuation/test_result.py…

795c2bf

… - rename source-file to git-split-temp

Split history tests/test_results.py to tests/valuation/test_result.py…

ee8c693

… - resolve conflict and keep both files

Split history tests/test_results.py to tests/valuation/test_result.py…

fa6bbfe

… - restore name of source-file

Add and adjust old tests for ValuationResult.

667136c

janosg requested a review from schroedk July 8, 2024 10:16