
ENH implement fair representation learner #1478

Open
wants to merge 9 commits into main
Conversation

taharallouche
Member

Description

Tackles #1026

Hello! Here are the most important details, imo, regarding this implementation:

Comparison to original paper and AIF360 implementation

|  | Paper | AIF360 | FairRepresentationLearner |
| --- | --- | --- | --- |
| Multi-class target |  | ✅ (*) | ❌ (**) |
| > 2 sensitive feature groups |  |  | ✅ (***) |
| sklearn compatibility | - |  |  |
  • (*) I'm not sure the implementation for multi-class targets in AIF360 is correct, though: the class probabilities $w$ are bounded to lie in $(0,1)$, but they are not constrained to sum to $1$.
  • (**) I think we can handle multi-class cases: the optimization problem would have linear constraints and would need to be solved with optimizers other than the L-BFGS one the paper uses, among a couple of other adaptations.
  • (***) Achieved by including the absolute difference between each pair of groups in the $L_z$ loss term (more details in the user guide; a sketch of the generalized term is below).
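
To make (***) concrete, here is a sketch of the generalized statistical-parity term, where $M_k(x)$ denotes the probability of mapping sample $x$ to prototype $k$ and $X_a$ the set of samples in sensitive group $a$ (the exact weighting used in the implementation may differ):

$$
L_z = \sum_{k=1}^{K} \sum_{a < b} \left| \frac{1}{|X_a|} \sum_{x \in X_a} M_k(x) - \frac{1}{|X_b|} \sum_{x \in X_b} M_k(x) \right|
$$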

Fallback Estimator

In order to be sklearn-compatible, .fit should work with the default sensitive_features kwarg value, which is None.
To achieve that, I chose to fall back to fitting a regular LogisticRegression when no sensitive_features are passed.
The transformation in that case is the identity function, and the predictions are those of the fallback logistic regressor.
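
For illustration, a minimal sketch of how the fallback behaves (the import path and exact attribute behaviour are assumptions based on the description above):

```python
from sklearn.datasets import make_classification

from fairlearn.preprocessing import FairRepresentationLearner  # import path assumed

X, y = make_classification(n_samples=200, n_features=5, random_state=0)

# No sensitive_features passed: fit falls back to a plain LogisticRegression,
# transform becomes the identity, and predict delegates to the fallback classifier.
frl = FairRepresentationLearner(n_prototypes=4)
frl.fit(X, y)              # sensitive_features defaults to None
X_same = frl.transform(X)  # identity transformation in the fallback case
y_pred = frl.predict(X)    # predictions from the fallback logistic regressor
```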

Unit-tests

I have to say the number of unit tests feels small compared to the other modules, but I ran out of ideas for code paths to cover, especially since the sklearn estimator checks already cover many edge cases.

Tests

  • no new tests required
  • new tests added
  • existing tests adjusted

Documentation

  • no documentation changes needed
  • user guide added or updated
  • API docs added or updated
  • example notebook added or updated

Screenshots

*(4 screenshots)*

@taharallouche
Member Author

taharallouche commented Dec 28, 2024

I think that the CI is failing for the windows 3.8 job because pywinpty's pyproject.toml is lacking `version` (pywinpty is a dep of jupyterlite_sphinx), here's a related issue:

> the `project.version` field is required in `pyproject.toml` unless it is present in the `project.dynamic` list

maturin (which is not frozen in pywinpty's build) only very recently started to require the field.

@TamaraAtanasoska
Contributor

> I think that the CI is failing for the windows 3.8 job because pywinpty's pyproject.toml is lacking `version` (pywinpty is a dep of jupyterlite_sphinx), here's a related issue

I am back :) Wow, that is a big PR, thanks! I will look into the Windows issue so we can separate it from this PR and have a clean CI.

@TamaraAtanasoska
Contributor

@taharallouche it seems that the error was made less severe, so the builds pass now, and the library maintainers are keeping an eye on it. CI now builds with pywinpty, but there are unrelated test failures. Some sklearn estimator checks are failing around validation; try using this function in the fixes instead of the native one, as it is made to work with sklearn versions both before and after 1.6.

@TamaraAtanasoska
Contributor

I get notifications, so I'm just popping in here without a proper review to hopefully make your day easier. The sklearn 1.6 compatibility fixes were written for what we had, trying to add minimal code; if you are using some other functions that we didn't cover, you might want to port some of the code found here: https://github.com/sklearn-compat/sklearn-compat. If it is too much and becomes annoying, we can decide to vendor the whole library. It is a bit of a tedious thing; feel free to ping me on Discord about anything, I'd love to support if I can!

I wasn't clear enough the other day about the parametrize_with_checks compatibility function: you only need it if you have failing estimator checks that you want to record there, just for ease of development and a passing CI; the checks themselves can be addressed later. Since the compatibility function is only needed when there are failing checks, the empty addition isn't necessary. I should add that to the docs, as I realised it isn't there.
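
For context, the plain (non-compatibility) pattern referred to here is scikit-learn's own decorator; a minimal sketch (the FairRepresentationLearner import path is an assumption):

```python
from sklearn.utils.estimator_checks import parametrize_with_checks

from fairlearn.preprocessing import FairRepresentationLearner  # import path assumed

# scikit-learn's native decorator generates one test per estimator check;
# the compatibility wrapper is only needed when specific checks must be xfailed.
@parametrize_with_checks([FairRepresentationLearner()])
def test_sklearn_compatible_estimator(estimator, check):
    check(estimator)
```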

@taharallouche
Member Author

taharallouche commented Jan 10, 2025

EDIT: False alarm

Whoops, it seems scikit-learn 1.6.1 broke our CorrelationRemover and AdversarialFairnessRegressor's compatibility 👀 fyi @TamaraAtanasoska and @adrinjalali

(It's breaking my new estimator as well so I'll be looking into it ...)

@TamaraAtanasoska
Contributor

TamaraAtanasoska commented Jan 10, 2025

hi @taharallouche, the main branch passes fully with sklearn 1.6; there are no issues, I just rechecked locally. It must be something in this one.
I tried to give some suggestions, but I don't think I can see it well without checking the branch out and looking at the details. The TypeError next to the ImportError in validate_data in _fixes.py that you added: does it help to remove that? You can also fall back on the non-compatibility parametrize_with_checks unless you need to xfail some estimator checks.
Otherwise, maybe there is some wisdom in the compat validation methods here, as they include much more than what was added and wasn't necessary before. If I remember right, the initial errors after the pywinpty problem were validation related.
I had to edit because I am sleepy and wrote half a sentence again :D

@TamaraAtanasoska
Contributor

TamaraAtanasoska commented Jan 11, 2025

Yay for those changes fixing most of the issues! It seems there are still some issues with LightGBM; check out pyproject.toml under filterwarnings (like this in my PR) for the way to filter those warnings, as we can't do anything until they make a release.

@TamaraAtanasoska
Contributor

hi @fairlearn/fairlearn-maintainers, this is now ready for a deeper review! I will sign myself up to do one next week; I have already looked into the branch a bit. A second reviewer is necessary. @hildeweerts, as you created the issue, would you be up for a review too?

@TamaraAtanasoska TamaraAtanasoska self-requested a review January 11, 2025 17:23
@hildeweerts hildeweerts self-requested a review January 15, 2025 08:22
Contributor

@hildeweerts hildeweerts left a comment

After a first pass, I’m unsure what a good API for this method should look like and/or how to categorize it properly. Intuitively, I wouldn’t expect a "pre-processing" algorithm to have a “predict” method (or need for a fallback estimator).

According to the paper, the intuition behind the proposed representation learning approach is to pre-process data into an alternative representation, which could theoretically be reused by a different party for classification. Yet, the main experiments (section 4.2) use the probabilities derived from prototypes directly for classification, which seems more like an end-to-end representation learning approach than a pre-processing approach...

The chosen representation, which maps instances in the dataset probabilistically to prototypes, is a bit “weird” in the sense that it's not entirely clear how it would be used directly as input for a particular classification algorithm. In Section 4.3, they seem to only use the probabilities as a representation and not the learned prototypes (please correct me if I'm wrong).

I'm not sure if we need to change the implementation so much as the documentation. E.g. we might want to reconsider calling this a pre-processing method, make it clear in the API docs/user guide how the predicted probabilities are derived, show how intermediate transformations could be used in a pipeline with a different classification algorithm, etc.
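
For concreteness, a hedged sketch of the pipeline usage suggested above (class name and the `sensitive_features` fit parameter are taken from this PR; the import path and data are assumptions):

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.pipeline import Pipeline

from fairlearn.preprocessing import FairRepresentationLearner  # import path assumed

X, y = make_classification(n_samples=300, n_features=5, random_state=0)
A = np.random.default_rng(0).integers(0, 2, size=len(y))  # synthetic sensitive feature

# The intermediate representation feeds a separate downstream classifier.
pipe = Pipeline([
    ("frl", FairRepresentationLearner(n_prototypes=4, max_iter=100)),
    ("clf", RandomForestClassifier(random_state=0)),
])
pipe.fit(X, y, frl__sensitive_features=A)  # fit params are routed to the named step
y_pred = pipe.predict(X)
```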

Thoughts? @fairlearn/fairlearn-maintainers


.. _fair_representation_learner:

Fair Representation Learner
Contributor

I'm not sure how we should name this module. On the one hand, fair representation learning has become a category of fair-ml algorithms in the literature; on the other hand, naming something "FairRepresentationLearner" suggests the representations are actually "fair" in a meaningful way, a claim we generally try to avoid.

Contributor

@TamaraAtanasoska TamaraAtanasoska Jan 24, 2025

Reading the paper I thought about these intermediate representations as the minimal possible representation that balances utility and decoupling as much as possible from the sensitive groups the individuals belong to. They use the term "sanitised" at some point, and although I don't really like the term itself, maybe "Sanitised Intermediate Representation Learner" would fit here? It is a mouthful though, and a bit too on the nose :)

performance.

The model minimizes a loss function that consists of three terms: the reconstruction error,
the classification error, and the statistical-parity error.
Contributor

For consistency across our docs we might consider calling this something like "an approximation of the demographic parity difference".

Suggested change
- the classification error, and the statistical-parity error.
+ the classification error, and an approximation of the demographic parity difference.

The formulation is not exactly the same as DPD, as it considers the difference in probability of mapping to a prototype rather than the target variable, but seems more consistent with the rest of our docs.

n_prototypes : int, default=2
Number of prototypes to use in the latent representation.

Ax : float, default=1.0
Contributor

I think we typically don't really use capitals for parameters in our docs (unless it's an array), perhaps something like alpha / beta / gamma would be more consistent with naming in other classes?

Number of prototypes to use in the latent representation.

Ax : float, default=1.0
Weight for the reconstruction error term in the objective function.
Contributor

Suggested change
- Weight for the reconstruction error term in the objective function.
+ Weight of the reconstruction error term in the objective function.

Weight for the reconstruction error term in the objective function.

Ay : float, default=1.0
Weight for the classification error term in the objective function.
Contributor

Suggested change
- Weight for the classification error term in the objective function.
+ Weight of the classification error term in the objective function.

random_state : int, np.random.RandomState, or None, default=None
Seed or random number generator for reproducibility.

optimizer : Literal["L-BFGS-B", "Nelder-Mead", "Powell", "SLSQP", "TNC", "trust-constr",
Contributor

Do we have any advice on which optimizers make the most sense in what scenarios? (If not that's also perfectly fine)

tol : float, default=1e-6
Convergence tolerance for the optimization algorithm.

max_iter : int, default=1000
Contributor

Similar to above - do we have advice on max_iter for different optimizers?

max_iter : int, default=1000
Maximum number of iterations for the optimization algorithm.

Attributes
Contributor

This made me realize we haven't added attributes to docstrings of any of the other modules, lol... I would consider dropping the user-defined attributes, but perhaps leave the others (n_iter_, etc.).

Should we consider adding attributes to all of our docstrings? @fairlearn/fairlearn-maintainers

Contributor

n_iter_, max_iter, _classes and a few others are part of the necessary scikit-learn compatibility API; I'm not sure if that speaks in favour or not. scikit-learn adds attributes to their docs, and I can see how they make the code more accessible with that extra bit of information available. Tools like Copilot or Cursor can now successfully add these automatically for you, and update them too (of course you'd need to proofread), so it isn't a huge maintenance burden. I personally am ok with either way.

_fall_back_classifier : LogisticRegression or None
Fallback classifier used when no sensitive features are provided.

Methods
Contributor

This part is not really necessary IMO as each method has its own docstring.

Contributor

I agree! Also, the rendered documentation exposes all public methods automatically.

The target values.

sensitive_features : array-like or None, default=None
Sensitive features to be considered whose groups will be used to enforce statistical
Contributor

Suggested change
- Sensitive features to be considered whose groups will be used to enforce statistical
+ Sensitive features to be considered whose groups will be used to enforce demographic

Contributor

Enforce sounds perhaps a bit stronger than what the method does, perhaps something like improve/promote/increase/etc.?

@TamaraAtanasoska
Contributor

Before I start the review, because I believe this PR will be affected too: I am working on a PR that will address the codecov issues #1448 (comment) by adding a CI sklearn-compatibility job, as codecov is complaining because we are not testing with both versions, so not all code is run #1485 (comment). I will make sure I have it proposed before I leave for the two weeks, but it might need to be merged before this one.

Contributor

@TamaraAtanasoska TamaraAtanasoska left a comment

Thank you again for all the hard work! Since we cleared out most of the sklearn issues before, I have general stylistic nitpicks and a few questions. I want to think a bit about whether we could test something else too, but we can also add more tests in a subsequent PR as we discuss reproducibility further for all the methods.

FairRepresentationLearner(max_iter=10, n_prototypes=4)
>>> X_train_transformed = frl.transform(X_train)
>>> X_test_transformed = frl.transform(X_test)
>>> y_hat = frl.predict(X_test)
Contributor

Since I know that you are really great with these things, is there any visualisation/plotting that could be interesting to add here? Something simple?


expect_sensitive_features=False,
enforce_binary_labels=True,
)
assert sensitive_features is None or isinstance(sensitive_features, pd.Series)
Contributor

From a best-practices perspective, and given the limitations that some testing libraries bring, it is best when assert is used only for testing and debugging. Erroring out gently when the code shouldn't continue, instead of breaking abruptly, would be better, or an if/else could maybe substitute. That could merge with line 285 maybe?
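
As an illustration, a sketch of the gentler check (the helper name and error message wording are hypothetical, not part of the PR):

```python
import pandas as pd

def _check_sensitive_features(sensitive_features):
    # Hypothetical helper: raise a descriptive error instead of asserting.
    if sensitive_features is not None and not isinstance(sensitive_features, pd.Series):
        raise ValueError(
            "sensitive_features must be a pandas Series or None, got "
            f"{type(sensitive_features).__name__}."
        )
    return sensitive_features

_check_sensitive_features(pd.Series([0, 1, 0]))  # passes
_check_sensitive_features(None)                  # passes
```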

Contributor

Another question here before reading the rest: why does the type need to be a Series? Asking for future refactorings.

self, X, y, sensitive_features: pd.Series, random_state: np.random.RandomState
):
"""
Minimize the loss given the sensitive features.
Contributor

I was reading the code and slowly putting the method together in my mind as I read, and I thought that maybe two or three sentences summarising the "how" of the optimisation, added as a docstring under the existing one, might be nice for our future selves :)

+ self._prototype_dim # alpha: the weight of each dimension in the distance computation
)

def objective(x: np.ndarray, X, y) -> float:
Contributor

This might be a personal preference, but I find the function a tad too long to be a nested one. It could exist right above as a private method, which would make the optimisation a bit more readable.

)

def objective(x: np.ndarray, X, y) -> float:
assert x.shape == (self._optimizer_size,)
Contributor

same for the assert here as above

Az: float = 1.0,
random_state: int | np.random.RandomState | None = None,
optimizer: Literal[
"L-BFGS-B", "Nelder-Mead", "Powell", "SLSQP", "TNC", "trust-constr", "COBYLA", "COBYQA"
Contributor

I remember the mention of L-BFGS for optimisation, but where do the other names come from? I see them in the scipy documentation; are they just the compatible alternatives?
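
For reference, these names are all valid values of the `method` argument of `scipy.optimize.minimize`, which is presumably what the string is forwarded to; a toy sketch:

```python
import numpy as np
from scipy.optimize import minimize

# Any of the listed names can be passed as `method`;
# "L-BFGS-B" is the one mentioned in the paper.
result = minimize(
    fun=lambda x: np.sum((x - 1.0) ** 2),  # toy objective
    x0=np.zeros(3),
    method="L-BFGS-B",  # could equally be "Powell", "SLSQP", "TNC", ...
    options={"maxiter": 1000},
)
print(result.x)  # ~[1., 1., 1.]
```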

options={"maxiter": self.max_iter},
)
except Exception as optimization_error:
raise RuntimeError("The loss minimization failed.") from optimization_error
Contributor

Can this error message be improved by adding what folks can do to avoid the failure, for example increasing max_iter?
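
For example, a hedged sketch of a more actionable message (the helper name and wording are hypothetical, not part of the PR):

```python
import numpy as np
from scipy.optimize import minimize

def _minimize_loss(objective, x0, optimizer, max_iter, tol):
    # Hypothetical helper mirroring the PR's try/except around scipy's minimize,
    # but pointing users to the knobs they can turn when optimization fails.
    try:
        return minimize(objective, x0, method=optimizer, tol=tol, options={"maxiter": max_iter})
    except Exception as optimization_error:
        raise RuntimeError(
            "The loss minimization failed. Consider increasing `max_iter`, "
            "loosening `tol`, or trying a different `optimizer`."
        ) from optimization_error

result = _minimize_loss(lambda x: np.sum(x**2), np.ones(3), "L-BFGS-B", 1000, 1e-6)
```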
