Flaml: fix lgbm reproducibility #1369
Conversation
Thank you so much for the PR, @dannycg1996!
Co-authored-by: Li Jiang <[email protected]>
Hi @thinkall, thanks for taking the time to explain - I understand why you wish to keep the early stopping!
Thank you sooooooooo much, @dannycg1996! LGTM!
Ah, it seems there's an issue with this test I added. That's fine - we know it isn't always possible to reproduce scores exactly where early stopping is concerned! I'll tweak the test to ensure that our reproduced loss is at least as good as the loss reported by FLAML, and then it should be re-approvable/mergeable!
…AML results, when LGBM earlystopping is involved
Sorry about that @thinkall, I think it should be approvable and mergeable now!
Why are these changes needed?
In some cases, FLAML's LGBM results are not reproducible using the best model FLAML provides. The problem, and the solution, splits into two parts:
1. n_estimators on automl.model.model is always 1

As described in the title, the line `self._model.set_params(n_estimators=best_iteration + 1)` in `LGBMEstimator.fit()` causes the `n_estimators` value to always be 1. I'm not sure why `self._model.best_iteration_` is always 0 - I think it's due to using early stopping? Regardless, we have no need to overwrite a correct `n_estimators` value, so I've removed this code (and my unit tests now pass!). See the sketch below for the consequence of the old behaviour.
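A simplified sketch of the consequence (illustrative only - this is not FLAML's internals; the model and data here are made up for demonstration):

```python
import lightgbm as lgb
from sklearn.datasets import make_regression
from sklearn.metrics import mean_squared_error

X, y = make_regression(n_samples=500, noise=1.0, random_state=0)
original = lgb.LGBMRegressor(n_estimators=100, random_state=0).fit(X, y)

# Simulate the bug: best_iteration_ reported as 0, so the stored params
# end up with n_estimators = 0 + 1 = 1.
params = {**original.get_params(), "n_estimators": 1}
truncated = lgb.LGBMRegressor(**params).fit(X, y)

print(mean_squared_error(y, original.predict(X)))   # loss of the real model
print(mean_squared_error(y, truncated.predict(X)))  # far worse: a single tree
```

Anyone refitting the returned model from its own `get_params()` would train a one-tree model and get a very different loss from the one FLAML reported.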
2. When a time_budget is set, LGBMEstimator is sometimes not reproducible if the deadline is running out when the estimator is trained

This is very similar to the issue we recently resolved for the CatBoost model. Basically, the time-budget callback changes how a model is trained when the budget is nearly exhausted, which makes the reported loss impossible to reproduce. I've fixed this by simply deleting the if statement responsible - I think this is a clean solution. The sketch below shows why this kind of callback breaks reproducibility.
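A simplified sketch (the callback below is illustrative, not FLAML's actual code): stopping on wall-clock time ties the number of boosting rounds to machine speed and remaining budget, so the same configuration can produce different models on different runs.

```python
import time
import lightgbm as lgb
from sklearn.datasets import make_regression

def time_budget_stopper(deadline):
    def _callback(env):
        if time.monotonic() > deadline:
            # Abort mid-training: how many trees get built now depends on
            # wall-clock time rather than on the recorded configuration.
            raise lgb.callback.EarlyStopException(env.iteration, env.evaluation_result_list)
    return _callback

X, y = make_regression(n_samples=5000, random_state=0)
model = lgb.LGBMRegressor(n_estimators=1000, random_state=0)
model.fit(X, y, callbacks=[time_budget_stopper(time.monotonic() + 0.05)])

# The number of trees actually trained varies from machine to machine.
print(model.n_estimators, "requested;", model.booster_.num_trees(), "trained")
```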
Like with the CatBoost fix, I've also added a unit test, which fails if the if statement above is re-added to the codebase. This will help us catch the issue if it is ever reintroduced, and it allows others to test the issue should they wish. A sketch of the test's shape follows below.
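Roughly, the test takes this shape (a simplified sketch - the dataset, metric, and budget here are illustrative rather than the exact test code): refit a fresh model from the best model's params and require the reproduced loss to be no worse than the loss FLAML reported.

```python
import lightgbm as lgb
from flaml import AutoML
from sklearn.datasets import make_regression
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split

X, y = make_regression(n_samples=2000, noise=5.0, random_state=0)
X_train, X_val, y_train, y_val = train_test_split(X, y, random_state=0)

automl = AutoML()
automl.fit(
    X_train=X_train, y_train=y_train, X_val=X_val, y_val=y_val,
    task="regression", metric="mse", estimator_list=["lgbm"], time_budget=5,
)

# Refit a fresh estimator with exactly the params of the returned best model.
reproduced = lgb.LGBMRegressor(**automl.model.model.get_params())
reproduced.fit(X_train, y_train)
reproduced_loss = mean_squared_error(y_val, reproduced.predict(X_val))

# Exact equality isn't guaranteed when early stopping is involved, so only
# assert that the reproduced loss is at least as good as the reported one.
assert reproduced_loss <= automl.best_loss
```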
Also, am I okay to open an issue sometime to refactor this LGBMEstimator? There's a lot of defunct code, and it could be made both more readable and possibly more efficient.
Anyway, please let me know what you think!
Related issue number
Closes #1368
Checks