make performance test reproducible #837

sonichi · 2022-12-04T05:44:19Z

Why are these changes needed?

Make the performance test in test_notebook_example.py reproducible.
Improve the retrain logic as in #829.

Related issue number

Closes #829, #794, #777

Checks

I've used pre-commit to lint the changes in this PR, or I've made sure lint with flake8 output is two 0s.
I've included any doc changes needed for https://microsoft.github.io/FLAML/. See https://microsoft.github.io/FLAML/docs/Contribute#documentation to build and test documentation locally.
I've added tests (if relevant) corresponding to the changes introduced in this PR.
I've made sure all auto checks have passed.

markharley · 2022-12-04T18:45:03Z

flaml/model.py

@@ -1459,6 +1463,8 @@ def config2params(self, config: dict) -> dict:
            )
        if self._task not in CLASSIFICATION and "criterion" in config:
            params.pop("criterion")
+        if "random_state" not in params:
+            params["random_state"] = 12032022


Seems odd to fix the random state here, might be surprising for users of the class. Maybe we can consider setting it for the test?

The random state for other estimators is fixed. For random forest and extra trees it's not. I consider this a hidden bug that prevents reproducibility. I agree it could be surprising for users. I'll add it in documentation.

Okay, makes sense 👍

qingyun-wu · 2022-12-05T17:18:22Z

flaml/automl.py

-        time_left = self._state.time_budget - self._state.time_from_start
+        time_budget_s = (
+            self._state.time_budget - self._state.time_from_start
+            if self._state.time_budget < 1e10


Explain in the documentation somewhere about this large constant?

removed it.

qingyun-wu · 2022-12-05T17:20:34Z

flaml/automl.py

@@ -648,6 +652,7 @@ def custom_metric(
                datasets, but will incur more overhead in time.
                If dict: the dict contains the keywords arguments to be passed to
                [ray.tune.run](https://docs.ray.io/en/latest/tune/api_docs/execution.html).
+            free_mem_ratio: float between 0 and 1, default=0. The free memory ratio to keep during training.


Provide a guideline (could be done in another PR) on when one should consider setting this ratio.

issue #841 created.

liususan091219 · 2022-12-05T17:45:23Z

flaml/automl.py

    ):
-        self.init_eci = learner_class.cost_relative2lgbm()
+        self.init_eci = learner_class.cost_relative2lgbm() if budget < 1e10 else 1


Define 1e10 as a constant variable?

removed it.

Related work items: microsoft#493, microsoft#777, microsoft#820, microsoft#837, microsoft#843, microsoft#848, microsoft#849, microsoft#850, microsoft#853, microsoft#855, microsoft#857, microsoft#869, microsoft#870, microsoft#888, microsoft#894, microsoft#923, microsoft#924, microsoft#925, microsoft#934, microsoft#952, microsoft#962, microsoft#973, microsoft#975, microsoft#995

sonichi added 2 commits December 4, 2022 05:42

make performance test reproducible

b287992

fix test error

21b09b4

sonichi requested review from int-chaos, liususan091219 and qingyun-wu December 4, 2022 16:42

Doc update and disable logging

5b5da0c

markharley reviewed Dec 4, 2022

View reviewed changes

document random_state and version

86dfcc0

markharley approved these changes Dec 4, 2022

View reviewed changes

sonichi requested a review from skzhang1 December 5, 2022 00:37

sonichi linked an issue Dec 5, 2022 that may be closed by this pull request

Performance test fails due to runtime variance #794

Closed

qingyun-wu reviewed Dec 5, 2022

View reviewed changes

liususan091219 requested changes Dec 5, 2022

View reviewed changes

int-chaos approved these changes Dec 5, 2022

View reviewed changes

sonichi mentioned this pull request Dec 5, 2022

Provide a guideline on when one should consider setting free_mem_ratio #841

Closed

sonichi added 2 commits December 5, 2022 23:58

remove hardcoded budget

3bc0909

fix test error and dependency; close #777

ceae514

sonichi linked an issue Dec 6, 2022 that may be closed by this pull request

Handle the case "y_train is a series while X_train is a numpy array" #777

Closed

sonichi requested review from thinkall and shreyas36 December 6, 2022 03:52

qingyun-wu approved these changes Dec 6, 2022

View reviewed changes

iloc

efbc814

liususan091219 approved these changes Dec 6, 2022

View reviewed changes

sonichi merged commit 92b7922 into main Dec 6, 2022

sonichi deleted the repro branch December 6, 2022 18:13

This was referenced Dec 6, 2022

disable tft logger #836

Closed

Change cost_attr when max_iter is used as the stopping criterion instead of time_budget in AutoML.fit #789

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

make performance test reproducible #837

make performance test reproducible #837

sonichi commented Dec 4, 2022 •

edited

Loading

markharley Dec 4, 2022

sonichi Dec 4, 2022

markharley Dec 4, 2022

qingyun-wu Dec 5, 2022

sonichi Dec 6, 2022

qingyun-wu Dec 5, 2022

sonichi Dec 5, 2022

liususan091219 Dec 5, 2022

sonichi Dec 6, 2022

make performance test reproducible #837

make performance test reproducible #837

Conversation

sonichi commented Dec 4, 2022 • edited Loading

Why are these changes needed?

Related issue number

Checks

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

sonichi commented Dec 4, 2022 •

edited

Loading