Hyperparameter Optimization with Ray Tune and Weights and Biases #76

Merged · 30 commits · Nov 14, 2022

Conversation

@ayulockin (Contributor) commented Oct 28, 2022

Hyperparameter optimization for TRLX:

Stack:

  • Ray Tune for distributed hyperparameter optimization.
  • Weights and Biases for tracking and for finding the best hyperparameters.

Context:

The hyperparameter optimization system is designed around the structure of the provided examples/. For now, I have built the system around examples/ppo_sentiments.py.

How to use it?

Setup search space

In Ray Tune's terminology we need to define a param_space, which is passed to tune.Tuner(...). To define a param_space:

  • Copy the default config provided in configs/*.yml to configs/ray_tune_configs/my_search_config.yml.
  • Use strategy and values to define the space as shown below:
    seq_length:  # Size of LM context
        strategy: "choice"
        values: [36, 48, 52]
    
    The allowed strategies are based on what's available in Ray Tune as per the docs here. The strategy and values are parsed by the get_param_space function (see the sketch below).
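For illustration, a minimal sketch of how such parsing might look, assuming Ray 2.x (the actual get_param_space lives under trlx/ray_tune/; the set of handled strategies and the error handling here are illustrative):

    from ray import tune

    def get_param_space(config: dict) -> dict:
        """Translate strategy/values entries from the YAML config into Ray Tune search spaces."""
        param_space = {}
        for name, spec in config.items():
            if not isinstance(spec, dict) or "strategy" not in spec:
                param_space[name] = spec  # plain value: keep as a fixed hyperparameter
                continue
            strategy, values = spec["strategy"], spec["values"]
            if strategy == "choice":
                param_space[name] = tune.choice(values)       # e.g. [36, 48, 52]
            elif strategy == "uniform":
                param_space[name] = tune.uniform(*values)     # values = [low, high]
            elif strategy == "loguniform":
                param_space[name] = tune.loguniform(*values)  # values = [low, high]
            else:
                raise ValueError(f"Unsupported strategy: {strategy}")
        return param_space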

Setup tune config

The tune config describes the search algorithm, scheduler, metric and mode. This is defined in the configs/ray_tune_configs/my_search_config.yml itself.

  • Search Algo: Bayesian Optimization with Hyperband (BOHB) is tested and working. Use bohb to enable it. For random search use random.
  • Scheduler: BOHB requires [HyperBandForBOHB](https://docs.ray.io/en/latest/tune/api_docs/schedulers.html#tune-scheduler-bohb); pass hyperbandforbohb to enable it. By default it uses fifo. A sketch of how these options map onto Ray Tune objects follows this list.
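This is only a sketch assuming Ray 2.x; the metric name mean_reward is a placeholder for whatever the training function actually reports:

    from ray import tune
    from ray.tune.schedulers import HyperBandForBOHB
    from ray.tune.search.bohb import TuneBOHB

    # search_algo: "bohb" -> TuneBOHB (requires the ConfigSpace/hpbandster packages);
    # "random" -> leave search_alg unset to fall back to random/grid sampling.
    search_alg = TuneBOHB(metric="mean_reward", mode="max")

    # scheduler: "hyperbandforbohb" -> HyperBandForBOHB; the default "fifo" is FIFOScheduler.
    scheduler = HyperBandForBOHB(time_attr="training_iteration", metric="mean_reward", mode="max")

    tune_config = tune.TuneConfig(
        search_alg=search_alg,
        scheduler=scheduler,
        num_samples=8,  # number of trials to sample
    )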

The training function

The training function takes a config argument. It's the ppo_sentiments.py example packaged as a function. Multiple training functions can be implemented using the same format.
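In outline, such a trainable looks like the sketch below; the loop body and the mean_reward metric name are stand-ins, assuming Ray 2.x's session.report API:

    from ray.air import session

    def ppo_sentiments_train(config: dict):
        """Ray Tune trainable: config holds one sampled set of hyperparameters."""
        # In the real function, `config` is merged into the default PPO config and
        # the training loop from examples/ppo_sentiments.py runs here.
        for _ in range(10):
            mean_reward = 0.0  # stand-in for the reward computed during evaluation
            # Intermediate reports let schedulers like HyperBandForBOHB stop weak trials early.
            session.report({"mean_reward": mean_reward})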

The train_sweep.py file

The main logic is implemented here: loading the YAML config, building the search space, launching the trials, and logging the results to W&B.
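A rough sketch of that flow, assuming Ray 2.x; get_tune_config and get_train_function are illustrative helper names, while get_param_space and log_trials are the functions sketched elsewhere in this description:

    import argparse
    import yaml
    from ray import tune

    if __name__ == "__main__":
        parser = argparse.ArgumentParser()
        parser.add_argument("--config", required=True, help="YAML with the search space and tune config")
        parser.add_argument("--example-name", required=True, help="which example's training function to sweep")
        args = parser.parse_args()

        with open(args.config) as f:
            config = yaml.safe_load(f)

        param_space = get_param_space(config)              # strategy/values -> tune.* objects
        tune_config = get_tune_config(config)              # search algo, scheduler, metric, mode
        trainable = get_train_function(args.example_name)  # e.g. the ppo_sentiments trainable

        tuner = tune.Tuner(
            tune.with_resources(trainable, {"cpu": 4, "gpu": 1}),  # one GPU per trial
            param_space=param_space,
            tune_config=tune_config,
        )
        results = tuner.fit()
        log_trials(results, project_name="trlx-sweeps")  # project name is a placeholder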

Usage

python train_sweep.py --config configs/ray_tune_configs/ppo_config.yml --example-name ppo_sentiments

Notable changes

  • W&B was initialized inside accelerate_base_model.py; I have moved it to trlx.py.
  • W&B logging is disabled when ray.is_initialized (see the guard sketched after this list). Ray Tune launches multiple concurrent trials (experiments), and W&B was erroring out for a few trials; a new run would then be initialized with metrics logged to the errored-out run. It was not an issue with random or grid search, only with Bayesian search.
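A minimal sketch of that guard, assuming the W&B setup now lives in trlx.py; maybe_init_wandb is an illustrative name, not the actual helper:

    import ray
    import wandb

    def maybe_init_wandb(project: str, config: dict):
        """Start a W&B run only when not running under Ray Tune."""
        if ray.is_initialized():  # true inside Ray Tune trial workers
            return None  # skip live logging; metrics are logged after the sweep via log_trials
        return wandb.init(project=project, config=config)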

How is W&B tracking done?

Once all the trials are done, the metrics are logged using the log_trials function (sketched below). Since we are not tracking live, we will not have access to system metrics. But since the goal is to find the best hyperparameters and keep track of the experiments, I log the metrics after the experiments are done. This also keeps it flexible for any search algorithm, especially those that are hard to parallelize, like Bayesian optimization.
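A minimal sketch of what this post-hoc logging might look like; the actual log_trials lives in trlx/ray_tune/wandb.py, and the fields shown here are illustrative:

    import wandb

    def log_trials(results, project_name: str):
        """Create one W&B run per finished trial and log its config and final metrics.

        `results` is the ResultGrid returned by tune.Tuner(...).fit().
        """
        for result in results:
            run = wandb.init(
                project=project_name,
                config=result.config,  # the sampled hyperparameters for this trial
                reinit=True,           # allow multiple runs from the same process
            )
            run.log(result.metrics)    # last reported metrics for the trial
            run.finish()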

Result

The experiments are logged to the W&B project_name; example here. We can easily convert it to a W&B sweep using the UI.

[Screen recording: Screen.Recording.2022-11-02.at.6.15.52.PM.mov]

The most relevant hyperparameters

We can identify them using the parameter importance chart.

[Image: parameter importance chart]

We can easily find the best parameters from the parallel coordinates plot.

[Screen recording: Screen.Recording.2022-11-02.at.6.20.20.PM.mov]

TODOs:

  • The example currently runs with a single GPU per trial. We can easily provision multiple GPUs for each trial, but I am working on distributing the training across them. The recommended solution is to use Ray Train, as suggested here.
  • Automatically build a W&B report with the analysis of the hyperparameter optimization job.

@LouisCastricato LouisCastricato requested review from Dahoas and maxreciprocate and removed request for Dahoas October 28, 2022 15:42
@ayulockin ayulockin marked this pull request as ready for review November 2, 2022 12:55
@maxreciprocate (Collaborator) left a comment

Overall this looks great, thank you for the effort!

I have a few more general questions about these changes.

W&B was initialized inside accelerate_base_model.py; I have moved it to trlx.py.

Can you explain why this had to be done?

W&B logging is disabled when ray.is_initialized. Ray Tune launches multiple concurrent trials (experiments), and W&B was erroring out for a few trials; a new run would then be initialized with metrics logged to the errored-out run. It was not an issue with random or grid search, only with Bayesian search.

It seems to me that Hyperband as-is is a bit crippled, and maybe any other trial scheduler as well, since it keeps "pausing" runs, which effectively terminates them and then later restarts those runs from scratch. Have you looked into some way of customizing this process to resume from checkpoints? Also, could something similar be done for the wandb state?

Can you also help me with how to select the parameter space for sweeps in W&B, since it doesn't pick up the correct ones? I follow your steps from the videos, but I still have to manually change the sweep configuration. Did you do something special between your first video and the second one?

Review threads (all resolved) on: trlx/ray_tune/__init__.py, trlx/ray_tune/wandb.py, trlx/trlx.py, trlx/model/accelerate_base_model.py, train_sweep.py, configs/ray_tune_configs/ppo_config.yml.
@ayulockin (Contributor, Author)

Thank you for the review.

Can you also help me with how to select the parameter space for sweeps in W&B, since it doesn't pick up the correct ones? I follow your steps from the videos, but I still have to manually change the sweep configuration. Did you do something special between your first video and the second one?

Hey @reciprocated, to answer this quickly: I am adding a feature that will generate a W&B report with the relevant charts, so that you or any other user will not have to do this manually. I am working on it.

@ayulockin (Contributor, Author)

Hey @reciprocated, my latest commit adds the ability to create a W&B report automatically. I will share the result once the training is done.

@LouisCastricato LouisCastricato merged commit 59b3c65 into CarperAI:main Nov 14, 2022