
Distributed, Multi-Objective Bayesian Optimisation using Ax, BoTorch, RayTune and PyTorch Lightning 🚀

Sidharrth Nagappan
University of Cambridge
sn666[at]cam.ac.uk


This work was done as part of the Large-Scale Data Processing course at the University of Cambridge.

Abstract

This project details and implements Multi-Objective Bayesian Optimisation using Ax and BoTorch, with RayTune as the distributed backbone. Notably, it is the first effort to integrate multi-objective BoTorch directly into Ray Tune via the Ax Service API, by modifying RayTune's AxSearch class to support custom experiments and multiple objectives. It is also the first work to adapt RayTune's existing Hyperband schedulers -- such as the Asynchronous Successive Halving Algorithm (ASHA) -- to multi-objective settings. The empirical results compare critical hyperparameters for these multi-objective schedulers, and show that they can reduce runtime by up to 51% while keeping the final hypervolume within 2% of the first-in-first-out (FIFO) scheduling baseline, measured on the University of Cambridge Department of Computer Science's GPU server across 5 GPUs. Hypervolumes and Pareto fronts for dual- and triple-objective optimisation settings are also compared and analysed. The complete source code -- encompassing custom search algorithms, schedulers and training scripts -- is made publicly available.
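For context, the core of the Ax Service API integration looks roughly like the following. This is a minimal sketch, not the exact experiment definition from this repository: the metric names and thresholds mirror the run command further below, while the search-space parameters (lr, hidden_size) are purely illustrative.

# Minimal sketch of a multi-objective experiment via the Ax Service API.
# Metric names/thresholds mirror the run command below; the search-space
# parameters here are hypothetical.
from ax.service.ax_client import AxClient, ObjectiveProperties

ax_client = AxClient()
ax_client.create_experiment(
    name="accuracy_vs_model_size",  # illustrative experiment name
    parameters=[
        {"name": "lr", "type": "range", "bounds": [1e-4, 1e-1], "log_scale": True},
        {"name": "hidden_size", "type": "range", "bounds": [32, 512]},
    ],
    objectives={
        # maximise validation accuracy, with a reference threshold
        "ptl/val_accuracy": ObjectiveProperties(minimize=False, threshold=0.90),
        # minimise the parameter count below the reference point
        "ptl/model_params": ObjectiveProperties(minimize=True, threshold=100_000),
    },
)

The objective thresholds double as the reference point for hypervolume computation, which is why the same values appear as CLI flags in the run command.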

(Architecture diagram)

How to Run

Install the required packages using your favourite environment manager:

pip install -r requirements.txt

Runs are launched through the shell script final_runs.sh, which in turn calls ax_multiobjective.py:

sh final_runs.sh

Alternatively, you can run a single multi-objective experiment for the competing objectives of validation accuracy and model size:

CUDA_VISIBLE_DEVICES=0,1,2,3,6 python3 ax_multiobjective.py --num_samples 25 \
    --max_num_epochs 8 \
    --objective_1 ptl/val_accuracy \
    --objective_1_type max \
    --objective_1_threshold 0.90 \
    --objective_2 ptl/model_params \
    --objective_2_threshold 100000 \
    --objective_2_type min \
    --max_concurrent 10 \
    --accelerator gpu \
    --data_path /home/sn666/large-scale-data-processing/miniproject/data \
    --use_scheduler \
    --scheduler_max_t 8 \
    --scheduler_grace_period 2 \
    --scheduler_reduction_factor 6 \
    --remark 8e/moasha-epsnet/maxt8gr2red6/maxaccminparam \
    --use_scaling_config \
    --results_folder final_results_7jan | tee -a "$LOG_FILE"
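Internally, this amounts to handing a pre-configured multi-objective AxClient to Ray Tune. The following is a minimal sketch assuming Ray 2.x; AxSearch here stands in for this repository's modified class (the stock one expects a single metric), and train_fn and the resource numbers are illustrative only.

# Sketch: plugging the multi-objective AxClient from the earlier sketch
# into Ray Tune. Assumes Ray >= 2.7 for the `train.report` API.
from ray import train, tune
from ray.tune.search import ConcurrencyLimiter
from ray.tune.search.ax import AxSearch

def train_fn(config):
    # stand-in for the PyTorch Lightning training loop: report both
    # objectives so the searcher and scheduler can see them
    train.report({"ptl/val_accuracy": 0.90, "ptl/model_params": 50_000})

search_alg = ConcurrencyLimiter(
    AxSearch(ax_client=ax_client),  # pre-configured multi-objective client
    max_concurrent=10,              # mirrors --max_concurrent above
)

tuner = tune.Tuner(
    tune.with_resources(train_fn, {"gpu": 1}),
    tune_config=tune.TuneConfig(search_alg=search_alg, num_samples=25),
)
results = tuner.fit()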

Acknowledgements

The ideas for this work were sparked by the following GitHub issues discussing multi-objective optimisation in RayTune:

  1. ray-project/ray#8018
  2. ray-project/ray#32534

Multi-Objective Asynchronous Successive Halving was inspired by the following paper: https://arxiv.org/pdf/2106.12639
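As a rough illustration of the idea (not this repository's exact implementation), a multi-objective halving rule replaces single-metric ranking with Pareto non-domination when deciding which trials to promote; the paper's ε-net step, echoed in the moasha-epsnet remark in the run command above, then thins the non-dominated set further. All objectives are assumed to be minimised here; flip signs for maximisation.

# Illustrative core of a multi-objective halving rule: keep the
# non-dominated trials instead of ranking by a single metric.
def dominates(a, b):
    """True if objective vector a Pareto-dominates b (all minimised)."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))

def non_dominated(trials):
    """Return the trials not dominated by any other trial."""
    return [
        t for t in trials
        if not any(dominates(o, t) for o in trials if o is not t)
    ]

# e.g. (val_error, model_params): the second trial is dominated
# by the first and would be stopped early
trials = [(0.08, 90_000), (0.12, 120_000), (0.10, 60_000)]
print(non_dominated(trials))  # [(0.08, 90000), (0.10, 60000)]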

References

  1. Ax: Adaptive Experimentation Platform
  2. BoTorch: Bayesian Optimization in PyTorch
  3. RayTune: Scalable Hyperparameter Tuning
  4. PyTorch Lightning: The Keras for ML Researchers
  5. Hyperband: A Novel Bandit-Based Approach to Hyperparameter Optimization
  6. Asynchronous Successive Halving
  7. MO-HB: A Multi-Objective Successive Halving Algorithm

For any questions, please reach out to me at sn666[at]cam.ac.uk
