Numerical Comparison of Bandit Algorithms for Best-arm Identification

Project for the Reinforcement Learning course by A. Lazaric at MVA

Description of the project

The objective of the project is to provide a thorough comparison among different best-arm identification algorithms in the settings of fixed budget and fixed confidence. Beside reviewing the current literature, the student is expected to produce a Matlab code which allows to easily implement and compare additional algorithms. An example of a code which could serve as a basis for this project is available at http://mloss.org/software/view/415.

About

The code is written in Matlab. Its structure is based on the structure of the maBandits toolbox available at http://mloss.org/software/view/415.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
arms		arms
policies		policies
.gitattributes		.gitattributes
.gitignore		.gitignore
BestArmIdentification.zip		BestArmIdentification.zip
README.md		README.md
comparison_budget.m		comparison_budget.m
comparison_confidence.m		comparison_confidence.m
experiment.m		experiment.m
plotResults.m		plotResults.m
setup.m		setup.m

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Numerical Comparison of Bandit Algorithms for Best-arm Identification

Description of the project

About

About

Releases

Packages

Languages

Evarin/BestArmIdentification

Folders and files

Latest commit

History

Repository files navigation

Numerical Comparison of Bandit Algorithms for Best-arm Identification

Description of the project

About

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages