This repository contains the source code for the paper Long-Term Visitation Value for Deep Exploration in Sparse Reward Reinforcement Learning.
examples
is for Section 3.ql
is for Sections 4.1, 4.2.1, 4.2.2, and 4.2.3.inf_hor
is for Section 4.2.4.ql_noise
is for Section 4.2.5.dyna_vs_ql
is for Appendix A.5.- python scripts are to reproduce all plots and tables in Section 4 and Appendix A.
For export_fig
you need this.