Skip to content

svmgrg/fma-pg

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

67 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Code for the paper A general class of surrogate functions for stable and efficient reinforcement learning (https://arxiv.org/abs/2108.05828) published at AISTATS 2022.

Runs exact (non-stochastic) tabular methods from the FMA-PG framework. This code requires Numpy, Scipy, and Matplotlib to run.

The "current_code" folder contains the main code for running the tabular experiments given in the paper.

  • independent code for the CliffWorld and DeepSeaTreasure environments
  • code for tabular sMDPO (called sPPO in the code for now), MDPO, TRPO, and sPPO
  • code for running ablation experiments (regularized + fixed stepsize, regularized + line search, constrained + line search)
  • code for generating sensitivity plots/learning curvse and running scripts

The "notes" folder essentially contains the rough draft of appendix E and F of the main paper.

About

Code for testing FMA-PG framework.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published