Action Value Gradient Algorithm

This repo provides an implementation of the following incremental learning algorithms:

Action Value Gradient (AVG)
Incremental One-Step Actor-Critic (IAC)
Incremental Soft Actor Critic (SAC-1)

python avg.py --env "Humanoid-v4" --N 10001000

Robot Tasks

UR-Reacher-2	Create-Mover

Hyper-parameter search

AVG

cd incremental_rl
python hyp_sweep.py --algo "avg" --hyp_seed 122 --env "Hopper-v4" --N 10001000 --n_seeds 10
python replicate_run.py --algo "avg_norm_obs_scaled_td" --hyp_seed 129 --env "Ant-v4" --N 10001000

Incremental Actor Critic

cd incremental_rl
python hyp_sweep.py --algo "iac" --hyp_seed 122 --env "Hopper-v4" --N 10001000 --n_seeds 10
python replicate_run.py --algo "iac_all" --hyp_seed 294 --env "Hopper-v4" --N 10001000

Incremental Soft Actor Critic

cd incremental_rl
python hyp_sweep.py --algo "isac" --hyp_seed 146 --env "HalfCheetah-v4" --N 10001000

Cite

@inproceedings{vasan2024deep,
  title={Deep Policy Gradient Methods Without Batch Updates, Target Networks, or Replay Buffers},
  author={Vasan, Gautham and Elsayed, Mohamed and Azimi, Seyed Alireza and He, Jiamin and Shahriar, Fahim and Bellinger, Colin and White, Martha and Mahmood, A Rupam},
  booktitle={The Thirty-eighth Annual Conference on Neural Information Processing Systems},
  year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
assets		assets
incremental_rl		incremental_rl
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
avg.py		avg.py
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Action Value Gradient Algorithm

Robot Tasks

Hyper-parameter search

Cite

About

Releases

Packages

Languages

License

gauthamvasan/avg

Folders and files

Latest commit

History

Repository files navigation

Action Value Gradient Algorithm

Robot Tasks

Hyper-parameter search

Cite

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages