DDPG Humanoid

We ran experiments to evaluate the DDPG algorithm on the OpenAI Gym Humanoid-v2 environment, building on Lillicrap et al. (2016) and Plappert et al. (2018). Lillicrap et al. (2016) introduce the DDPG algorithm, which explores by adding noise in the action space, sampled from an Ornstein-Uhlenbeck process. Plappert et al. (2018) propose a possible improvement: adding the noise in the parameter space instead.
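The two exploration strategies can be illustrated with a minimal NumPy sketch. This is not the code in `ddpg_agent.py` or `model.py`: the class `OUNoise`, the helpers `perturb_parameters` and `toy_policy`, the hyperparameters (`theta`, `sigma`, `stddev`), and the layer sizes are all assumptions chosen for illustration. The adaptive noise scaling described by Plappert et al. (2018) is omitted.

```python
import numpy as np


class OUNoise:
    """Ornstein-Uhlenbeck process: temporally correlated, mean-reverting noise."""

    def __init__(self, size, mu=0.0, theta=0.15, sigma=0.2, dt=1.0, seed=0):
        # theta, sigma and dt are illustrative defaults, not values from the report
        self.mu = mu * np.ones(size)
        self.theta = theta
        self.sigma = sigma
        self.dt = dt
        self.rng = np.random.default_rng(seed)
        self.reset()

    def reset(self):
        # Restart the process at its mean (typically at the start of an episode)
        self.state = self.mu.copy()

    def sample(self):
        # Euler-Maruyama step: pull towards mu, then add scaled Gaussian noise
        drift = self.theta * (self.mu - self.state) * self.dt
        diffusion = self.sigma * np.sqrt(self.dt) * self.rng.standard_normal(self.state.shape)
        self.state = self.state + drift + diffusion
        return self.state


def perturb_parameters(weights, stddev, rng):
    """Parameter-space noise: return noisy copies of the weights, originals untouched."""
    return [w + stddev * rng.standard_normal(w.shape) for w in weights]


def toy_policy(weights, obs):
    # Hypothetical stand-in for the actor network: one linear layer plus tanh
    W, b = weights
    return np.tanh(obs @ W + b)


# Illustrative sizes only; substitute the environment's real dimensions
obs_dim, act_dim = 8, 3
rng = np.random.default_rng(42)
weights = [0.1 * rng.standard_normal((obs_dim, act_dim)), np.zeros(act_dim)]
obs = rng.standard_normal(obs_dim)

# Action-space noise: perturb the action produced by the unperturbed policy
ou = OUNoise(size=act_dim)
action_space_action = np.clip(toy_policy(weights, obs) + ou.sample(), -1.0, 1.0)

# Parameter-space noise: perturb the policy weights, then act deterministically
noisy_weights = perturb_parameters(weights, stddev=0.05, rng=rng)
parameter_space_action = toy_policy(noisy_weights, obs)
```

In the action-space variant the noise is added after the policy has chosen an action, so the same observation can yield different actions on consecutive steps. In the parameter-space variant the perturbed weights would normally be held fixed for a whole episode, so the exploratory behaviour stays consistent for a given observation.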

In our experiments, DDPG without noise was by far the best performer at learning policies for the OpenAI Gym Humanoid-v2 environment.

Demo

Experiment Report

https://github.com/mokeam/DDPG-Humanoid/blob/master/garba_makinwa_ddpg_humanoid_report.pdf
