camigord/Distributed_DDPG

Distributed-DDPG

Overview

The purpose of this repository is to implement the Deep Deterministic Policy Gradient (DDPG) algorithm in a distributed fashion, as proposed here.

I will start by evaluating the performance of DDPG in simple cases and then compare it with the performance obtained when the training process is distributed among several "workers".
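
To make the idea of distributing training among workers more concrete, here is a minimal, framework-free sketch of the parameter-server pattern. It is not this repository's code: the `ParameterServer` class, `toy_gradient`, `run_workers`, and the quadratic toy loss are all assumptions chosen for illustration, and the workers run sequentially rather than in separate processes. Each worker pulls the shared parameters, computes a gradient locally, and pushes it back to the server.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ParameterServer:
    """Hypothetical server: owns the shared parameters and applies pushed gradients."""
    params: List[float]
    learning_rate: float = 0.1
    updates_applied: int = 0

    def pull(self) -> List[float]:
        # Workers receive a copy so they never mutate the shared state directly.
        return list(self.params)

    def push(self, grads: List[float]) -> None:
        # Plain gradient-descent step on the shared parameters.
        self.params = [p - self.learning_rate * g
                       for p, g in zip(self.params, grads)]
        self.updates_applied += 1


def toy_gradient(params: List[float], target: List[float]) -> List[float]:
    # Gradient of the toy loss 0.5 * sum((p - t)^2); stands in for a real
    # actor/critic gradient, which is not shown here.
    return [p - t for p, t in zip(params, target)]


def run_workers(server: ParameterServer, num_workers: int, steps: int,
                target: List[float]) -> List[float]:
    # Workers take turns: pull parameters, compute a gradient, push it back.
    for _ in range(steps):
        for _worker in range(num_workers):
            local_params = server.pull()
            server.push(toy_gradient(local_params, target))
    return server.params
```

In a real distributed setup the workers would run concurrently and interact with their own copies of the environment; the sketch only shows the pull/compute/push cycle that the parameter server coordinates.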

MountainCarContinuous-v0 (OpenAI)

I evaluated the performance of the standard DDPG approach on the MountainCarContinuous task. The figure below shows the training curves until the problem is considered solved.

These results were obtained with a single worker. To replicate them, run the following commands in two different consoles:

# Parameter server
python ddpg.py --job_name="ps" --task_index=0
# First worker
python ddpg.py --job_name="worker" --task_index=0
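
The two commands differ only in their `--job_name` and `--task_index` flags. The sketch below shows one way such flags could be parsed and mapped to a process role. It is an assumption about how a launcher like this might work, not the contents of `ddpg.py`: the `CLUSTER` addresses and the `parse_role` and `describe` helpers are hypothetical.

```python
import argparse

# Hypothetical cluster layout: one parameter server and one worker on localhost.
# The ports are placeholders, not values taken from the repository.
CLUSTER = {
    "ps": ["localhost:2222"],
    "worker": ["localhost:2223"],
}


def parse_role(argv=None):
    """Parse the two flags used in the commands above."""
    parser = argparse.ArgumentParser(description="Hypothetical launcher sketch")
    parser.add_argument("--job_name", choices=["ps", "worker"], required=True)
    parser.add_argument("--task_index", type=int, default=0)
    return parser.parse_args(argv)


def describe(args, cluster=CLUSTER):
    """Return a short description of the role this process would take."""
    # task_index selects one address from the list registered for the job.
    address = cluster[args.job_name][args.task_index]
    role = "parameter server" if args.job_name == "ps" else "worker"
    return f"{role} {args.task_index} at {address}"


if __name__ == "__main__":
    print(describe(parse_role()))
```

Under this layout, adding a second worker would mean appending another address to the `worker` list and launching a process with `--task_index=1`.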

To visualize the training process using TensorBoard:

# TensorBoard
tensorboard --logdir=results/tboard_ddpg/
