Deep Reinforcement Learning Nanodegree: Project 3 - Collaboration and Competition

This project includes the code for a simplified version of the deep genetic algorithm introduced in the paper "Deep Neuroevolution: Genetic Algorithms are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning" by Uber AI Labs. I wrote it to solve the Project 3 - Collaboration and Competition of the Deep Reinforcement Learning Nanodegree @ Udacity.

For more information on the implemented features refer to "Report.ipynb". The notebook includes a summary of all essential concepts used in the code.

Project 3 - Collaboration and Competition - Details:

In this project, two agents should be trained to control rackets to bounce a ball over a net. To maximize the reward, the agents need to learn how to hit the ball over the net and also how to avoid to let the ball hit the ground or fly out of bounds.

Random Agent

Trained Agent

Reward:

a reward of +0.1 is provided if an agent hits the ball over the net
a reward of -0.01 is provided if an agent lets a ball hit the ground or if an agent hits the ball out of bounds

Search Space

the observation space has 48 dimensions
- Two agents times 24 inputs
the action space has four dimensions
- Two agents times 2 actions
- every action is a continuous number between -1 and 1

Task

the task is episodic
two agents control rackets to bounce a ball over a net
agents need to learn how to hit the ball over the net
agents need to learn how to avoid letting the ball hit the ground or fly out of bounds
to solve the environment, the agent must get an average score of +0.5 over 100 consecutive episodes

Getting Started

Create (and activate) a new environment with Python 3.6.

conda create --name env_name python=3.6
source activate env_name

Download the environment from one of the links below and place it into \p3_collab-compet\

Linux: click here
Mac OSX: click here
Windows (32-bit): click here
Windows (64-bit): click here
your folder should now look something like this:

p3_collab-compet\Tennis_Linux\
     \Tennis_Data
     \Tennis.x86
     \Tennis.x86_64

Install Sourcecode dependencies

conda install -c pytorch pytorch
conda install -c anaconda numpy
pip install tensorboardX

unityagents is also required
- an easy way to get this is to install the Deep Reinforcement Learning Nanodegree with its dependencies

git clone https://github.com/udacity/deep-reinforcement-learning.git
cd deep-reinforcement-learning/python
pip install .

How to run the project

You can run the project by running the main.py file through the console.

open the console and run: python main.py -c "your_config_file.json"
to train the agent from scratch set "run_training" in the config file to true
to run the pre-trained agent set "run_training" in the config file to false

optional arguments:

-h, --help

- show help message

-c , --config

- Config file name - file must be available as .json in ./configs

Example: python main.py -c "tennis_linux.json"

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
checkpoints/Tennis_Linux		checkpoints/Tennis_Linux
configs		configs
images		images
monitor/Tennis_Linux/2018_10_29__21_37_16		monitor/Tennis_Linux/2018_10_29__21_37_16
README.md		README.md
Report.ipynb		Report.ipynb
fitness.py		fitness.py
main.py		main.py
mutation.py		mutation.py
network.py		network.py
normalizer.py		normalizer.py
population.py		population.py
session.py		session.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Deep Reinforcement Learning Nanodegree: Project 3 - Collaboration and Competition

Project 3 - Collaboration and Competition - Details:

Random Agent

Trained Agent

Reward:

Search Space

Task

Getting Started

How to run the project

About

Releases

Packages

Languages

cpow-89/Deep_Reinforcement_Learning_Nanodegree_Project_3_Collaboration_and_Competition

Folders and files

Latest commit

History

Repository files navigation

Deep Reinforcement Learning Nanodegree: Project 3 - Collaboration and Competition

Project 3 - Collaboration and Competition - Details:

Random Agent

Trained Agent

Reward:

Search Space

Task

Getting Started

How to run the project

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages