Udacity's Deep Reinforcement ND Project 1: Navigation

lesaun/dqn-banana-hunting

Program Details

Trains an agent to collect yellow bananas and avoid purple bananas.

Trained Agent

State and action space details

The environment provides the state as a 37-dimensional vector containing the agent's velocity and a ray-based perception of objects around the agent's forward direction. The reward provided by the environment is +1 for collecting a yellow banana and -1 for collecting a purple banana.

The agent returns an integer in [0, 3] representing the following directions:

  • 0 - move forward.
  • 1 - move backward.
  • 2 - turn left.
  • 3 - turn right.
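
As a minimal sketch, the four action indices above could be given names in code. The `Action` enum and the `describe` helper are hypothetical and are not taken from the repository:

```python
from enum import IntEnum


class Action(IntEnum):
    """Hypothetical names for the four action indices listed above."""
    FORWARD = 0
    BACKWARD = 1
    TURN_LEFT = 2
    TURN_RIGHT = 3


def describe(action_index: int) -> str:
    """Return a human-readable label for an action index in [0, 3]."""
    # Action(...) raises ValueError for indices outside the action space.
    return Action(action_index).name.lower().replace("_", " ")
```

Because `IntEnum` members behave like plain integers, a value such as `Action.TURN_LEFT` can be used wherever the integer `2` is expected.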

The environment is considered solved when the average cumulative reward over 100 consecutive episodes is above 13. The current agent solves the environment after around 550 episodes.
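
The solved criterion can be checked with a rolling window of episode scores. The sketch below is an illustration only; the names `WINDOW`, `SOLVED_THRESHOLD`, and `is_solved` are assumptions and do not come from the repository:

```python
from collections import deque

WINDOW = 100             # number of consecutive episodes to average over
SOLVED_THRESHOLD = 13.0  # the average score must be above this value


def is_solved(recent_scores) -> bool:
    """Return True once a full window of scores averages above the threshold."""
    if len(recent_scores) < WINDOW:
        return False  # not enough episodes yet
    return sum(recent_scores) / len(recent_scores) > SOLVED_THRESHOLD


# A deque with maxlen drops the oldest score automatically,
# so it always holds at most the last WINDOW episode scores.
scores_window = deque(maxlen=WINDOW)
```

In a training loop, each episode's cumulative reward would be appended to `scores_window` and `is_solved(scores_window)` checked after every episode.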

Implementation details

Uses Double DQN with a 3-layer fully connected (FC) network. See Report.md for more details.
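
For readers unfamiliar with Double DQN, the sketch below shows how its learning target is usually formed: the online network picks the best next action and the target network evaluates it. This is a plain-Python illustration of the general technique, not the repository's code (which uses PyTorch). The function name, argument layout, and the default `gamma=0.99` are assumptions:

```python
def double_dqn_targets(rewards, dones, q_online_next, q_target_next, gamma=0.99):
    """Compute Double DQN targets for a batch of transitions.

    rewards:        list of floats, one per transition
    dones:          list of bools, True if the episode ended on that transition
    q_online_next:  per-transition lists of next-state Q-values from the online network
    q_target_next:  per-transition lists of next-state Q-values from the target network
    gamma:          discount factor (assumed value, not taken from the repository)
    """
    targets = []
    for reward, done, q_online, q_target in zip(
        rewards, dones, q_online_next, q_target_next
    ):
        # Action selection uses the online network ...
        best_action = max(range(len(q_online)), key=lambda a: q_online[a])
        # ... while action evaluation uses the target network.
        bootstrap = 0.0 if done else gamma * q_target[best_action]
        targets.append(reward + bootstrap)
    return targets
```

Separating selection from evaluation is what distinguishes Double DQN from standard DQN, which would take the maximum over the target network's values directly.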

Getting Started

  1. Download the environment for your operating system below.

  2. Extract the contents into banana_app/

  3. Install Anaconda

  4. Install PyTorch and unityagents

Instructions

Either run main.py or use navigation.ipynb to run the environment with the existing model or to retrain it.
