For this project, you will train an agent to navigate (and collect bananas!) in a large, square world.
- The state space has 37 dimensions.
- It contains the agent's velocity, along with ray-based perception of objects around the agent's forward direction.
The action space has four discrete actions:
- 0: move forward
- 1: move backward
- 2: turn left
- 3: turn right
Rewards:
- +1 for collecting a yellow banana.
- -1 for collecting a blue banana.
Goal:
- To collect as many yellow bananas as possible while avoiding blue bananas.
- The environment is considered solved when the agent gets an average score of +13 over 100 consecutive episodes.
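For reference, the sketch below shows how a single episode in this environment is typically driven through the unityagents API with random actions; the executable name Banana.app is a placeholder for whichever build you download, and the snippet is illustrative rather than part of this repository.

```python
from unityagents import UnityEnvironment
import numpy as np

# Point file_name at the Banana executable you downloaded (placeholder name).
env = UnityEnvironment(file_name="Banana.app")
brain_name = env.brain_names[0]

env_info = env.reset(train_mode=False)[brain_name]
state = env_info.vector_observations[0]      # 37-dimensional state vector
score = 0

while True:
    action = np.random.randint(4)            # pick one of the 4 actions at random
    env_info = env.step(action)[brain_name]  # send the action to the environment
    next_state = env_info.vector_observations[0]
    reward = env_info.rewards[0]             # +1 yellow banana, -1 blue banana
    done = env_info.local_done[0]            # True when the episode ends
    score += reward
    state = next_state
    if done:
        break

print("Score:", score)
env.close()
```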
The repository contains the following files:
- network.py Contains a simple deep neural network.
- dueling_network.py Contains a network that implements the Dueling Network architecture (Wang et al., 2016); a sketch of the idea follows this list.
- dqn_agent.py Contains a Q-Network (DQN) agent.
- ddqn_agent.py Contains a double Q-Network (DDQN) agent; its target computation is sketched after this list.
- ddqn_prioritized_agent.py Contains a double Q-Network agent with prioritized experience replay.
- prioritized_replay_buffer.py Contains the prioritized experience replay buffer implementation.
- sum_tree.py Contains a more efficient priority-based sampling structure; the implementation references the one from Jaromir's blog post. A generic sketch is included after this list.
- Navigation.ipynb Contains the agent training code for the Unity Banana environment.
- Report.md Contains the description of the implementation details.
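As background for dueling_network.py, here is a minimal PyTorch sketch of the dueling idea: a shared trunk feeds separate value and advantage streams that are recombined as Q = V + (A - mean(A)). The layer sizes and names are illustrative assumptions, not necessarily what the repository uses.

```python
import torch
import torch.nn as nn

class DuelingQNetwork(nn.Module):
    """Illustrative dueling Q-network: shared trunk, then value and advantage heads."""

    def __init__(self, state_size=37, action_size=4, hidden=64):
        super().__init__()
        self.feature = nn.Sequential(nn.Linear(state_size, hidden), nn.ReLU())
        self.value = nn.Sequential(nn.Linear(hidden, hidden), nn.ReLU(),
                                   nn.Linear(hidden, 1))
        self.advantage = nn.Sequential(nn.Linear(hidden, hidden), nn.ReLU(),
                                       nn.Linear(hidden, action_size))

    def forward(self, state):
        x = self.feature(state)
        v = self.value(x)          # state value V(s), shape (batch, 1)
        a = self.advantage(x)      # advantages A(s, a), shape (batch, action_size)
        # Subtracting the mean advantage keeps the V/A decomposition identifiable.
        return v + a - a.mean(dim=1, keepdim=True)
```

The difference between dqn_agent.py and ddqn_agent.py is how the bootstrap target is built. The sketch below shows the double-DQN target, in which the online network selects the greedy next action and the target network evaluates it; the function and tensor names are assumptions for illustration.

```python
import torch

def ddqn_targets(online_net, target_net, rewards, next_states, dones, gamma=0.99):
    """Double-DQN target: online net picks the action, target net evaluates it.

    rewards and dones are float tensors of shape (batch, 1).
    """
    with torch.no_grad():
        # Greedy action under the online network.
        best_actions = online_net(next_states).argmax(dim=1, keepdim=True)
        # Value of that action under the target network.
        next_q = target_net(next_states).gather(1, best_actions)
    # Zero out the bootstrap term for terminal transitions.
    return rewards + gamma * next_q * (1 - dones)
```

For sum_tree.py, the underlying idea is a binary tree whose leaves hold transition priorities and whose internal nodes hold the sum of their children, so sampling proportional to priority takes O(log N). Below is a generic sketch of that structure, not a copy of the repository's implementation.

```python
import numpy as np

class SumTree:
    """Array-backed binary tree; leaves hold priorities, parents hold sums."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.tree = np.zeros(2 * capacity - 1)   # internal nodes followed by leaves
        self.write = 0                            # index of the next leaf to overwrite

    def add(self, priority):
        idx = self.write + self.capacity - 1      # position of the leaf in the array
        self.update(idx, priority)
        self.write = (self.write + 1) % self.capacity

    def update(self, idx, priority):
        change = priority - self.tree[idx]
        self.tree[idx] = priority
        while idx != 0:                           # propagate the change up to the root
            idx = (idx - 1) // 2
            self.tree[idx] += change

    def sample(self, value):
        """Walk down from the root; returns the leaf index for a value in [0, total)."""
        idx = 0
        while idx < self.capacity - 1:            # stop once we reach a leaf
            left = 2 * idx + 1
            if value <= self.tree[left]:
                idx = left
            else:
                value -= self.tree[left]
                idx = left + 1
        return idx

    @property
    def total(self):
        return self.tree[0]                       # sum of all priorities
```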
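Prioritized replay then draws a uniform value in [0, total) and maps it to a leaf via sample(), so transitions with larger priorities are chosen more often while updates to any single priority stay logarithmic in the buffer size.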
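Report.md walks through how these pieces fit together; the sketches above are only meant as orientation before reading the source files themselves.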
- Install Anaconda (https://conda.io/docs/user-guide/install/index.html)
- Install dependencies by running:
pip install -r requirements.txt
- Download the environment from one of the links below. You need only select the environment that matches your operating system:
- Linux: click here
- Mac OSX: click here
- Windows (32-bit): click here
- Windows (64-bit): click here
- Place the file in the root folder, and unzip (or decompress) the file.
- Follow the steps in Navigation.ipynb to get started with training.
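Once the notebook is open, training usually follows the standard episodic loop sketched below. The Agent class name, its act/step methods, and the epsilon schedule are assumptions about this repository's code, so defer to Navigation.ipynb for the exact interface and hyperparameters.

```python
import numpy as np
from collections import deque
from unityagents import UnityEnvironment
from dqn_agent import Agent                  # class name assumed; check dqn_agent.py

env = UnityEnvironment(file_name="Banana.app")          # placeholder executable name
brain_name = env.brain_names[0]
agent = Agent(state_size=37, action_size=4, seed=0)     # constructor signature assumed

scores_window = deque(maxlen=100)            # last 100 episode scores
eps = 1.0                                    # epsilon-greedy exploration rate

for episode in range(1, 2001):
    env_info = env.reset(train_mode=True)[brain_name]
    state = env_info.vector_observations[0]
    score = 0
    while True:
        action = agent.act(state, eps)                       # epsilon-greedy action
        env_info = env.step(int(action))[brain_name]
        next_state = env_info.vector_observations[0]
        reward = env_info.rewards[0]
        done = env_info.local_done[0]
        agent.step(state, action, reward, next_state, done)  # store transition and learn
        state, score = next_state, score + reward
        if done:
            break
    scores_window.append(score)
    eps = max(0.01, 0.995 * eps)             # decay exploration over episodes
    if np.mean(scores_window) >= 13.0:       # solved: average score of +13 over 100 episodes
        print(f"Solved in {episode} episodes")
        break

env.close()
```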