GitHub - Maggern3/SAC: Multi-Discrete Soft Actor Critic implementation on Unity's procedurally generated Obstacle Tower Environment.

Soft Actor Critic on UnityML's Obstacle Tower Environment

Project environment

Obstacle Tower is a procedurally generated environment designed to challenge state-of-the-art algorithms in planning and generalization. The environment changes every time the agent sees it, so an agent must truly generalize its policies in order to succeed. Soft Actor Critic has been shown to generalize well in highly complex tasks. The solution uses computer vision to go from pixels as inputs to actions as outputs.

The action space has 4 dimensions:

Movement (No-Op/Forward/Back)
Camera Rotation (No-Op/Counter-Clockwise/Clockwise)
Jump (No-Op/Jump)
Sideways Movement (No-Op/Right/Left)

The environment takes these as a NumPy array with 4 elements, where each element is the index of the chosen action in that dimension. The agent performs one action in each of the 4 action dimensions at the same time.
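As a rough illustration of the action format described above, the sketch below packs one index per dimension into a 4-element NumPy array. The names `BRANCH_SIZES`, `make_action`, and `sample_action` are hypothetical and do not come from this repository. Sampling each dimension independently from its own softmax is an assumption about how a multi-discrete policy might be factorised, not a description of what `sac.py` does.

```python
import numpy as np

# Number of choices in each action dimension, in the order listed above:
# movement (3), camera rotation (3), jump (2), sideways movement (3).
BRANCH_SIZES = (3, 3, 2, 3)


def make_action(move=0, camera=0, jump=0, strafe=0):
    """Pack one index per dimension into a 4-element array (0 is always No-Op)."""
    action = np.array([move, camera, jump, strafe], dtype=np.int64)
    if np.any(action < 0) or np.any(action >= np.array(BRANCH_SIZES)):
        raise ValueError(f"action {action.tolist()} is outside {BRANCH_SIZES}")
    return action


def sample_action(branch_logits, rng):
    """Sample one index per dimension from independent softmax distributions."""
    indices = []
    for logits in branch_logits:
        # Subtract the maximum before exponentiating for numerical stability.
        shifted = np.asarray(logits, dtype=np.float64) - np.max(logits)
        probs = np.exp(shifted) / np.exp(shifted).sum()
        indices.append(rng.choice(len(probs), p=probs))
    return make_action(*indices)


# Move forward (index 1) and jump (index 1); the other dimensions stay at No-Op.
forward_jump = make_action(move=1, jump=1)

# Uniform logits stand in for the output of a policy network.
rng = np.random.default_rng(0)
random_action = sample_action([np.zeros(n) for n in BRANCH_SIZES], rng)
```

Here `forward_jump` is the array `[1, 0, 1, 0]`, which is the form the environment's step function would receive. Validating the indices up front turns an out-of-range choice into an immediate `ValueError` rather than an error inside the environment.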

Installation

First clone this repo, then run:

cd obstacle_tower_env
pip install -e .

This installs the required dependencies (tested on Python 3.6.8, 64-bit).

You should now be able to build and run the project.

Training the agent

Run the following command to train the agent:

python runner.py

