Skip to content

Latest commit

 

History

History
13 lines (9 loc) · 684 Bytes

README.md

File metadata and controls

13 lines (9 loc) · 684 Bytes

This repository is about solving the two reinforcement learning challenges cartPole and Pendulum of the openai gym.

The goal was to find solutions quickly and with as simple as possible learning algorithms.

Algorithms

Deep Q-learning

Naive Q-learing

Random actor (to compare with)

Results

The controller for the more difficult of the two problems, the pendulum, achieves a fast and stable upward swing. The trained network is available in q.json with weights in q.h5

pendulum upswing