This repository is about solving the two reinforcement learning challenges cartPole and Pendulum of the openai gym.

The goal was to find solutions quickly and with as simple as possible learning algorithms.

Algorithms

Deep Q-learning

Naive Q-learing

Random actor (to compare with)

Results

The controller for the more difficult of the two problems, the pendulum, achieves a fast and stable upward swing. The trained network is available in q.json with weights in q.h5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Algorithms

Deep Q-learning

Naive Q-learing

Random actor (to compare with)

Results

Files

README.md

Latest commit

History

README.md

File metadata and controls

Algorithms

Deep Q-learning

Naive Q-learing

Random actor (to compare with)

Results