AI Gym Mountain car solutions

A comparison of DP (Dynamic Programming) and Q-learning implementations.
As expected, the Q-learning solution wins: it needs fewer than 500 episodes to reach the goal consistently.
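
For readers new to the method, the core Q-learning update can be sketched in a few lines. This is a minimal tabular illustration, not the implementation in this repository (which appears to use a neural network, judging by the tuning list). The grid resolution `N_BINS`, the step size `alpha`, the discount `gamma`, and all function names are assumptions chosen for the example; the state bounds are the usual Mountain Car limits.

```python
import random

# Usual Mountain Car state bounds (position, velocity).
POS_MIN, POS_MAX = -1.2, 0.6
VEL_MIN, VEL_MAX = -0.07, 0.07
N_BINS = 20     # hypothetical grid resolution per dimension
N_ACTIONS = 3   # push left, no push, push right


def discretize(position, velocity, n_bins=N_BINS):
    """Map a continuous state to a pair of grid indices."""
    def to_bin(x, lo, hi):
        idx = int((x - lo) / (hi - lo) * n_bins)
        return min(max(idx, 0), n_bins - 1)  # clamp to the valid index range
    return to_bin(position, POS_MIN, POS_MAX), to_bin(velocity, VEL_MIN, VEL_MAX)


def make_q_table(n_bins=N_BINS, n_actions=N_ACTIONS):
    """Create a Q-table with one zero-initialised row per grid cell."""
    return {(p, v): [0.0] * n_actions for p in range(n_bins) for v in range(n_bins)}


def choose_action(q, state, epsilon, rng=random):
    """Epsilon-greedy selection: explore with probability epsilon, else act greedily."""
    if rng.random() < epsilon:
        return rng.randrange(len(q[state]))
    values = q[state]
    return values.index(max(values))


def q_update(q, state, action, reward, next_state, done, alpha=0.1, gamma=0.99):
    """One Q-learning step: move Q(s, a) toward the bootstrapped target."""
    target = reward if done else reward + gamma * max(q[next_state])
    q[state][action] += alpha * (target - q[state][action])
```

In a training loop, each environment step would call `discretize` on the observation, `choose_action` to pick a move, and `q_update` with the observed reward and next state.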

Several parameters were varied to tune the model:

  • Number of nodes in all layers
  • Learning rate parameters
  • Advanced activation functions
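
One way to organise such a search is a plain grid over the three parameter groups listed above. The sketch below only enumerates configurations; every value in it is a hypothetical placeholder rather than a setting used in this repository, and the activation names are illustrative labels, not a specific library's API.

```python
from itertools import product

# Hypothetical candidate values, for illustration only.
HIDDEN_UNITS = [32, 64, 128]               # nodes per hidden layer
LEARNING_RATES = [1e-2, 1e-3, 1e-4]
ACTIVATIONS = ["relu", "elu", "leaky_relu"]


def build_grid(hidden_units, learning_rates, activations):
    """Return one config dict per combination of the three parameter lists."""
    return [
        {"hidden_units": h, "learning_rate": lr, "activation": a}
        for h, lr, a in product(hidden_units, learning_rates, activations)
    ]


grid = build_grid(HIDDEN_UNITS, LEARNING_RATES, ACTIVATIONS)
```

Each entry in `grid` could then be passed to a training run, with the number of episodes needed to reach the goal recorded per configuration.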