Skip to content

cranberrii/lunarlander-dqn

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 

Repository files navigation

lunarlander-dqn

Solving openai gym's lunar lander with deep Q network

Using a dqn as a non-linear function approximation of the Q values.

References:

  • Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, Martin Riedmiller. Playing Atari with Deep Reinforcement Learning. DeepMind Technologies, 2013.
  • Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A. Rusu, Joel Veness, Marc G. Bellemare, Alex Graves, Martin Riedmiller, Andreas K. Fidjeland, Georg Ostrovski, Stig Petersen, Charles Beattie, Amir Sadik, Ioannis Antonoglou, Helen King, Dharshan Kumaran, Daan Wierstra, Shane Legg & Demis Hassabis. Human-level control through deep reinforcement learning. Nature, 2015.
  • Peter Henderson, Riashat Islam, Philip Bachman, Joelle Pineau, Doina Precup, David Meger. Deep Reinforcement Learning that Matters. McGill University, Microsoft Maluuba, Montreal, Canada 2019

About

Solving gym's lunar lander with deep Q network

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages