LunarLander-v2-drlnd

The solution for the LunarLander-v2 gym environment. The code is based on materials from Udacity Deep Reinforcement Learning Nanodegree Program.

Project Details

The interaction with the environment is based on the following four discrete actions:

do nothing
fire left orientation engine
fire main engine
fire right orientation engine.

The environment returns the state vector, where the first two comprises coordinates. The episode finishes if the lander crashes or comes to rest. LunarLander-v2 defines "solving" as getting an average reward of 200 over 100 consecutive trials. (https://github.com/openai/gym/wiki/Leaderboard) The environment is solved by using Dueling Double DQN algorithm, where actions selection based on epsilon-greedy policy.

Getting Started

In order to train the model or inference the computed weights, the following need to be installed:

pytorch
pybox2d
gym

Instructions

Since the repository provides the jupyter notebook, follow the steps of execution.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Deep_Q_Network-Dueling-DDQN.ipynb		Deep_Q_Network-Dueling-DDQN.ipynb
LunarLander.gif		LunarLander.gif
README.md		README.md
checkpoint_Dueling_DDQN.pth		checkpoint_Dueling_DDQN.pth
dqn_agent.py		dqn_agent.py
model.py		model.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LunarLander-v2-drlnd

Project Details

Getting Started

Instructions

About

Releases

Packages

Languages

RMiftakhov/LunarLander-v2-drlnd

Folders and files

Latest commit

History

Repository files navigation

LunarLander-v2-drlnd

Project Details

Getting Started

Instructions

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages