Actor-Critic

Solution for Lunar Lander environment v2 of Open AI gym. The algorithm used is actor-critic (vanilla policy gradient with baseline),

-> Dependencies:

    OpenAI gym

    PyTorch 0.4.1

    PIL

-> Hyperparameters can be changed by editing them in respective files

-> To train : run train.py

-> Converges within 1500 episodes

-> To test a pretrained model : run test.py

Name		Name	Last commit message	Last commit date
Latest commit History 57 Commits
gif		gif
preTrained		preTrained
LICENSE		LICENSE
README.md		README.md
model.py		model.py
test.py		test.py
train.py		train.py

Provide feedback