Advanced Machine Learning - Assignment 5

Implements the Double DQN algorithm on top of the CS294-112 HW3 starter code.
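For orientation, the core difference between Double DQN and standard DQN is how the bootstrap target is formed: the online network selects the next action, and the target network evaluates it. The snippet below is a minimal NumPy sketch of that target computation, not the code in dqn.py; the function name `double_dqn_targets` and its arguments are hypothetical and chosen only for illustration.

```python
import numpy as np


def double_dqn_targets(rewards, dones, q_online_next, q_target_next, gamma=0.99):
    """Compute Double DQN targets for a batch of transitions.

    Hypothetical helper for illustration only (not taken from dqn.py).
    rewards, dones: arrays of shape (batch,)
    q_online_next, q_target_next: arrays of shape (batch, num_actions),
        Q-values of the next state from the online and target networks.
    """
    rewards = np.asarray(rewards, dtype=np.float64)
    dones = np.asarray(dones, dtype=np.float64)
    q_online_next = np.asarray(q_online_next, dtype=np.float64)
    q_target_next = np.asarray(q_target_next, dtype=np.float64)

    # Action selection uses the online network ...
    best_actions = np.argmax(q_online_next, axis=1)
    # ... while action evaluation uses the target network.
    next_values = q_target_next[np.arange(len(best_actions)), best_actions]

    # Terminal transitions (done == 1) do not bootstrap.
    return rewards + gamma * (1.0 - dones) * next_values


targets = double_dqn_targets(
    rewards=[1.0, 0.0],
    dones=[0, 1],
    q_online_next=[[1.0, 2.0], [3.0, 0.0]],
    q_target_next=[[10.0, 5.0], [7.0, 8.0]],
    gamma=0.5,
)
print(targets)
```

Standard DQN would instead take the maximum over `q_target_next` directly, which tends to overestimate action values; decoupling selection from evaluation is what reduces that bias.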

Usage

The README of the original repository follows.

Dependencies:

Before doing anything, first replace gym/envs/box2d/lunar_lander.py with the provided lunar_lander.py file.

The only files that you need to look at are dqn.py and train_ac_f18.py, which you will implement.

See the HW3 PDF for further instructions.

The starter code was based on an implementation of Q-learning for Atari generously provided by Szymon Sidor from OpenAI.
