This is a concise PyTorch implementation of Rainbow DQN [6], combining Double Q-learning [2], a Dueling network architecture [3], Noisy Networks for exploration [4], prioritized experience replay (PER) [5], and multi-step (n-step) Q-learning on top of DQN [1].
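As a quick illustration of one of these components, below is a minimal sketch of a factorized-Gaussian noisy linear layer in the spirit of [4]; the class name and hyperparameters are illustrative and do not necessarily match the layer used in this repository.

```python
import math
import torch
import torch.nn as nn
import torch.nn.functional as F


class NoisyLinear(nn.Module):
    """Factorized-Gaussian noisy linear layer (illustrative sketch, not this repo's exact code)."""

    def __init__(self, in_features, out_features, sigma_init=0.5):
        super(NoisyLinear, self).__init__()
        self.in_features = in_features
        self.out_features = out_features
        # Learnable means and standard deviations of weights and biases
        self.weight_mu = nn.Parameter(torch.empty(out_features, in_features))
        self.weight_sigma = nn.Parameter(torch.empty(out_features, in_features))
        self.bias_mu = nn.Parameter(torch.empty(out_features))
        self.bias_sigma = nn.Parameter(torch.empty(out_features))
        # Noise samples are buffers, not trainable parameters
        self.register_buffer('weight_epsilon', torch.empty(out_features, in_features))
        self.register_buffer('bias_epsilon', torch.empty(out_features))
        # Initialization scheme from the Noisy Networks paper
        mu_range = 1.0 / math.sqrt(in_features)
        self.weight_mu.data.uniform_(-mu_range, mu_range)
        self.bias_mu.data.uniform_(-mu_range, mu_range)
        self.weight_sigma.data.fill_(sigma_init / math.sqrt(in_features))
        self.bias_sigma.data.fill_(sigma_init / math.sqrt(in_features))
        self.reset_noise()

    @staticmethod
    def _scaled_noise(size):
        # f(x) = sign(x) * sqrt(|x|), as in the Noisy Networks paper
        x = torch.randn(size)
        return x.sign() * x.abs().sqrt()

    def reset_noise(self):
        # Factorized noise: one noise vector per input and per output dimension
        eps_in = self._scaled_noise(self.in_features)
        eps_out = self._scaled_noise(self.out_features)
        self.weight_epsilon.copy_(eps_out.unsqueeze(1) * eps_in.unsqueeze(0))
        self.bias_epsilon.copy_(eps_out)

    def forward(self, x):
        if self.training:
            weight = self.weight_mu + self.weight_sigma * self.weight_epsilon
            bias = self.bias_mu + self.bias_sigma * self.bias_epsilon
        else:
            # Use deterministic (mean) weights at evaluation time
            weight, bias = self.weight_mu, self.bias_mu
        return F.linear(x, weight, bias)
```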
python==3.7.9
numpy==1.19.4
pytorch==1.5.0
tensorboard==0.6.0
gym==0.21.0
You can directly run Rainbow_DQN_main.py in your own IDE.
You can set 'env_index' in the code to switch environments (see the sketch after this list).
env_index=0 corresponds to 'CartPole-v1'
env_index=1 corresponds to 'LunarLander-v2'
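For illustration, the environment selection looks roughly like the following; the exact variable names used in Rainbow_DQN_main.py may differ.

```python
import gym

env_names = ['CartPole-v1', 'LunarLander-v2']
env_index = 0  # 0: CartPole-v1, 1: LunarLander-v2
env = gym.make(env_names[env_index])
```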
You can use TensorBoard to visualize the training curves, which are saved in the 'runs' directory.
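For example, from the repository root you can launch TensorBoard with `tensorboard --logdir=runs` and open the printed local URL in a browser.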
The reward data are saved as NumPy arrays in the 'data_train' directory.
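A saved reward curve can be loaded back with NumPy, for example (the file name below is hypothetical; the actual names depend on the environment and seed):

```python
import numpy as np

# Hypothetical file name; actual files in 'data_train' are named by
# environment and random seed.
rewards = np.load('./data_train/Rainbow_DQN_CartPole-v1_seed_0.npy')
print(rewards.shape, rewards.mean())
```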
The training curves are shown below.
The right figure is smoothed with a moving average over a window of 10 steps. The solid line and the shaded region represent the mean and standard deviation, respectively, over three different random seeds (seed = 0, 10, 100).
[1] Mnih V, Kavukcuoglu K, Silver D, et al. Human-level control through deep reinforcement learning[J]. Nature, 2015, 518(7540): 529-533.
[2] Van Hasselt H, Guez A, Silver D. Deep reinforcement learning with double Q-learning[C]//Proceedings of the AAAI Conference on Artificial Intelligence. 2016, 30(1).
[3] Wang Z, Schaul T, Hessel M, et al. Dueling network architectures for deep reinforcement learning[C]//International Conference on Machine Learning. PMLR, 2016: 1995-2003.
[4] Fortunato M, Azar M G, Piot B, et al. Noisy networks for exploration[J]. arXiv preprint arXiv:1706.10295, 2017.
[5] Schaul T, Quan J, Antonoglou I, et al. Prioritized experience replay[J]. arXiv preprint arXiv:1511.05952, 2015.
[6] Hessel M, Modayil J, Van Hasselt H, et al. Rainbow: Combining improvements in deep reinforcement learning[C]//Thirty-Second AAAI Conference on Artificial Intelligence. 2018.