Atari benchmark 7/28/2019 #396

Merged
merged 11 commits into from
Jul 31, 2019

kengz
Owner

@kengz kengz commented Jul 29, 2019

Atari benchmark

This benchmark was run on the current master branch. All results are shown below, and the data folders, including the metrics and models, are uploaded to the SLM Lab public Dropbox with the file prefix PR396-.

To Reproduce

  1. JSON spec: See the spec/benchmark folder
  2. git SHA (contained in the file above): 8360612

| Env. \ Alg. | A2C (GAE) | A2C (n-step) | PPO | DQN | DDQN+PER |
|---|---|---|---|---|---|
| Breakout | 389.99 | 391.32 | 425.89 | 65.04 | 181.72 |
| Pong | 20.04 | 19.66 | 20.09 | 18.34 | 20.44 |
| Qbert | 13,328.32 | 13,259.19 | 13,691.89 | 4,787.79 | 11,673.52 |
| Seaquest | 892.68 | 1,686.08 | 1,583.04 | 1,118.50 | 3,751.34 |
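
To compare the columns quickly, the table above can be loaded into a dictionary and the top-scoring algorithm picked per environment. This is only a sketch for reading the table: the names `results` and `best_per_env` are hypothetical, and it assumes a higher score is better, which the PR does not state explicitly.

```python
# Scores copied from the benchmark table in this PR.
# Assumption: a higher score is better.
results = {
    "Breakout": {"A2C (GAE)": 389.99, "A2C (n-step)": 391.32, "PPO": 425.89,
                 "DQN": 65.04, "DDQN+PER": 181.72},
    "Pong": {"A2C (GAE)": 20.04, "A2C (n-step)": 19.66, "PPO": 20.09,
             "DQN": 18.34, "DDQN+PER": 20.44},
    "Qbert": {"A2C (GAE)": 13328.32, "A2C (n-step)": 13259.19, "PPO": 13691.89,
              "DQN": 4787.79, "DDQN+PER": 11673.52},
    "Seaquest": {"A2C (GAE)": 892.68, "A2C (n-step)": 1686.08, "PPO": 1583.04,
                 "DQN": 1118.50, "DDQN+PER": 3751.34},
}


def best_per_env(table):
    """Return {environment: algorithm with the highest score}."""
    return {env: max(scores, key=scores.get) for env, scores in table.items()}


if __name__ == "__main__":
    for env, alg in best_per_env(results).items():
        print(f"{env}: {alg} ({results[env][alg]:,.2f})")
```

Under that assumption, PPO leads on Breakout and Qbert, and DDQN+PER leads on Pong and Seaquest.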

Terminology

  • A2C (GAE): Advantage Actor-Critic with GAE as advantage estimation
  • A2C (n-step): Advantage Actor-Critic with n-step return as advantage estimation
  • DDQN+PER: Double Deep Q-Learning with Prioritized Experience Replay
  • DQN: Deep Q-learning
  • PPO: Proximal Policy Optimization
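
The two A2C variants above differ only in how the advantage is estimated. The following is a minimal pure-Python sketch of the textbook n-step and GAE estimators, not SLM Lab's implementation. The function names, argument layout, and default `gamma` and `lam` values are assumptions made for illustration.

```python
def nstep_advantages(rewards, values, dones, next_value, gamma=0.99):
    """n-step advantage: bootstrapped discounted return minus V(s_t).

    rewards, values, dones are per-step lists for one rollout;
    next_value is the critic's estimate for the state after the last step.
    """
    advantages = [0.0] * len(rewards)
    running = next_value
    for t in reversed(range(len(rewards))):
        # Reset the bootstrap whenever the episode ended at step t.
        running = rewards[t] + gamma * running * (1.0 - dones[t])
        advantages[t] = running - values[t]
    return advantages


def gae_advantages(rewards, values, dones, next_value, gamma=0.99, lam=0.95):
    """GAE: exponentially weighted sum of one-step TD errors."""
    n = len(rewards)
    advantages = [0.0] * n
    gae = 0.0
    for t in reversed(range(n)):
        v_next = next_value if t == n - 1 else values[t + 1]
        not_done = 1.0 - dones[t]
        delta = rewards[t] + gamma * v_next * not_done - values[t]
        gae = delta + gamma * lam * not_done * gae
        advantages[t] = gae
    return advantages


if __name__ == "__main__":
    # Hypothetical toy rollout, not benchmark data.
    r, v, d = [1.0, 0.0, 1.0], [0.5, 0.4, 0.3], [0.0, 0.0, 0.0]
    print(nstep_advantages(r, v, d, next_value=0.2, gamma=0.9))
    print(gae_advantages(r, v, d, next_value=0.2, gamma=0.9, lam=0.95))
```

In this formulation, `lam=1.0` makes GAE coincide with the n-step estimate over the rollout, and `lam=0.0` reduces it to the one-step TD error, so `lam` trades variance against bias.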

[Figures: PPO graphs for BeamRider, Breakout, KungFuMaster, MsPacman, Pong, Qbert, Seaquest, and SpaceInvaders]

@kengz added the result label (experiment result upload) on Jul 29, 2019
@kengz changed the title from "A2C and PPO Atari small benchmark" to "basic Atari benchmark 7/28/2019" on Jul 29, 2019
@kengz changed the title from "basic Atari benchmark 7/28/2019" to "Atari benchmark 7/28/2019" on Jul 29, 2019
@kengz merged commit 5da1c98 into master on Jul 31, 2019
@kengz deleted the bench branch on July 31, 2019 17:17