diff --git a/BENCHMARK.md b/BENCHMARK.md index 544057a7c..08f8f62cf 100644 --- a/BENCHMARK.md +++ b/BENCHMARK.md @@ -66,8 +66,8 @@ SLM Lab's benchmark includes environments from the following offerings: | Env. \ Alg. | DQN | DDQN+PER | A2C (GAE) | A2C (n-step) | PPO | SAC | | Breakout
graph
| 80.88 | 182 | 377 | 398 | **443** | 3.51* | | Pong
graph
| 18.48 | 20.5 | 19.31 | 19.56 | **20.58** | 19.87* | -| Seaquest
graph
| 1185 | **4405** | 1070 | 1684 | 1715 | 171* | -| Qbert
graph
| 5494 | 11426 | 12405 | **13590** | 13460 | 923* | +| Qbert
graph
| 5494 | 11426 | 12405 | **13590** | 13460 | 923* | +| Seaquest
graph
| 1185 | **4405** | 1070 | 1684 | 1715 | 171* | | LunarLander
graph
| 192 | 233 | 25.21 | 68.23 | 214 | **276** | | UnityHallway
graph
| -0.32 | 0.27 | 0.08 | -0.96 | **0.73** | 0.01 | | UnityPushBlock
graph
| 4.88 | 4.93 | 4.68 | 4.93 | **4.97** | -0.70 | diff --git a/README.md b/README.md index a1f1c64d7..515ed307c 100644 --- a/README.md +++ b/README.md @@ -56,8 +56,8 @@ Due to their standardized design, all the algorithms can be parallelized asynchr | Env. \ Alg. | DQN | DDQN+PER | A2C (GAE) | A2C (n-step) | PPO | SAC | | Breakout
graph
| 80.88 | 182 | 377 | 398 | **443** | 3.51* | | Pong
graph
| 18.48 | 20.5 | 19.31 | 19.56 | **20.58** | 19.87* | -| Seaquest
graph
| 1185 | **4405** | 1070 | 1684 | 1715 | 171* | -| Qbert
graph
| 5494 | 11426 | 12405 | **13590** | 13460 | 923* | +| Qbert
graph
| 5494 | 11426 | 12405 | **13590** | 13460 | 923* | +| Seaquest
graph
| 1185 | **4405** | 1070 | 1684 | 1715 | 171* | | LunarLander
graph
| 192 | 233 | 25.21 | 68.23 | 214 | **276** | | UnityHallway
graph
| -0.32 | 0.27 | 0.08 | -0.96 | **0.73** | 0.01 | | UnityPushBlock
graph
| 4.88 | 4.93 | 4.68 | 4.93 | **4.97** | -0.70 |