-
Notifications
You must be signed in to change notification settings - Fork 155
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
176 changed files
with
4,709 additions
and
1,956 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,32 @@ | ||
# Deepmind Control (https://github.com/deepmind/dm_control) | ||
|
||
* I could not find any ppo deepmind_control benchmark. It is a first version only. Will be updated later. | ||
|
||
## How to run: | ||
* **Humanoid (Stand, Walk or Run)** | ||
``` | ||
poetry install -E envpool | ||
poetry run pip install torch==1.11.0+cu113 torchvision==0.12.0+cu113 -f https://download.pytorch.org/whl/cu113/torch_stable.html | ||
poetry run python runner.py --train --file rl_games/configs/dm_control/humanoid_walk.yaml | ||
``` | ||
|
||
## Results: | ||
|
||
* No tuning. I just run it on a couple of envs. | ||
* I used 4000 epochs which is ~32M steps for almost all envs except HumanoidRun. But a few millions of steps was enough for the most of the envs. | ||
* Deepmind used a pretty strange reward and training rules. A simple reward transformation: log(reward + 1) achieves best scores faster. | ||
|
||
| Env | Rewards | | ||
| ------------- | ------------- | | ||
| Ball In Cup Catch | 938 | | ||
| Cartpole Balance | 988 | | ||
| Cheetah Run | 685 | | ||
| Fish Swim | 600 | | ||
| Hopper Stand | 557 | | ||
| Humanoid Stand | 653 | | ||
| Humanoid Walk | 621 | | ||
| Humanoid Run | 200 | | ||
| Pendulum Swingup | 706 | | ||
| Walker Stand | 907 | | ||
| Walker Walk | 917 | | ||
| Walker Run | 702 | |
Large diffs are not rendered by default.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file was deleted.
Oops, something went wrong.
Oops, something went wrong.