Intrinsic-Rewards

A collection of deep reinforcement learning algorithms with intrinsic rewards, based on Rainy and PyTorch.

Setup

First, install pipenv. E.g. you can install it via

pip install pipenv --user

Then you can create a virtual environment for isolated installing of related packages.

pipenv --site-packages --three install

Run

RND

With 32 parallel workers:

pipenv run experiments/rnd_atari.py --override='config.nworkers=32' train

With 64 parallel workers:

pipenv run experiments/rnd_atari.py train

With 128 parallel workers(needs horovod):

horovodrun -np 2 -H localhost:1,$other_host_name:1 pipenv run python experiments/rnd_atari.py train

Implemented Algorithms

Random Network Distillation

https://arxiv.org/abs/1810.12894
command: pipenv run python experiments/rnd_atari.py

Results

Commit hash: aa4ebf0c3e9090d11fbd88a5de44aa2189f1d232

RND
- 128 parallel enviroments, No MPI + CNN policy(NO LSTM)
- All parameters are the same as the paper
PPO
- with the same setting
- All parameters are in ppo_atari.py

Score

Intrinsic rewards

License

This project is licensed under Apache License, Version 2.0 (LICENSE-APACHE or http://www.apache.org/licenses/LICENSE-2.0).

Name		Name	Last commit message	Last commit date
Latest commit History 121 Commits
.github/workflows		.github/workflows
experiments		experiments
int_rew		int_rew
pictures		pictures
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
Pipfile		Pipfile
Pipfile.lock		Pipfile.lock
README.md		README.md
pyproject.toml		pyproject.toml
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Intrinsic-Rewards

Setup

Run

RND

Implemented Algorithms

Random Network Distillation

Results

Score

Intrinsic rewards

License

About

Releases

Packages

Languages

License

kngwyu/intrinsic-rewards

Folders and files

Latest commit

History

Repository files navigation

Intrinsic-Rewards

Setup

Run

RND

Implemented Algorithms

Random Network Distillation

Results

Score

Intrinsic rewards

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages