abcdRL (Implement a RL algorithm in four simple steps)

abcdRL is a Modular Single-file Reinforcement Learning Algorithms Library that provides modular design without strict and clean single-file implementation.

Understand the full implementation details of the algorithm in a single file quickly when reading the code; Benefit from a lightweight modular design, only need to focus on a small number of modules when modifying the algorithm.

abcdRL mainly references the single-file design philosophy of vwxyzjn/cleanrl and the module design of PaddlePaddle/PARL.

Documentation ➡️ docs.abcdrl.xyz

Roadmap🗺️ #57

🚀 Quickstart

Open the project in Gitpod🌐 and start coding immediately.

Using Docker📦:

# 0. Prerequisites: Docker & Nvidia Drive & NVIDIA Container Toolkit
# 1. Run DQN algorithm
docker run --rm --gpus all sdpkjc/abcdrl python abcdrl/dqn_torch.py

For detailed installation instructions 👀

🐼 Features

👨‍👩‍👧‍👦 Unified code structure
📄 Single-file implementation
🐷 Low code reuse
📐 Minimizing code differences
📈 Tensorboard & Wandb integration
🛤 PEP8(code style) & PEP526(type hint) compliant

🗽 Design Philosophy

"Copy📋", ~~not "Inheritance🧬"~~
"Single-file📜", ~~not "Multi-file📚"~~
"Features reuse🛠", ~~not "Algorithms reuse🖨"~~
"Unified logic🤖", ~~not "Unified interface🔌"~~

✅ Implemented Algorithms

Weights & Biases Benchmark Report ➡️ report.abcdrl.xyz

Deep Q Network (DQN) _{dqn_torch.py, dqn_tf.py, dqn_atari_torch.py, dqn_atari_tf.py}
Deep Deterministic Policy Gradient (DDPG) _{ddpg_torch.py}
Twin Delayed Deep Deterministic Policy Gradient (TD3) _{td3_torch.py}
Soft Actor-Critic (SAC) _{sac_torch.py}
Proximal Policy Optimization (PPO) _{ppo_torch.py}

Double Deep Q Network (DDQN) _{ddqn_torch.py, ddqn_tf.py}
Prioritized Deep Q Network (PDQN) _{pdqn_torch.py, pdqn_tf.py}

Citing abcdRL

@misc{zhao_abcdrl_2022,
    author = {Yanxiao, Zhao},
    month = {12},
    title = {{abcdRL: Modular Single-file Reinforcement Learning Algorithms Library}},
    url = {https://github.com/sdpkjc/abcdrl},
    year = {2022}
}

Name		Name	Last commit message	Last commit date
Latest commit History 101 Commits
.github		.github
abcdrl		abcdrl
docs		docs
requirements		requirements
tests		tests
.gitignore		.gitignore
.gitpod.Dockerfile		.gitpod.Dockerfile
.gitpod.yml		.gitpod.yml
.pre-commit-config.yaml		.pre-commit-config.yaml
.readthedocs.yaml		.readthedocs.yaml
CITATION.bib		CITATION.bib
Dockerfile		Dockerfile
LICENSE		LICENSE
README.cn.md		README.cn.md
README.md		README.md
benchmark.py		benchmark.py
benchmark.toml		benchmark.toml
cu113.Dockerfile		cu113.Dockerfile
docker.test.yml		docker.test.yml
example_eval_model.py		example_eval_model.py
example_tuner_sweep.py		example_tuner_sweep.py
mkdocs.yml		mkdocs.yml
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

abcdRL (Implement a RL algorithm in four simple steps)

🚀 Quickstart

🐼 Features

🗽 Design Philosophy

✅ Implemented Algorithms

Citing abcdRL

About

Releases 4

Contributors 2

Languages

License

sdpkjc/abcdrl

Folders and files

Latest commit

History

Repository files navigation

abcdRL (Implement a RL algorithm in four simple steps)

🚀 Quickstart

🐼 Features

🗽 Design Philosophy

✅ Implemented Algorithms

Citing abcdRL

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 4

Contributors 2

Languages