Skip to content
This repository was archived by the owner on Jan 27, 2023. It is now read-only.

Project on multi agent reinforcement learning applied on patrolling agents

License

Notifications You must be signed in to change notification settings

bdvllrs/marl-patrolling-agents

Folders and files

NameName
Last commit message
Last commit date

Latest commit

f2924d8 · Dec 10, 2019
Dec 10, 2019
Jan 27, 2019
Jan 27, 2019
Jan 26, 2019
Jan 27, 2019
Jan 17, 2019
Jan 27, 2019
Dec 10, 2019
Mar 12, 2019
Jan 27, 2019
Jan 27, 2019
Jan 27, 2019
Feb 2, 2019
Jan 15, 2019

Repository files navigation

MARL for patrolling agents

We provide here an environment for a predator/prey game. We explore two methods: a simple DQN architecture as well as a true Multi-Agent algorithm architecture using a Policy Gradient approach: Multi-Agent Deep Deterministic Policy Gradient (Lowe, R., Wu, Y., Tamar, A., Harb, J., Abbeel, O. P., & Mordatch, I. (2017). Multi-agent actor-critic for mixed cooperative-competitive environments. In Advances in Neural Information Processing Systems (pp. 6379-6390)).

Some results

After 1400 episodes of training.

DDQN 2vs2 MADDPG 2vs2 DDQN 2v1 Magic Switch

Environment

Blue dots represent preys and orange dots are predators.

Action space

The action space is discrete. Every agent can do one of none, left, right, top, bottom.

State space

The state is perfectly known by all the agents.

The state is the 3D coordinates (x, y, z) for every agent.

About

Project on multi agent reinforcement learning applied on patrolling agents

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages