CommNet-BiCnet

CommNet and BiCnet implementations in TensorFlow.

Training

Train CommNet using the DDPG algorithm:

python train_comm_net.py

Hypersearch

To find good values for hyperparameters such as actor_lr or critic_lr, a simple grid search is implemented. It launches multiple instances of the trainer in parallel, with the number of instances based on the number of CPU cores.

python hypersearch.py
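
The idea of a parallel grid search can be sketched as follows. This is a minimal illustration, not the contents of hypersearch.py: the grid values, the helper names (expand_grid, run_trial, grid_search) and the toy score are all assumptions. A real run_trial would start the trainer and report back its result.

```python
import itertools
import os
from concurrent.futures import ThreadPoolExecutor

# Hypothetical search space; these values are illustrative, not the repo's.
GRID = {
    "actor_lr": [1e-4, 1e-3],
    "critic_lr": [1e-3, 1e-2],
}


def expand_grid(grid):
    """Return one dict per combination of hyperparameter values."""
    names = sorted(grid)
    combos = itertools.product(*(grid[name] for name in names))
    return [dict(zip(names, values)) for values in combos]


def run_trial(params):
    """Stand-in for one training run (assumption, not the real trainer).

    A real version might start the trainer in a subprocess and read back
    a score. A toy score is used here so the sketch stays runnable.
    """
    score = -abs(params["actor_lr"] - 1e-3) - abs(params["critic_lr"] - 1e-3)
    return params, score


def grid_search(grid, max_workers=None):
    """Evaluate every combination in parallel, one worker per CPU core by default."""
    workers = max_workers or os.cpu_count() or 1
    with ThreadPoolExecutor(max_workers=workers) as pool:
        results = list(pool.map(run_trial, expand_grid(grid)))
    # The combination with the highest score wins.
    return max(results, key=lambda item: item[1])
```

Calling grid_search(GRID) returns the best parameter dict together with its score. Threads are enough in this sketch because each worker is expected to spend its time waiting on an external training process.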

Guessing sum environment

A simple game described in the BiCnet paper for testing whether communication works. The environment implements the essential methods of the core OpenAI Gym interface.

Each agent receives a scalar drawn from a truncated Gaussian on [−10, 10]. Each agent must output the sum of the inputs received by all agents. An agent's reward is normalized to [0, 1] and is based on the absolute difference between the true sum and its output.
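
The rules above can be sketched as a small environment with reset() and step(). This is an illustration under stated assumptions, not the code in guessing_sum_env.py: the class name, the Gaussian spread (std), the single-step episodes and the reward normalizer are all assumed, since the description does not specify them.

```python
import random


class GuessingSumEnv:
    """Toy guessing sum game exposing Gym-style reset() and step()."""

    def __init__(self, n_agents=2, bound=10.0, std=5.0, seed=None):
        self.n_agents = n_agents
        self.bound = bound
        self.std = std  # assumed: the description gives no spread
        self.rng = random.Random(seed)
        self.inputs = None

    def _sample_truncated(self):
        # Rejection sampling: redraw until the value lies in [-bound, bound].
        while True:
            x = self.rng.gauss(0.0, self.std)
            if -self.bound <= x <= self.bound:
                return x

    def reset(self):
        """Draw one scalar per agent and return the observations."""
        self.inputs = [self._sample_truncated() for _ in range(self.n_agents)]
        return list(self.inputs)

    def step(self, actions):
        """Score each agent's guess of the sum of all inputs."""
        target = sum(self.inputs)
        limit = self.bound * self.n_agents  # largest possible |sum|
        max_error = 2.0 * limit  # assumed normalizer for the reward
        rewards = []
        for action in actions:
            guess = max(-limit, min(limit, action))  # clip to the valid range
            rewards.append(1.0 - abs(target - guess) / max_error)
        done = True  # assumed: each episode is a single guess
        return list(self.inputs), rewards, done, {}
```

With this normalizer an exact guess earns a reward of 1 and the worst possible guess earns 0. For example, after obs = env.reset(), calling env.step([sum(obs)] * env.n_agents) gives every agent the maximum reward.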

Results

Training CommNet in the Guessing sum env with 2 agents
