VDN #22

schroederdewitt · 2020-07-02T08:35:28Z

This PR includes Tarun's VDN implementation, along with a partial implementation of PPO-S. The remaining PPO-S work concerns network architecture issues for the centralized critic.

bug fix for state shape

Denys88 · 2020-07-02T17:41:42Z

common/experience.py

+            Max number of transitions to store in the buffer. When the buffer
+            overflows the old memories are dropped.
+        """
+        self._storage = []


I ran some measurements and found a faster way to implement the replay buffer, using only NumPy and preallocated memory.
You can take a look at common/experience.py.
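
For illustration only, the preallocation idea could look roughly like the sketch below. This is not the code in common/experience.py: the class name `PreallocatedReplayBuffer`, its fields, and the assumption of integer (discrete) actions are all hypothetical. The point is that the arrays are allocated once and overwritten in ring order, instead of growing a Python list such as `self._storage = []`.

```python
import numpy as np


class PreallocatedReplayBuffer:
    """Fixed-size ring buffer backed by preallocated NumPy arrays (hypothetical sketch)."""

    def __init__(self, size, obs_shape, obs_dtype=np.float32):
        self._size = size
        self._next_idx = 0  # slot the next transition is written to
        self._count = 0     # number of valid transitions currently stored
        # Allocate all storage once, up front.
        self._obs = np.zeros((size, *obs_shape), dtype=obs_dtype)
        self._next_obs = np.zeros((size, *obs_shape), dtype=obs_dtype)
        self._actions = np.zeros(size, dtype=np.int64)  # assumes discrete actions
        self._rewards = np.zeros(size, dtype=np.float32)
        self._dones = np.zeros(size, dtype=np.bool_)

    def __len__(self):
        return self._count

    def add(self, obs, action, reward, next_obs, done):
        """Write one transition, overwriting the oldest one once the buffer is full."""
        i = self._next_idx
        self._obs[i] = obs
        self._actions[i] = action
        self._rewards[i] = reward
        self._next_obs[i] = next_obs
        self._dones[i] = done
        self._next_idx = (i + 1) % self._size
        self._count = min(self._count + 1, self._size)

    def sample(self, batch_size, rng=None):
        """Sample uniformly with replacement from the stored transitions."""
        if self._count == 0:
            raise ValueError("cannot sample from an empty buffer")
        rng = np.random.default_rng() if rng is None else rng
        idx = rng.integers(0, self._count, size=batch_size)
        return (
            self._obs[idx],
            self._actions[idx],
            self._rewards[idx],
            self._next_obs[idx],
            self._dones[idx],
        )
```

Sampling with an index array returns whole batches in one vectorized indexing step per field, which avoids the per-item Python loop that a list-based buffer typically needs.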

schroederdewitt and others added 30 commits May 25, 2020 10:44

interfaced with sacred

f899a23

add experiment infrastructure

116c9e2

fixed docker

49a41a5

fixes for docker file

81ae28d

fixed Dockerfile

a8d6b61

torch runner update

680f8f5

fixes

3c6f54b

fixed run.sh

b2aacac

scalar logging problem

e3bbc33

Merge branch 'master' of github.com:schroederdewitt/rl_games

1743460

added some config

ac724fc

added shell scripts

caa084b

Merge branch 'master' of github.com:schroederdewitt/rl_games

130f78f

added 3s_vs_5z configs

641344a

Merge branch 'master' of github.com:schroederdewitt/rl_games

0f004a5

minor

a9d06ae

minor

05310ac

minor

e7fcda2

added MM2_torch.yaml

7061aef

lunch

902f789

Merge branch 'master' of github.com:schroederdewitt/rl_games

b165180

interfaced tf code

7b4f1be

Merge branch 'master' of github.com:schroederdewitt/rl_games

d5b3e37

added tf baselines

e2598d2

added more config

5503326

added additional maps

dc5f049

fix

fe71d50

vdn start

f7a54c6

up

8b9c3db

updates

d7e0d91

schroederdewitt and others added 2 commits July 2, 2020 08:50

Merge pull request #2 from schroederdewitt/vdn_s

554258c

bug fix for state shape

minor logging fix

a3c5a6d

Denys88 reviewed Jul 2, 2020


schroederdewitt and others added 11 commits July 15, 2020 18:55

fixed rl_games dockerfile cuda version

7c82843

Merge branch 'master' of github.com:schroederdewitt/rl_games

d48e63c

minor

31c08ab

fixed launch servers

8f8be0c

minor

c19537f

Merge branch 'master' of github.com:schroederdewitt/rl_games

4e0999f

updated experience replay buffer

aa2ae51

iql with normal buffer

19dbf55

bug update

a7ced6d

update

8c7d251

updated config

f97e82a

tarun018 force-pushed the master branch from f21b4de to f97e82a Compare July 25, 2020 13:08

tarun018 and others added 13 commits July 25, 2020 14:24

test dynamic growth

42a954d

no devide placement

a67d631

config update

0e4269a

added stag_hunt (not yet fully working)

79916cf

dockerfile with cpu

e2fdc72

Merge branch 'master' of github.com:schroederdewitt/rl_games

6ede081

removed central state code

a971642

added staghunt (no central state) for ppo

6ceb9e1

staghunt print fix

bfaee18

stag hunt fix

bf11583

Merge branch 'master' of github.com:schroederdewitt/rl_games into master

438a4a5

new config

466ef5e

update max epochs

91f9ea5

tarun018 force-pushed the master branch from d7571e8 to 91f9ea5 Compare December 31, 2020 04:33

new c

444e71e
