v2.0.0 Singleton mode #153

kengz · 2018-09-01T02:17:31Z

v2.0.0: make components independent of the framework so it can be used outside of SLM-Lab for development and production, and improve usability. Backward-incompatible with v1.x.

Singleton Mode as Default

singleton case (single-agent-env-body) is now the default. Any implementations need only to worry about singleton.
space case (multi-agent-env-body) is now an extension from singleton case. Simply add space_{method} to handle the space logic.
make components more independent from framework
major logic simplification to improve usability. Simplify the AEB and init sequences. remove post_body_init()

Distributed and Cuda

add distributed cases to unit tests
make distributed usable for both singleton (single agent) and space (multiagent) cases.
add attribute Net.cuda_id for device assignment (per network basis), and auto-calculate the cuda_id by trial and session index to distribute jobs
enable cuda and add GPU support for all algorithms, except for distributed (A3C, DPPO etc.)

Refactoring and Improvements

save() and load() now include network optimizers
refactor set_manual_seed to util
rename StackReplay to ConcatReplay for clarity
improve network training check of weights and grad norms
introduce BaseEnv as base class to OpenAIEnv and UnityEnv
optimize computations, major refactoring
update Dockerfile and release

kengz added 30 commits August 14, 2018 20:52

save and load optimizer too

4b63edf

retire useless net_util methods

9c026dc

implement distributed training applicable to all algorithms

aba294d

make lr decay log debug

ed4c461

Merge remote-tracking branch 'origin/master' into distributed

6aa0819

Merge remote-tracking branch 'origin/master' into distributed

7f6c73a

add "distributed" key to meta spec

c6e916c

fix ac global get net typo

2b72690

add distributed tests without creating extra spec files

fa1a4b8

update README

50eab9d

add parallelism to ci config

d139b9b

remove parallelism

dc69975

parametrize CI parallel test

fd94f56

mute dist tests for CI

6a2dd27

remove dupe

4416e74

Merge remote-tracking branch 'origin/master' into singleton

e1a8d52

make lab_api a plain decorator since there's no need to log

de77cd6

create simple singleton OpenAIEnv

fe5705d

refactor set_manual_seed

58ffa92

use safer net iter for algorithms

ed23cc9

retire util.is_episodic

1cb0312

create singleton session, agent, body for the mean time

81e1a46

define random seed at info_space level

aec1e80

add agent.save method

3daa820

turn PG methods into singleton

2161647

update spec construction of aeb size

c1e4b1d

use tuple in aeb_space.add for efficiency

9232e60

add singleton data to aeb_space

f155b8b

init aeb_space properly

21e26db

fix data epi count from 1, time count offset

08f65e7

kengz added 2 commits September 2, 2018 22:42

enable cuda; properly set device in math_util

c20a812

properly place device in tensor init; remove cpu()

0c34d22

kengz force-pushed the singleton branch from 848becc to 0c34d22 Compare September 3, 2018 05:58

kengz added 24 commits September 2, 2018 23:09

safer retain graph in PPO

cbeeb75

make sil rets calc proper

1d3a8b7

fix key

32462ee

restore sil, make calc_returns work for numpy too

d74769a

revert retain graph

028092f

fix PPO ratio detach

688d4bb

fix SIL log prob detach

d6c64cd

mute distributed test for now

35f93bd

remove redundant set device in actor_critic

8e8f443

remove all redundant to(device)

0012ccc

fix hydra logprob

357acf2

fix typo

d3e5af5

set proper device in policy util

a769dd3

use algo.net to set

e0f19e4

set action_a device

24a3fa4

proper device for argmax dist

c918f0b

mute hydra eps test now

a46a8db

restpre. to fix DQN

29945b0

set device in rand action uniform dist

d19a1d3

fix policy rand dist

d82b65d

remove like device

25c3733

remove final redundant device spec

35855b6

refactor get_git_sha, add close_fds to subprocess

49ed85d

increase CI test timeout

464d86d

kengz merged commit 861657d into master Sep 3, 2018

kengz deleted the singleton branch September 3, 2018 16:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v2.0.0 Singleton mode #153

v2.0.0 Singleton mode #153

kengz commented Sep 1, 2018 •

edited

Loading

v2.0.0 Singleton mode #153

v2.0.0 Singleton mode #153

Conversation

kengz commented Sep 1, 2018 • edited Loading

Singleton Mode as Default

Distributed and Cuda

Refactoring and Improvements

kengz commented Sep 1, 2018 •

edited

Loading