Skip to content

Component locks; virtualenv/conda installation support

Compare
Choose a tag to compare
@kengz kengz released this 15 Apr 21:53
· 126 commits to master since this release

Component Locks

PR: #120

We have a lot of components, and not all of them are compatible with another. When scheduling experiments and designing specs it is hard to keep all of them in check. This adds a component locks that does automatic checking of all specs when importing, by using the specified locks in rl/spec/component_locks.json. Uses the minimum description length design principle. When adding new components, be sure to update this file.

  • add double-network component lock
  • add discrete-action component lock; assume continuous agent can handle discrete action spaces as a generalization

Improved Installation

PR: #121
Solves: #113, #114, #115

  • fix broken gym installation. See gym PR 558
  • layout installation steps in doc, use binaries for server setup
  • introduce version lock for dependencies with requirements.txt, environment.yml
  • support installation by system python, virtualenv, conda, integrate into Grunt
  • add quickstart_dqn for example quickstart in doc

Bug Fixes

DoubleDQN

PR: #119

  • restore missing recompile_model call to the second model in DoubleDQN.