
Allow environment to set next action #57

Closed
mryellow wants to merge 2 commits

Conversation

@mryellow (Contributor) commented Sep 2, 2016

No description provided.

@mryellow (Contributor, Author) commented Sep 2, 2016

This might also help with Hierarchical DQN #9, ignoring the complexities of storing skill state in experiences.

The environment could be executing a skill network and feeding back to the core that it is repeating the skill action (or, if desired, the individual actions being taken as part of the skill).
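A minimal sketch of what that feedback could look like, assuming an rlenvs-style `step`; `self.skill`, `self.skillStepsLeft`, the internal `act` helper, and the extra return value are all illustrative, not part of the real API:

```lua
-- Sketch only: while a skill is active the environment overrides the agent's
-- chosen action and reports back the action actually taken, so the core can
-- store the correct action in its experiences.
function Env:step(action)
  local actionTaken = nil
  if self.skillStepsLeft and self.skillStepsLeft > 0 then
    actionTaken = self.skill:act(self.screen) -- primitive action from the skill network
    self.skillStepsLeft = self.skillStepsLeft - 1
    action = actionTaken
  end
  local reward, observation, terminal = self:act(action) -- illustrative internal helper
  return reward, observation, terminal, actionTaken -- nil when nothing was forced
end
```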

@mryellow (Contributor, Author) commented Sep 2, 2016

p.s. This isn't implemented for validation agents. For my use-case it makes sense for them to avoid any hard-coded behaviours and instead give metrics on what has actually been learnt.

For Hierarchical DQN, or any kind of skill-execution setup, you'd want the validation agents to act in the same way, with the environment having some control over what is being scored. Perhaps that is best done in the environment, with the validation agent being detected somehow and told to work differently if desired.
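One possible way to detect that, sketched below assuming the rlenvs-style `training()`/`evaluate()` mode switch; the `mode` field and `forcedAction` helper are made up for illustration:

```lua
-- Sketch: track the mode set by the agent and only force actions in training,
-- so validation agents are scored purely on learnt behaviour.
function Env:training()
  self.mode = 'training'
end

function Env:evaluate()
  self.mode = 'evaluate'
end

function Env:forcedAction()
  if self.mode ~= 'training' then
    return nil -- validation: never hard-code behaviour
  end
  -- ... skill/override logic would go here ...
end
```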

@mryellow (Contributor, Author) commented Sep 3, 2016

https://github.com/mryellow/dqn_assets/blob/85a90375f349b37399a6f2ecf2d47ac25f697f66/rlenvs/Kulbabu.lua#L185-L195

An implementation: if the agent hasn't moved "greatly" within a "reasonable" time, repeat either a left or right turn for a number of seconds.

edit: Updated it to only run when training. p.s. Noticed `training`/`evaluate` aren't documented in the rlenvs README.
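For reference, a rough sketch of that stuck-detection logic; the thresholds, action indices, and field names are approximations for illustration, not copied from the linked code:

```lua
-- Sketch: if the robot hasn't moved beyond a distance threshold within a
-- check interval, force a random left/right turn for a fixed number of steps.
-- Only applies while training, so evaluation is unaffected.
local LEFT, RIGHT = 1, 2 -- illustrative action indices
local CHECK_INTERVAL, MOVE_THRESHOLD, TURN_STEPS = 100, 0.05, 20

if self.mode == 'training' and self.stepCount - self.lastCheck > CHECK_INTERVAL then
  local dist = math.sqrt((self.x - self.lastX)^2 + (self.y - self.lastY)^2)
  if dist < MOVE_THRESHOLD then
    -- Stuck: pick a turn direction and repeat it for TURN_STEPS steps
    self.forceAction = math.random(2) == 1 and LEFT or RIGHT
    self.forceSteps = TURN_STEPS
  end
  self.lastX, self.lastY, self.lastCheck = self.x, self.y, self.stepCount
end
```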

@Kaixhin (Owner) commented Sep 3, 2016

I realise that this is more efficient than overwriting an action the agent has already decided upon, but I worry that it's too restrictive. This forcing is conditional only on the state, whereas the previous approach was conditional on both the state and the action.

For a real example that I am working with, take a crane game. The agent should be allowed to move as it wants, but we would like to stop it from executing the expensive action of dropping the claw when it's far from its target.
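A sketch of that state-and-action-conditional filtering for the crane example; the action indices, `distanceToTarget` helper, and `dropRadius` field are invented for illustration:

```lua
-- Sketch: movement actions pass through unchanged; the expensive DROP action
-- is replaced with a no-op whenever the claw is far from its target.
local NOOP, DROP = 1, 4 -- illustrative action indices

function Env:filterAction(action)
  if action == DROP and self:distanceToTarget() > self.dropRadius then
    return NOOP -- veto the drop, but leave movement decisions to the agent
  end
  return action
end
```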

@mryellow closed this Sep 3, 2016