implement generalized policy iteration in `ValueFunction` #40

sritchie · 2019-10-08T14:08:53Z

Currently, `ValueFunction` supports value iteration, but we don't have a way to decide what to do within a sweep, at the end of each sweep, or across sweeps.

One idea would be to hard-code specific optimizations. Another would be to write a set of functions that spell out what happens at each level (within a sweep, at the end of a sweep, and across sweeps).
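
To make the second idea concrete, here is a rough Python sketch of generalized policy iteration split into one function per level. It is not this repo's `ValueFunction` API. Every name in it (`Model`, `q_value`, `evaluation_sweep`, `improve`, `gpi`, `sweeps_per_improvement`) is made up for illustration, and it assumes a small tabular MDP whose dynamics are fully known.

```python
from typing import Dict, List, Tuple

State = int
Action = str
# Hypothetical model: model[state][action] -> [(probability, next_state, reward)].
# A state with no actions is treated as terminal.
Model = Dict[State, Dict[Action, List[Tuple[float, State, float]]]]


def q_value(model: Model, values: Dict[State, float],
            state: State, action: Action, gamma: float) -> float:
    """Expected one-step return of taking `action` in `state`."""
    return sum(p * (r + gamma * values[s2]) for p, s2, r in model[state][action])


def evaluation_sweep(model: Model, values: Dict[State, float],
                     policy: Dict[State, Action], gamma: float) -> float:
    """Within a sweep: back up every state in place; return the largest change."""
    delta = 0.0
    for state, actions in model.items():
        if not actions:
            continue
        new_value = q_value(model, values, state, policy[state], gamma)
        delta = max(delta, abs(new_value - values[state]))
        values[state] = new_value
    return delta


def improve(model: Model, values: Dict[State, float],
            policy: Dict[State, Action], gamma: float) -> bool:
    """End of a round of sweeps: act greedily; return True if nothing changed."""
    stable = True
    for state, actions in model.items():
        if not actions:
            continue
        best = max(actions, key=lambda a: q_value(model, values, state, a, gamma))
        if best != policy[state]:
            policy[state] = best
            stable = False
    return stable


def gpi(model: Model, gamma: float = 0.9, sweeps_per_improvement: int = 1,
        tolerance: float = 1e-8, max_rounds: int = 1000):
    """Across sweeps: alternate evaluation and improvement until both settle."""
    values = {state: 0.0 for state in model}
    policy = {s: next(iter(acts)) for s, acts in model.items() if acts}
    for _round in range(max_rounds):
        delta = 0.0
        for _sweep in range(sweeps_per_improvement):
            delta = evaluation_sweep(model, values, policy, gamma)
            if delta < tolerance:
                break
        if improve(model, values, policy, gamma) and delta < tolerance:
            break
    return values, policy


# Toy chain: 0 <-> 1 -> 2 (terminal); only the step 1 -> 2 pays a reward of 1.
chain: Model = {
    0: {"left": [(1.0, 0, 0.0)], "right": [(1.0, 1, 0.0)]},
    1: {"left": [(1.0, 0, 0.0)], "right": [(1.0, 2, 1.0)]},
    2: {},
}
values, policy = gpi(chain, sweeps_per_improvement=1)
```

With `sweeps_per_improvement=1` the loop behaves much like value iteration. A large value moves it toward classic policy iteration, since evaluation runs close to convergence before each improvement step. The three functions line up with the three decision points named above, so each one could be replaced independently.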
