[BC] Update explore_at_state_generator with new exploration strategy (#389)

Update the exploration strategy as discussed with @alekh. The strategy works as follows: at the current exploration state, the exploration policy returns an array of logits. The logit of the action currently taken is set to -np.Inf, and an exploration action is sampled from the remaining available actions.

This change required updating the tests for ModuleExplorerWorker. It also adds module_explorer_type to ModuleWorker, so the exploration strategy can be overridden by overriding the ModuleExplorer class.
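
The masking-and-sampling step described above can be sketched roughly as follows. This is a minimal illustration, not the code from this commit: the function name sample_exploration_action and its parameters are hypothetical, it assumes the logits are turned into probabilities with a softmax (the commit message does not say how the sampling is done), and it uses float("-inf") from the standard library in place of -np.Inf so that it runs without NumPy.

```python
import math
import random


def sample_exploration_action(logits, current_action, rng=None):
    """Sample an action other than `current_action`.

    Hypothetical helper for illustration only; it is not part of the
    repository's API. Assumes softmax sampling over the masked logits.
    """
    if len(logits) < 2:
        raise ValueError("need at least two actions to explore")
    rng = rng or random.Random()

    # Mask the action currently taken so it can never be sampled.
    masked = list(logits)
    masked[current_action] = float("-inf")

    peak = max(masked)
    if peak == float("-inf"):
        raise ValueError("no action left to sample after masking")

    # Numerically stable softmax weights: exp(-inf) == 0.0, so the
    # masked action receives exactly zero weight.
    weights = [math.exp(x - peak) for x in masked]

    # random.choices normalizes the weights internally.
    return rng.choices(range(len(masked)), weights=weights, k=1)[0]
```

With logits [0.5, 2.0, -1.0] and current action 1, this sketch only ever returns 0 or 2, with action 0 favored because its logit is higher.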
Showing 2 changed files with 40 additions and 22 deletions.