Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fixed findmax unconsistency #521

Conversation

3rdCore
Copy link
Contributor

@3rdCore 3rdCore commented Oct 7, 2021

Following the discussion of the Issue #520, this is my proposition of solution for the bug on the Base.findmax function due to the recent changes on Julia Core package and JuliaLang/Compat.jl#748. (The find_all_max is not to be impacted by the recent changes)

Originally, depending if the explorer with return the greedy choice or a random choice, the output was respectively :

  • for the greedy choice : the index of the selected value in the subset of the authorized values, instead of the original set of values.
  • for the random choice : the index of the selected value in the original set of values.

Copy link
Member

@findmyway findmyway left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks!

Added a suggestion for a more readable version.

src/ReinforcementLearningCore/src/utils/base.jl Outdated Show resolved Hide resolved
@findmyway findmyway linked an issue Oct 7, 2021 that may be closed by this pull request
@findmyway findmyway merged commit c6c5053 into JuliaReinforcementLearning:master Oct 8, 2021
@findmyway
Copy link
Member

@all-contributors please add @3rdCore for bug and code.

@allcontributors
Copy link
Contributor

@findmyway

I've put up a pull request to add @3rdCore! 🎉

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Inconsistency of the EpsilonGreedyExplorer selection function
2 participants