We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Traceback (most recent call last): File "test_agent_kerasrl.py", line 59, in dqn_success, _ = evaluate( model_policy(dqn) ) File "test_agent_kerasrl.py", line 23, in evaluate action = action_function( bytez ) File "test_agent_kerasrl.py", line 53, in f action_index = boltzmann_action( q_values ) # alternative: best_action File "test_agent_kerasrl.py", line 42, in boltzmann_action = lambda x : np.argmax( np.random.multinomial( 1, softmax(x).flatten())) File "mtrand.pyx", line 4617, in mtrand.RandomState.multinomial (numpy/random/mtrand/mtrand.c:37769) ValueError: sum(pvals[:-1]) > 1.0
The text was updated successfully, but these errors were encountered:
update boltzmann_action definition
18c18b7
To solve the ValueError: sum(pvals[:-1]) > 1.0 as described in endgameinc#7
No branches or pull requests
Traceback (most recent call last):
File "test_agent_kerasrl.py", line 59, in
dqn_success, _ = evaluate( model_policy(dqn) )
File "test_agent_kerasrl.py", line 23, in evaluate
action = action_function( bytez )
File "test_agent_kerasrl.py", line 53, in f
action_index = boltzmann_action( q_values ) # alternative: best_action
File "test_agent_kerasrl.py", line 42, in
boltzmann_action = lambda x : np.argmax( np.random.multinomial( 1, softmax(x).flatten()))
File "mtrand.pyx", line 4617, in mtrand.RandomState.multinomial (numpy/random/mtrand/mtrand.c:37769)
ValueError: sum(pvals[:-1]) > 1.0
The text was updated successfully, but these errors were encountered: