You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The bug that some examples use the same random seed across envs for env.seed is fixed.
The bug that batch training with n-step return and/or recurrent models is not successful is fixed.
The bug that examples/ale/train_dqn_ale.py uses LinearDecayEpsilonGreedy even when NoisyNet is used is fixed.
The bug that examples/ale/train_dqn_ale.py does not use the value specified by --noisy-net-sigma is fixed.
The bug that chainerrl.links.to_factorized_noisy does not work correctly with chainerrl.links.Sequence is fixed.
Important destructive changes
chainerrl.experiments.train_agent_async now requires eval_n_steps (number of timesteps for each evaluation phase) and eval_n_episodes (number of episodes for each evaluation phase) to be explicitly specified, with one of them being None.
examples/ale/dqn_phi.py is removed.
chainerrl.initializers.LeCunNormal is removed. Use chainer.initializers.LeCunNormal instead.