You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Im just training DQN and notice that most of training loops will use the range() function to iterate, and save in every 5 episode via a judgement if episode % 5 == 0: .
I guess you want to save the model after each 5 loops, but you know, range() will start at 0. So I suppose the judgement here should be if episode % 5 == 4: or if (episode+1) % 5 == 0:. :-)
The text was updated successfully, but these errors were encountered:
Hi,
Im just training DQN and notice that most of training loops will use the
range()
function to iterate, and save in every 5 episode via a judgementif episode % 5 == 0:
.I guess you want to save the model after each 5 loops, but you know,
range()
will start at 0. So I suppose the judgement here should beif episode % 5 == 4:
orif (episode+1) % 5 == 0:
. :-)The text was updated successfully, but these errors were encountered: