Thank you for your interest! We haven't merged our latest version into the master branch yet.
Please check the dlstm_a2c folder on the dLSTM branch for the new version.
However, our code still doesn't work.
We're still working on it, so please keep an eye on our GitHub repo.
What do you mean by 'doesn't work'? Do you mean that the score doesn't increase after many episodes, or that the learning curves (i.e., policy loss, value loss) don't converge? I've designed my own universal HRL algorithm similar to FuN, yet it didn't even converge.
There is no difference between the two folders?