Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Draft: DroQ and TD3+TQC jax implementation #272

Draft
wants to merge 31 commits into
base: master
Choose a base branch
from
Draft
Show file tree
Hide file tree
Changes from 21 commits
Commits
Show all changes
31 commits
Select commit Hold shift + click to select a range
33df709
Clone file
araffin Sep 6, 2022
6ed7655
Fixes and reformating
araffin Sep 6, 2022
6a289ca
Add dropout and layernorm
araffin Sep 6, 2022
d3ef56b
Add evaluation and tqdm progress bar
araffin Sep 16, 2022
9704f1d
Different dropout keys
araffin Sep 16, 2022
f0cc8ff
Separate q network target update
araffin Sep 16, 2022
23e4d3b
Try to jit the for loop
araffin Sep 16, 2022
df61ae5
Add no jit train version
araffin Sep 16, 2022
f7b4e7c
Revert "Add no jit train version"
araffin Sep 16, 2022
373aabb
Revert "Try to jit the for loop"
araffin Sep 16, 2022
85fa143
Revert "Separate q network target update"
araffin Sep 16, 2022
60f63e1
TQC + TD3 + DroQ first attempt
araffin Sep 16, 2022
44f3a9b
Add number of quantiles to drop as param
araffin Sep 16, 2022
5156d78
Fixes and reformat
araffin Sep 16, 2022
8aaca4f
n_units as param
araffin Sep 16, 2022
aabf789
Add train method
araffin Sep 17, 2022
cc74d9e
JIT train loop
araffin Sep 17, 2022
8058979
Debug jit
araffin Sep 18, 2022
99686c8
Cleanup + faster eval
araffin Sep 18, 2022
d5704b3
Try ADAN
araffin Sep 18, 2022
047c314
Revert "Try ADAN"
araffin Sep 18, 2022
443dc71
Revert "Revert "Try ADAN""
araffin Sep 18, 2022
8f3beec
Sort important and Try ADAN again
araffin Sep 18, 2022
940a4b6
Back to ADAM
araffin Sep 18, 2022
bcfee18
Rename file
araffin Sep 19, 2022
d68b262
Add fast eval for TD3 + DroQo
araffin Sep 19, 2022
70aa57d
Add buggy sac implementation
araffin Sep 23, 2022
21361c3
Bug fixes and faster sampling (still not working)
araffin Sep 23, 2022
c883386
Bug fixes, SAC now workingo
araffin Sep 23, 2022
f455b4e
Cleanup
araffin Sep 23, 2022
7eb2c4f
Match DroQ implementation
araffin Sep 24, 2022
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Loading