Impl adam_w #2957
Restarted the CI. @et16kr, please resolve the merge conflict.
driver/adam_driver.hpp
template <typename Tgpu, typename Tref, bool adamw, bool is_amp, typename Tgrad>
int AdamDriver<Tgpu, Tref, adamw, is_amp, Tgrad>::ParseCmdLineArgs(int argc, char* argv[])
I missed the previous PR, but since a variable has been added here, I have to ask: is it necessary to have all of those variables as template parameters? At least `is_amp` and `adamw` should be runtime parameters to avoid code bloat and long compilation times.
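To illustrate the suggestion, here is a minimal CPU-side sketch in which only the storage types stay as template parameters while `adamw` and `is_amp` become runtime fields. `AdamOptions`, `AdamStep`, and `grad_scale` are hypothetical names invented for this example, not part of the driver in this PR, and the AMP handling (dividing the gradient by a scale factor) is an assumption about how the flag would be used.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical runtime configuration (illustrative only, not the driver's API).
struct AdamOptions
{
    float lr           = 1e-3f;
    float beta1        = 0.9f;
    float beta2        = 0.999f;
    float eps          = 1e-8f;
    float weight_decay = 0.0f;
    bool adamw         = false; // true: decoupled weight decay (AdamW)
    bool is_amp        = false; // true: gradients are assumed scaled by grad_scale
    float grad_scale   = 1.0f;  // assumed AMP loss-scale factor
};

// Only the parameter and gradient storage types are template parameters,
// so toggling adamw or is_amp does not create extra instantiations.
template <typename Tparam, typename Tgrad>
void AdamStep(std::vector<Tparam>& param,
              const std::vector<Tgrad>& grad,
              std::vector<float>& exp_avg,
              std::vector<float>& exp_avg_sq,
              int step,
              const AdamOptions& opt)
{
    assert(step >= 1);
    assert(param.size() == grad.size());
    assert(param.size() == exp_avg.size());
    assert(param.size() == exp_avg_sq.size());

    // Bias-correction terms for the first and second moments.
    const float bc1 = 1.0f - static_cast<float>(std::pow(opt.beta1, step));
    const float bc2 = 1.0f - static_cast<float>(std::pow(opt.beta2, step));

    for(std::size_t i = 0; i < param.size(); ++i)
    {
        float p = static_cast<float>(param[i]);
        float g = static_cast<float>(grad[i]);

        if(opt.is_amp)
            g /= opt.grad_scale; // undo the assumed loss scaling

        if(opt.adamw)
            p -= opt.lr * opt.weight_decay * p; // decoupled weight decay
        else
            g += opt.weight_decay * p; // classic L2 penalty folded into the gradient

        exp_avg[i]    = opt.beta1 * exp_avg[i] + (1.0f - opt.beta1) * g;
        exp_avg_sq[i] = opt.beta2 * exp_avg_sq[i] + (1.0f - opt.beta2) * g * g;

        const float denom = std::sqrt(exp_avg_sq[i]) / std::sqrt(bc2) + opt.eps;
        p -= (opt.lr / bc1) * exp_avg[i] / denom;

        param[i] = static_cast<Tparam>(p);
    }
}
```

With this shape, the float16, float32, and AMP variants differ only in `Tparam`/`Tgrad`, and each one can run as Adam or AdamW by setting a field on `AdamOptions` at run time. The branches sit inside the loop for readability; hoisting them out would be a reasonable refinement if they showed up in profiling.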
adam float16
adam float32
amp adam (parameter float32, gradient float16)