Add amsgrad optimizer #382
Conversation
Hi @mkunesch, any chance you could review this? Thanks.
@merajhashemi thanks for the PR!
(Branch force-pushed from 655720b to d5d7260.)
@mtthss Done! Could you run the CI again?
optax/_src/alias.py (Outdated)
@@ -189,6 +189,7 @@ def adam(
     b2: float = 0.999,
     eps: float = 1e-8,
     eps_root: float = 0.0,
+    amsgrad: bool = False,
Let's introduce a separate alias amsgrad for this rather than adding a flag to Adam. My thinking is that there will be other improvements/modifications to Adam in the future, and we should avoid an accumulation of options in the simple adam setup. Furthermore, I think it would make it easier for users to find amsgrad in optax, and more obvious in the code that amsgrad is being used.
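A minimal sketch of what such a standalone amsgrad alias could look like, written in optax's GradientTransformation style. All names here (scale_by_amsgrad, ScaleByAmsgradState) are illustrative, not necessarily the API this PR ends up adding:

```python
from typing import Any, NamedTuple

import jax
import jax.numpy as jnp
import optax


class ScaleByAmsgradState(NamedTuple):
  count: jnp.ndarray  # Step counter for bias correction.
  mu: Any             # First-moment estimate (same tree structure as params).
  nu: Any             # Second-moment estimate.
  nu_max: Any         # Running elementwise max of second-moment estimates.


def scale_by_amsgrad(b1=0.9, b2=0.999, eps=1e-8, eps_root=0.0):
  def init_fn(params):
    zeros = lambda: jax.tree_util.tree_map(jnp.zeros_like, params)
    return ScaleByAmsgradState(
        count=jnp.zeros([], jnp.int32), mu=zeros(), nu=zeros(), nu_max=zeros())

  def update_fn(updates, state, params=None):
    del params
    mu = jax.tree_util.tree_map(
        lambda g, m: b1 * m + (1 - b1) * g, updates, state.mu)
    nu = jax.tree_util.tree_map(
        lambda g, v: b2 * v + (1 - b2) * jnp.square(g), updates, state.nu)
    count = state.count + 1
    mu_hat = jax.tree_util.tree_map(lambda m: m / (1 - b1**count), mu)
    nu_hat = jax.tree_util.tree_map(lambda v: v / (1 - b2**count), nu)
    # The AMSGrad twist: never let the second-moment estimate decrease,
    # which is what restores the convergence guarantee Adam lacks.
    nu_max = jax.tree_util.tree_map(jnp.maximum, state.nu_max, nu_hat)
    new_updates = jax.tree_util.tree_map(
        lambda m, v: m / (jnp.sqrt(v + eps_root) + eps), mu_hat, nu_max)
    return new_updates, ScaleByAmsgradState(
        count=count, mu=mu, nu=nu, nu_max=nu_max)

  return optax.GradientTransformation(init_fn, update_fn)


def amsgrad(learning_rate, b1=0.9, b2=0.999, eps=1e-8, eps_root=0.0):
  # Mirrors the structure of optax.adam: moment transform + learning-rate scale.
  return optax.chain(
      scale_by_amsgrad(b1=b1, b2=b2, eps=eps, eps_root=eps_root),
      optax.scale(-learning_rate),
  )
```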
(sorry, I pressed send too early)
Thank you so much for this contribution! This is excellent!
I've just added very minor comments.
Could you add a test for this PR? The easiest thing might be to just add amsgrad to the optimizer list in alias_test.py so that it gets tested on a parabola. It might also be nice to test the nu_max behavior explicitly in transform_test.py, but the parabola test is the more important one. (A sketch of what such additions could look like follows this comment.)
Also, just to say that we will only be able to merge this after the ICLR deadline, but we can approve it before then so that we can merge immediately on the 29th of September.
Thanks a lot again for this excellent contribution!
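A hedged sketch of the test additions being suggested. The list name _OPTIMIZERS_UNDER_TEST and the entry format are assumptions about alias_test.py's structure, and the nu_max check is purely illustrative:

```python
import jax.numpy as jnp

# Hypothetical entry format for the optimizer list in alias_test.py; the
# actual list name and kwargs in the test file may differ.
_OPTIMIZERS_UNDER_TEST = (
    dict(opt_name='adam', opt_kwargs=dict(learning_rate=1e-1)),
    # New entry so amsgrad is exercised by the existing parabola-fitting test:
    dict(opt_name='amsgrad', opt_kwargs=dict(learning_rate=1e-1)),
)


def test_nu_max_is_monotone():
  # Illustrative explicit nu_max check for transform_test.py: the max
  # accumulator must never decrease between steps. Uses the transform
  # sketched earlier in this thread.
  tx = scale_by_amsgrad()
  params = {'w': jnp.zeros(3)}
  state = tx.init(params)
  prev = state.nu_max['w']
  for g in ([1.0, 2.0, 3.0], [0.1, 0.1, 0.1]):
    _, state = tx.update({'w': jnp.asarray(g)}, state, params)
    assert bool(jnp.all(state.nu_max['w'] >= prev))
    prev = state.nu_max['w']
```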
Hi @merajhashemi, thanks a lot for your contribution! Any chance you could address the comments made by @mkunesch, so we can merge this PR in the next version of Optax? Thanks!
Actually, the changes are quite minimal (introducing a separate alias + adding it to the test). I'd be happy to approve and make these changes upon merging if that's okay with you @merajhashemi?
Hi! Ah, thanks a lot for that context - that makes a lot of sense. I'm probably still leaning towards splitting, as we have generally tried to avoid boolean flags in optax for a while now, and we don't mirror PyTorch and TensorFlow in other optimizers either. @hbq1: was your 👍 a vote for leaving it as an argument or for splitting it? Thanks a lot!
I like the idea of splitting it into a separate optimiser for clarity 👍
Looks great to me! Thanks a lot for splitting the optimizer from adam.
(I'll make some very minor formatting edits and fix the conflicts with master as I merge the PR)
Thanks a lot for the contribution again!
(sorry for the 3 commits to your branch - I had to merge master before importing the PR and I messed up the merge in the GitHub editor)
Hi,
This PR implements AMSGrad (Reddi et al., 2018, "On the Convergence of Adam and Beyond"), an extension to Adam that improves its convergence properties.
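For reference, this is the standard statement of the AMSGrad update from the Reddi et al. paper (the paper omits bias correction; the $\epsilon$ term is the usual practical addition for numerical stability). It differs from Adam only in the max step over the second-moment estimate:

$$
m_t = \beta_1 m_{t-1} + (1-\beta_1)\, g_t, \qquad
v_t = \beta_2 v_{t-1} + (1-\beta_2)\, g_t^2,
$$
$$
\hat v_t = \max(\hat v_{t-1},\, v_t), \qquad
\theta_{t+1} = \theta_t - \frac{\alpha_t\, m_t}{\sqrt{\hat v_t} + \epsilon}.
$$

Because $\hat v_t$ is non-decreasing, the effective per-coordinate step size can never grow between iterations, which is the property that fixes Adam's convergence counterexamples.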