Skip to content

Updating FusedAdamW epsilon to Habana default 1e-6#22

Merged
regisss merged 1 commit into
mainfrom
fix_default_fusedadamW_params
Apr 11, 2022
Merged

Updating FusedAdamW epsilon to Habana default 1e-6#22
regisss merged 1 commit into
mainfrom
fix_default_fusedadamW_params

Conversation

@MohitIntel
Copy link
Copy Markdown
Contributor

What does this PR do?

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

Copy link
Copy Markdown
Collaborator

@regisss regisss left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I propose a small change so that only the default value is modified.

Also, not related to optimum-habana, would it be possible to update the documentation of Gaudi so that it is explicitly mentioned that the default value of epsilon is not the same one as for torch.optim.AdamW? Because here the doc says to refer to torch.optim.AdamW, which is misleading since there the default value of espilon is 1e-8.

Comment thread optimum/habana/training_args.py Outdated
@MohitIntel MohitIntel force-pushed the fix_default_fusedadamW_params branch from 9f13b92 to 81c6b3a Compare April 11, 2022 19:55
@regisss regisss merged commit c6b7e73 into main Apr 11, 2022
@libinta
Copy link
Copy Markdown
Collaborator

libinta commented Apr 11, 2022

@regisss we will update our doc regarding the default value. Thanks

@MohitIntel MohitIntel deleted the fix_default_fusedadamW_params branch April 19, 2022 19:43
regisss pushed a commit that referenced this pull request Nov 26, 2024
Co-authored-by: Urszula Golowicz <urszula.golowicz@intel.com>
MohitIntel pushed a commit that referenced this pull request Nov 26, 2024
Co-authored-by: Urszula Golowicz <urszula.golowicz@intel.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants