eps for Adam #29

Open
jsuit opened this issue Aug 29, 2018 · 2 comments

Comments

jsuit commented Aug 29, 2018

Is there a reason why the default for eps in the Adam optimizer is so high? Currently, it is 1e-3 [line 104 in shared_optim.py]. Usually, it's around 1e-08. Just wanted to see whether this was done intentionally (e.g., it works better than a lower value) or not.

@dgriff777
Owner

An epsilon of 1e-3 is actually often my default choice for the Adam optimizer, and I find it helps with stability. Although 1e-08 is often listed as the default for Adam, it's not a strongly recommended value. It's commonly known not to be the best choice in many cases, and in my experience it has never been the best choice in my various use cases.
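To illustrate the stability point, here is a minimal sketch of a single Adam update on one scalar parameter, using the update rule as given in the Adam paper. It is not the code from shared_optim.py; the function name `adam_first_step`, the learning rate of 1e-3, the betas, and the gradient value 1e-6 are all assumptions chosen for illustration.

```python
import math

def adam_first_step(grad, lr=1e-3, eps=1e-8, beta1=0.9, beta2=0.999):
    """Size of the first Adam update (t = 1) for one scalar parameter.

    Hypothetical helper for illustration; hyperparameters are assumed values.
    """
    m = (1 - beta1) * grad           # first-moment (mean) estimate
    v = (1 - beta2) * grad * grad    # second-moment (uncentered variance) estimate
    m_hat = m / (1 - beta1)          # bias correction at t = 1
    v_hat = v / (1 - beta2)
    return lr * m_hat / (math.sqrt(v_hat) + eps)

tiny_grad = 1e-6  # assumed noise-level gradient
step_small_eps = adam_first_step(tiny_grad, eps=1e-8)
step_large_eps = adam_first_step(tiny_grad, eps=1e-3)
print(step_small_eps, step_large_eps)
```

With eps=1e-8 the step is close to the full learning rate even though the gradient is tiny, because the gradient is divided by its own magnitude. With eps=1e-3 the step for the same gradient is about a thousand times smaller, which is one plausible reason a larger eps can feel more stable.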

ppwwyyxx commented Nov 2, 2018

Also, in https://www.tensorflow.org/api_docs/python/tf/train/AdamOptimizer:

The default value of 1e-8 for epsilon might not be a good default in general. For example, when training an Inception network on ImageNet a current good choice is 1.0 or 0.1.
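
As a rough way to see why values as large as 1.0 or 0.1 change the optimizer's behavior, consider the update rule as written in the Adam paper (TensorFlow's implementation may place epsilon slightly differently, so treat this as a sketch). Here $\alpha$ is the learning rate and $\hat{m}_t$, $\hat{v}_t$ are the bias-corrected moment estimates:

$$
\theta_t = \theta_{t-1} - \alpha \, \frac{\hat{m}_t}{\sqrt{\hat{v}_t} + \epsilon}
$$

The two limiting cases are:

$$
\epsilon \ll \sqrt{\hat{v}_t}: \quad \Delta\theta_t \approx -\alpha \, \frac{\hat{m}_t}{\sqrt{\hat{v}_t}},
\qquad
\epsilon \gg \sqrt{\hat{v}_t}: \quad \Delta\theta_t \approx -\frac{\alpha}{\epsilon} \, \hat{m}_t
$$

So when epsilon dominates the denominator, the per-parameter scaling largely disappears and the update behaves more like SGD with momentum at an effective learning rate of $\alpha / \epsilon$.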
