unstable gradient #98

ih4cku opened this issue Sep 16, 2016 · 2 comments

ih4cku commented Sep 16, 2016

vanishing and exploding gradient / sensitivity

  • (must see) X. Glorot and Y. Bengio. Understanding the difficulty of training deep feedforward neural networks. In AISTATS, 2010.
  • (must see) Pascanu, Razvan, Tomas Mikolov, and Yoshua Bengio. "On the difficulty of training recurrent neural networks." ICML (3) 28 (2013): 1310-1318.
  • Why are deep neural networks hard to train?
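A minimal sketch of the vanishing-gradient effect these references discuss, assuming a toy chain of scalar sigmoid units that all share one weight (the function name `chain_gradient` and its parameters are hypothetical, chosen only for illustration). By the chain rule each layer multiplies the gradient by `w * sigmoid'(z)`, and `sigmoid'(z)` never exceeds 0.25, so with `|w| <= 1` the product shrinks at least geometrically with depth. If the per-layer factor were larger than 1 in magnitude, the same product would grow instead.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def chain_gradient(weight, depth, x=0.5, bias=0.0):
    """Return d(output)/d(input) through `depth` scalar sigmoid layers.

    Toy assumption: every layer uses the same weight and bias.
    """
    grad = 1.0
    a = x
    for _ in range(depth):
        z = weight * a + bias
        a = sigmoid(z)
        # Chain rule: each layer contributes a factor w * sigmoid'(z),
        # where sigmoid'(z) = a * (1 - a) <= 0.25.
        grad *= weight * a * (1.0 - a)
    return grad


if __name__ == "__main__":
    for depth in (1, 5, 10, 20):
        print(depth, chain_gradient(1.0, depth))
```

The printed gradient drops by roughly a factor of four or more per layer, which is why early layers in a deep sigmoid network receive very small updates.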

ih4cku commented Sep 16, 2016

weight initialization
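As one concrete example of an initialization scheme, here is a hedged sketch of the uniform "Glorot" (Xavier) initializer from the Glorot and Bengio paper listed above. It draws weights from U(-limit, limit) with limit = sqrt(6 / (fan_in + fan_out)), which gives a variance of 2 / (fan_in + fan_out). The helper names `glorot_uniform` and `variance` and the layer sizes are assumptions made for illustration, not something taken from this issue.

```python
import math
import random


def glorot_uniform(fan_in, fan_out, rng=None):
    """Return a fan_in x fan_out matrix (list of rows) drawn from U(-limit, limit)."""
    rng = rng or random.Random(0)  # fixed seed so the sketch is reproducible
    limit = math.sqrt(6.0 / (fan_in + fan_out))
    return [[rng.uniform(-limit, limit) for _ in range(fan_out)]
            for _ in range(fan_in)]


def variance(matrix):
    """Population variance over all entries of the matrix."""
    values = [w for row in matrix for w in row]
    mean = sum(values) / len(values)
    return sum((w - mean) ** 2 for w in values) / len(values)


if __name__ == "__main__":
    weights = glorot_uniform(256, 128)
    print("empirical variance:", variance(weights))
    print("target variance:   ", 2.0 / (256 + 128))
```

Scaling the range by both fan-in and fan-out is meant to keep activation and gradient variances roughly constant from layer to layer, which ties this topic back to the unstable-gradient problem.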

ih4cku commented Sep 16, 2016

Highway and residual networks

follow-up work:

  • FractalNet
  • Resnet in Resnet
  • Deep Networks with Stochastic Depth
