unstable gradient #98

ih4cku · 2016-09-16T07:53:50Z

vanishing and exploding gradient / sensitivity

(must see) X. Glorot and Y. Bengio. Understanding the difficulty of trainingdeep feedforward neural networks. InAISTATS, 2010.
(must see) Pascanu, Razvan, Tomas Mikolov, and Yoshua Bengio. "On the difficulty of training recurrent neural networks." ICML (3) 28 (2013): 1310-1318.
Why are deep neural networks hard to train?

ih4cku · 2016-09-16T08:04:19Z

ih4cku · 2016-09-16T08:40:58Z

following works:

ih4cku added the research/deep_learning label Sep 16, 2016