Sciweavers

NN
2000
Springer

Local minima and plateaus in hierarchical structures of multilayer perceptrons

13 years 11 months ago
Local minima and plateaus in hierarchical structures of multilayer perceptrons
Local minima and plateaus pose a serious problem in learning of neural networks. We investigate the hierarchical geometric structure of the parameter space of three-layer perceptrons in order to show the existence of local minima and plateaus. It is proved that a critical point of the model with H ;1 hidden units always gives many critical points of the model with H hidden units. These critical points consist of many lines in the parameter space, which can cause plateaus in learning of neural networks. Based on this result, we prove that a point in the critical lines corresponding to the global minimum of the smaller model can be a local minimum or a saddle point of the larger model. We give a necessary and su cient condition for this, and show that this kind of local minima exist as a line segment if any. The results are universal in the sense that they do not require special properties of the target, loss functions, and activation functions, but only use the hierarchical structure o...
Kenji Fukumizu, Shun-ichi Amari
Added 19 Dec 2010
Updated 19 Dec 2010
Type Journal
Year 2000
Where NN
Authors Kenji Fukumizu, Shun-ichi Amari
Comments (0)