Hidden Layer Training via Hessian Matrix Information

The output weight optimization-hidden weight optimization (OWO-HWO) algorithm for training the multilayer perceptron alternately updates the output weights and the hidden weights. This layer-by-layer training strategy greatly improves convergence speed. However, in HWO, the desired net function actually evolves in the gradient direction, which inevitably reduces efficiency. In this paper, two improvements to the OWO-HWO algorithm are presented. New desired net functions are proposed for hidden layer training, which use Hessian matrix information rather than gradients. A weighted hidden layer error function, taking saturation into consideration, is derived directly from the global error function. Both techniques greatly increase training speed. Faster convergence is verified by simulations with remote sensing data sets.
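As a rough illustration of the alternating scheme the abstract describes, the Python sketch below trains a tiny single-hidden-layer perceptron by switching between an output-weight step and a hidden-weight step. It is not the authors' implementation. The hidden-weight step uses the plain gradient-based desired net change, that is, the baseline behaviour the paper sets out to improve; the Hessian-based net functions and the weighted hidden layer error function are not reproduced, because their formulas do not appear on this page. The function names (solve, least_squares, owo, hwo), the learning factor z, the ridge term, the network size, and the toy sine data are all assumptions made for illustration.

```python
import math
import random

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[piv] = M[piv], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x

def least_squares(rows, targets, ridge=1e-4):
    """Minimize sum (t - w.x)^2 via normal equations (small ridge: assumption)."""
    n = len(rows[0])
    R = [[sum(x[i] * x[j] for x in rows) + (ridge if i == j else 0.0)
          for j in range(n)] for i in range(n)]
    c = [sum(x[i] * t for x, t in zip(rows, targets)) for i in range(n)]
    return solve(R, c)

def sigmoid(v):
    # Overflow-safe form of 1 / (1 + exp(-v)).
    return 0.5 * (1.0 + math.tanh(0.5 * v))

def forward(xa, W_h, w_o):
    """Return hidden net values, output-layer features [1, O_1..O_Nh], and output y."""
    nets = [sum(w * x for w, x in zip(wk, xa)) for wk in W_h]
    feats = [1.0] + [sigmoid(n) for n in nets]
    y = sum(w * f for w, f in zip(w_o, feats))
    return nets, feats, y

def mse(X, T, W_h, w_o):
    return sum((t - forward(x, W_h, w_o)[2]) ** 2 for x, t in zip(X, T)) / len(X)

def owo(X, T, W_h):
    """Output-weight step: linear least squares on the hidden-unit outputs."""
    dummy = [0.0] * (len(W_h) + 1)
    feats = [forward(x, W_h, dummy)[1] for x in X]
    return least_squares(feats, T)

def hwo(X, T, W_h, w_o, z=0.1):
    """Hidden-weight step: fit weight changes to the gradient-based
    desired net change delta, then move each hidden unit by z * change."""
    outs = [forward(x, W_h, w_o) for x in X]
    new_W = []
    for k, wk in enumerate(W_h):
        deltas = []
        for (_, feats, y), t in zip(outs, T):
            o = feats[k + 1]
            # Negative error gradient w.r.t. net_k (constant factor dropped).
            deltas.append((t - y) * w_o[k + 1] * o * (1.0 - o))
        e = least_squares(X, deltas)
        new_W.append([w + z * ei for w, ei in zip(wk, e)])
    return new_W

# Toy data (assumption): inputs augmented with a constant 1 for the threshold.
random.seed(0)
X = [[1.0, -1.0 + 2.0 * i / 19] for i in range(20)]
T = [math.sin(3.0 * x[1]) for x in X]
W_h = [[random.uniform(-1.0, 1.0) for _ in range(2)] for _ in range(4)]

w_o = owo(X, T, W_h)
errors = [mse(X, T, W_h, w_o)]
for _ in range(20):
    W_h = hwo(X, T, W_h, w_o)   # update hidden weights with output weights fixed
    w_o = owo(X, T, W_h)        # re-solve output weights for the new hidden layer
    errors.append(mse(X, T, W_h, w_o))
```

The list errors holds the mean squared error after each output-weight solve, so it can be inspected or plotted to see how the alternation behaves on the toy data. With a fixed z the hidden-weight step is not guaranteed to lower the error at every iteration; a practical implementation would adapt or line-search that factor.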

Changhua Yu, Michael T. Manry, Jiang Li

Artificial Intelligence | FLAIRS 2004 | Hidden Layer | Net Functions | Output Weights

Added	30 Oct 2010
Updated	30 Oct 2010
Type	Conference
Year	2004
Where	FLAIRS
Authors	Changhua Yu, Michael T. Manry, Jiang Li
