Evaluation of Pooling Operations in Convolutional Architectures for Object Recognition

14 years 4 months ago

Download www.ais.uni-bonn.de

Abstract. A common practice to gain invariant features in object recognition models is to aggregate multiple low-level features over a small neighborhood. However, the differences between those models makes a comparison of the properties of different aggregation functions hard. Our aim is to gain insight into different functions by directly comparing them on a fixed architecture for several common object recognition tasks. Empirical results show that a maximum pooling operation significantly outperforms subsampling operations. Despite their shift-invariant properties, overlapping pooling windows are no significant improvement over non-overlapping pooling windows. By applying this knowledge, we achieve state-of-the-art error rates of 4.57% on the NORB normalized-uniform dataset and 5.6% on the NORB jittered-cluttered dataset.

Dominik Scherer, Andreas Müller, Sven Behnke

Real-time Traffic

ICANN 2010 | Neural Networks | Object Recognition | Object Recognition Models | Pooling Windows |

claim paper

Post Info
More Details (n/a)

Added	09 Nov 2010
Updated	09 Nov 2010
Type	Conference
Year	2010
Where	ICANN
Authors	Dominik Scherer, Andreas Müller, Sven Behnke

Comments (0)

Sciweavers

Evaluation of Pooling Operations in Convolutional Architectures for Object Recognition

ICANN 2010 | Neural Networks | Object Recognition | Object Recognition Models | Pooling Windows |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers