A Theoretical Analysis of Feature Pooling in Visual Recognition

Many modern visual recognition algorithms incorporate a step of spatial 'pooling', where the outputs of several nearby feature detectors are combined into a local or global 'bag of features', in a way that preserves task-related information while removing irrelevant details. Pooling is used to achieve invariance to image transformations, more compact representations, and better robustness to noise and clutter. Several papers have shown that the details of the pooling operation can greatly influence the performance, but studies have so far been purely empirical. In this paper, we show that the reasons underlying the performance of various pooling methods are obscured by several confounding factors, such as the link between the sample cardinality in a spatial pool and the resolution at which low-level features have been extracted. We provide a detailed theoretical analysis of max pooling and average pooling, and give extensive empirical comparisons for object recognition tasks...
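To make the two operations named in the abstract concrete, here is a minimal sketch of max pooling and average pooling over non-overlapping square windows of a small activation grid. It is an illustration only and is not taken from the paper: the function name `pool`, the parameter `pool_size`, and the toy `activations` grid are all assumptions.

```python
def pool(features, pool_size, op):
    """Pool a 2D grid of activations over non-overlapping square windows.

    features:  list of equal-length rows of floats (one detector's responses)
    pool_size: side length of each square window (hypothetical parameter)
    op:        "max" for max pooling, "avg" for average pooling
    """
    if op not in ("max", "avg"):
        raise ValueError("op must be 'max' or 'avg'")
    rows, cols = len(features), len(features[0])
    pooled = []
    for r in range(0, rows - pool_size + 1, pool_size):
        out_row = []
        for c in range(0, cols - pool_size + 1, pool_size):
            # Collect every activation that falls inside this spatial pool.
            window = [features[r + i][c + j]
                      for i in range(pool_size)
                      for j in range(pool_size)]
            if op == "max":
                out_row.append(max(window))
            else:
                out_row.append(sum(window) / len(window))
        pooled.append(out_row)
    return pooled


# Toy grid (assumed data): sparse responses of a single feature detector.
activations = [
    [0.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 2.0],
    [3.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 4.0],
]
max_pooled = pool(activations, 2, "max")  # keeps the strongest response per pool
avg_pooled = pool(activations, 2, "avg")  # averages all responses per pool
```

In this toy grid each pool contains a single non-zero response, so max pooling keeps it unchanged while average pooling divides it by the number of samples in the pool. This is a simple way to see why the sample cardinality of a pool, mentioned in the abstract as a confounding factor, matters when comparing the two operations.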

Y-Lan Boureau, Jean Ponce, Yann LeCun

ICML 2010 | Machine Learning | Max Pooling | Pooling Operation | Various Pooling Methods

Added	09 Nov 2010
Updated	09 Nov 2010
Type	Conference
Year	2010
Where	ICML
Authors	Y-Lan Boureau, Jean Ponce, Yann LeCun
