Adaptive Subgradient Methods for Online Learning and Stochastic Optimization

13 years 9 months ago

Download www.colt2010.org

We present a new family of subgradient methods that dynamically incorporate knowledge of the geometry of the data observed in earlier iterations to perform more informative gradient-based learning. The adaptation, in essence, allows us to find needles in haystacks in the form of very predictive yet rarely observed features. Our paradigm stems from recent advances in online learning which employ proximal functions to control the gradient steps of the algorithm. We describe and analyze an apparatus for adaptively modifying the proximal function, which significantly simplifies the task of setting a learning rate and results in regret guarantees that are provably as good as the best proximal function that can be chosen in hindsight. We corroborate our theoretical results with experiments on a text classification task, showing substantial improvements for classification with sparse datasets.

John Duchi, Elad Hazan, Yoram Singer

Real-time Traffic

COLT 2010 | Earlier Iterations | Machine Learning | Proximal Function | Text Classification Task |

claim paper

Post Info
More Details (n/a)

Added	10 Feb 2011
Updated	10 Feb 2011
Type	Journal
Year	2010
Where	COLT
Authors	John Duchi, Elad Hazan, Yoram Singer

Comments (0)

Sciweavers

Adaptive Subgradient Methods for Online Learning and Stochastic Optimization

COLT 2010 | Earlier Iterations | Machine Learning | Proximal Function | Text Classification Task |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers