Search Sciweavers | Sciweavers

97 search results - page 6 / 20

» Logarithmic Regret Algorithms for Online Convex Optimization

CORR
2010
Springer

112 views, Education

Optimal Distributed Online Prediction using Mini-Batches

15 years 3 months ago

Download research.microsoft.com

Online prediction methods are typically presented as serial algorithms running on a single processor. However, in the age of web-scale prediction problems, it is increasingly comm...

Ofer Dekel, Ran Gilad-Bachrach, Ohad Shamir, Lin X...

AAAI
2004

163 views, Intelligent Agents

Regrets Only! Online Stochastic Optimization under Time Constraints

15 years 7 months ago

Download www.cs.brown.edu

This paper considers online stochastic optimization problems where time constraints severely limit the number of offline optimizations which can be performed at decision time and/...

Russell Bent, Pascal Van Hentenryck

COLT
2010
Springer

207 views, Machine Learning

An Asymptotically Optimal Bandit Algorithm for Bounded Support Models

15 years 3 months ago

Download www.colt2010.org

The multiarmed bandit problem is a typical example of a dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...

Junya Honda, Akimichi Takemura

COLT
2008
Springer

124 views, Machine Learning

High-Probability Regret Bounds for Bandit Online Linear Optimization

15 years 7 months ago

Download colt2008.cs.helsinki.fi

We present a modification of the algorithm of Dani et al. [8] for the online linear optimization problem in the bandit setting, which with high probability has regret at most O ( ...

Peter L. Bartlett, Varsha Dani, Thomas P. Hayes, S...

ICML
2009
IEEE

169 views, Machine Learning

Proximal regularization for online and batch learning

16 years 6 months ago

Download ai.stanford.edu

Many learning algorithms rely on the curvature (in particular, strong convexity) of regularized objective functions to provide good theoretical performance guarantees. In practice...

Chuong B. Do, Quoc V. Le, Chuan-Sheng Foo
