Search Sciweavers | Sciweavers

263 search results - page 6 / 53

» Regret Bounds for Prediction Problems

203

click to vote

CORR
2010
Springer

112views Education» more CORR 2010»

Optimal Distributed Online Prediction using Mini-Batches

15 years 4 months ago

Download research.microsoft.com

Online prediction methods are typically presented as serial algorithms running on a single processor. However, in the age of web-scale prediction problems, it is increasingly comm...

Ofer Dekel, Ran Gilad-Bachrach, Ohad Shamir, Lin X...

claim paper

Read More »

173

Voted

ML
2002
ACM

133views Machine Learning» more ML 2002»

Finite-time Analysis of the Multiarmed Bandit Problem

15 years 7 months ago

Download homes.dsi.unimi.it

Reinforcement learning policies face the exploration versus exploitation dilemma, i.e. the search for a balance between exploring the environment to find profitable actions while t...

Peter Auer, Nicolò Cesa-Bianchi, Paul Fisch...

claim paper

Read More »

182

click to vote

COLT
2007
Springer

144views Machine Learning» more COLT 2007»

Improved Rates for the Stochastic Continuum-Armed Bandit Problem

16 years 1 months ago

Download www.sztaki.hu

Abstract. Considering one-dimensional continuum-armed bandit problems, we propose an improvement of an algorithm of Kleinberg and a new set of conditions which give rise to improve...

Peter Auer, Ronald Ortner, Csaba Szepesvári

claim paper

Read More »

158

Voted

CORR
2010
Springer

55views Education» more CORR 2010»

Prediction with Advice of Unknown Number of Experts

15 years 7 months ago

Download event.cwi.nl

In the framework of prediction with expert advice, we consider a recently introduced kind of regret bounds: the bounds that depend on the effective instead of nominal number of ex...

Alexey V. Chernov, Vladimir Vovk

claim paper

Read More »

232

click to vote

NIPS
2008

124views Information Technology» more NIPS 2008»

On the Complexity of Linear Prediction: Risk Bounds, Margin Bounds, and Regularization

15 years 8 months ago

Download ttic.uchicago.edu

This work characterizes the generalization ability of algorithms whose predictions are linear in the input vector. To this end, we provide sharp bounds for Rademacher and Gaussian...

Sham M. Kakade, Karthik Sridharan, Ambuj Tewari

claim paper

Read More »

« Prev « First page 6 / 53 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers