Search Sciweavers | Sciweavers

22 search results - page 2 / 5

» High-Probability Regret Bounds for Bandit Online Linear Opti...

click to vote

COLT
2010
Springer

217views Machine Learning» more COLT 2010»

Optimal Algorithms for Online Convex Optimization with Multi-Point Bandit Feedback

13 years 5 months ago

Download www.eecs.berkeley.edu

Bandit convex optimization is a special case of online convex optimization with partial information. In this setting, a player attempts to minimize a sequence of adversarially gen...

Alekh Agarwal, Ofer Dekel, Lin Xiao

claim paper

Read More »

click to vote

CORR
2004
Springer

103views Education» more CORR 2004»

Online convex optimization in the bandit setting: gradient descent without a gradient

13 years 7 months ago

Download www.cs.cmu.edu

We study a general online convex optimization problem. We have a convex set S and an unknown sequence of cost functions c1, c2, . . . , and in each period, we choose a feasible po...

Abraham Flaxman, Adam Tauman Kalai, H. Brendan McM...

claim paper

Read More »

click to vote

CORR
2011
Springer

210views Education» more CORR 2011»

Online Learning of Rested and Restless Bandits

13 years 2 months ago

Download www.eecs.umich.edu

In this paper we study the online learning problem involving rested and restless multiarmed bandits with multiple plays. The system consists of a single player/user and a set of K...

Cem Tekin, Mingyan Liu

claim paper

Read More »

click to vote

ALT
2008
Springer

141views Machine Learning» more ALT 2008»

Online Regret Bounds for Markov Decision Processes with Deterministic Transitions

14 years 4 months ago

Download personal.unileoben.ac.at

Abstract. We consider an upper conﬁdence bound algorithm for Markov decision processes (MDPs) with deterministic transitions. For this algorithm we derive upper bounds on the onl...

Ronald Ortner

claim paper

Read More »

click to vote

NIPS
2008

127views Information Technology» more NIPS 2008»

On the Generalization Ability of Online Strongly Convex Programming Algorithms

13 years 9 months ago

Download ttic.uchicago.edu

This paper examines the generalization properties of online convex programming algorithms when the loss function is Lipschitz and strongly convex. Our main result is a sharp bound...

Sham M. Kakade, Ambuj Tewari

claim paper

Read More »

« Prev « First page 2 / 5 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers