Search Sciweavers | Sciweavers

263 search results - page 8 / 53

» Regret Bounds for Prediction Problems

202

click to vote

ICML
2007
IEEE

139views Machine Learning» more ICML 2007»

Multi-armed bandit problems with dependent arms

16 years 8 months ago

Download www.cs.cmu.edu

We provide a framework to exploit dependencies among arms in multi-armed bandit problems, when the dependencies are in the form of a generative model on clusters of arms. We find ...

Sandeep Pandey, Deepayan Chakrabarti, Deepak Agarw...

claim paper

Read More »

166

click to vote

CORR
2008
Springer

64views Education» more CORR 2008»

Linearly Parameterized Bandits

15 years 7 months ago

Download legacy.orie.cornell.edu

We consider bandit problems involving a large (possibly infinite) collection of arms, in which the expected reward of each arm is a linear function of an r-dimensional random vect...

Paat Rusmevichientong, John N. Tsitsiklis

claim paper

Read More »

245

click to vote

COLT
2010
Springer

183views Machine Learning» more COLT 2010»

Regret Minimization With Concept Drift

15 years 5 months ago

Download www.seas.upenn.edu

In standard online learning, the goal of the learner is to maintain an average loss that is "not too big" compared to the loss of the best-performing function in a fixed...

Koby Crammer, Yishay Mansour, Eyal Even-Dar, Jenni...

claim paper

Read More »

177

click to vote

NIPS
2008

127views Information Technology» more NIPS 2008»

On the Generalization Ability of Online Strongly Convex Programming Algorithms

15 years 9 months ago

Download ttic.uchicago.edu

This paper examines the generalization properties of online convex programming algorithms when the loss function is Lipschitz and strongly convex. Our main result is a sharp bound...

Sham M. Kakade, Ambuj Tewari

claim paper

Read More »

227

click to vote

COLT
2010
Springer

169views Machine Learning» more COLT 2010»

Learning Rotations with Little Regret

15 years 5 months ago

Download users.soe.ucsc.edu

We describe online algorithms for learning a rotation from pairs of unit vectors in Rn . We show that the expected regret of our online algorithm compared to the best fixed rotati...

Elad Hazan, Satyen Kale, Manfred K. Warmuth

claim paper

Read More »

« Prev « First page 8 / 53 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers