Search Sciweavers | Sciweavers

22 search results - page 3 / 5

» High-Probability Regret Bounds for Bandit Online Linear Opti...

click to vote

CORR
2012
Springer

210views Education» more CORR 2012»

Towards minimax policies for online linear optimization with bandit feedback

12 years 3 months ago

Download www.princeton.edu

We address the online linear optimization problem with bandit feedback. Our contribution is twofold. First, we provide an algorithm (based on exponential weights) with a regret of...

Sébastien Bubeck, Nicolò Cesa-Bianch...

claim paper

Read More »

click to vote

COLT
2008
Springer

140views Machine Learning» more COLT 2008»

Regret Bounds for Sleeping Experts and Bandits

13 years 9 months ago

Download colt2008.cs.helsinki.fi

We study on-line decision problems where the set of actions that are available to the decision algorithm vary over time. With a few notable exceptions, such problems remained larg...

Robert D. Kleinberg, Alexandru Niculescu-Mizil, Yo...

claim paper

Read More »

click to vote

CORR
2007
Springer

106views Education» more CORR 2007»

Bandit Algorithms for Tree Search

13 years 7 months ago

Download hal.inria.fr

Bandit based methods for tree search have recently gained popularity when applied to huge trees, e.g. in the game of go [6]. Their eﬃcient exploration of the tree enables to ret...

Pierre-Arnaud Coquelin, Rémi Munos

claim paper

Read More »

click to vote

COLT
2008
Springer

98views Machine Learning» more COLT 2008»

Extracting Certainty from Uncertainty: Regret Bounded by Variation in Costs

13 years 9 months ago

Download colt2008.cs.helsinki.fi

Prediction from expert advice is a fundamental problem in machine learning. A major pillar of the field is the existence of learning algorithms whose average loss approaches that ...

Elad Hazan, Satyen Kale

claim paper

Read More »

click to vote

CORR
2010
Springer

171views Education» more CORR 2010»

Online Learning in Opportunistic Spectrum Access: A Restless Bandit Approach

13 years 2 months ago

Download www.eecs.umich.edu

We consider an opportunistic spectrum access (OSA) problem where the time-varying condition of each channel (e.g., as a result of random fading or certain primary users' activ...

Cem Tekin, Mingyan Liu

claim paper

Read More »

« Prev « First page 3 / 5 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers