Search Sciweavers | Sciweavers

66 search results - page 8 / 14

» The Nonstochastic Multiarmed Bandit Problem

NIPS 2008 (Information Technology)

Algorithms for Infinitely Many-Armed Bandits

15 years 8 months ago

Download www.stat.lsa.umich.edu

We consider multi-armed bandit problems where the number of arms is larger than the possible number of experiments. We make a stochastic assumption on the mean-reward of a new sel...

Yizao Wang, Jean-Yves Audibert, Rémi Munos

ALT 2006, Springer (Machine Learning)

Hannan Consistency in On-Line Learning in Case of Unbounded Losses Under Partial Monitoring

15 years 10 months ago

Download www.szit.bme.hu

In this paper the sequential prediction problem with expert advice is considered when the loss is unbounded under partial monitoring scenarios. We deal with a wide class of the par...

Chamy Allenberg, Peter Auer, László ...

CDC 2009, IEEE (Control Systems)

On the myopic policy for a class of restless bandit problems with applications in dynamic multichannel access

15 years 11 months ago

Download www.ece.ucdavis.edu

We consider a class of restless multi-armed bandit problems that arises in multi-channel opportunistic communications, where channels are modeled as independent and stochastically...

Keqin Liu, Qing Zhao

COLT 2008, Springer (Machine Learning)

Regret Bounds for Sleeping Experts and Bandits

15 years 8 months ago

Download colt2008.cs.helsinki.fi

We study on-line decision problems where the set of actions that are available to the decision algorithm varies over time. With a few notable exceptions, such problems remained larg...

Robert D. Kleinberg, Alexandru Niculescu-Mizil, Yo...

Publication

Rollout Sampling Approximate Policy Iteration

16 years 4 months ago

Download www.springerlink.com

Several researchers have recently investigated the connection between reinforcement learning and classification. We are motivated by proposals of approximate policy iteration schem...

Christos Dimitrakakis, Michail G. Lagoudakis

posted by olethros
