Search Sciweavers | Sciweavers

66 search results - page 3 / 14

» The Nonstochastic Multiarmed Bandit Problem

204

click to vote

COLT
2003
Springer

121views Machine Learning» more COLT 2003»

Lower Bounds on the Sample Complexity of Exploration in the Multi-armed Bandit Problem

16 years 6 days ago

Download www.ece.mcgill.ca

We consider the Multi-armed bandit problem under the PAC (“probably approximately correct”) model. It was shown by Even-Dar et al. [5] that given n arms, it suﬃces to play th...

Shie Mannor, John N. Tsitsiklis

claim paper

Read More »

237

click to vote

CORR
2011
Springer

210views Education» more CORR 2011»

Online Learning of Rested and Restless Bandits

15 years 2 months ago

Download www.eecs.umich.edu

In this paper we study the online learning problem involving rested and restless multiarmed bandits with multiple plays. The system consists of a single player/user and a set of K...

Cem Tekin, Mingyan Liu

claim paper

Read More »

217

click to vote

AGI
2011

231views Artificial Intelligence» more AGI 2011»

Reinforcement Learning and the Bayesian Control Rule

14 years 10 months ago

Download metatip.com

We present an actor-critic scheme for reinforcement learning in complex domains. The main contribution is to show that planning and I/O dynamics can be separated such that an intra...

Pedro Alejandro Ortega, Daniel Alexander Braun, Si...

claim paper

Read More »

176

click to vote

NIPS
2008

130views Information Technology» more NIPS 2008»

Mortal Multi-Armed Bandits

15 years 8 months ago

Download www.cs.cmu.edu

We formulate and study a new variant of the k-armed bandit problem, motivated by e-commerce applications. In our model, arms have (stochastic) lifetime after which they expire. In...

Deepayan Chakrabarti, Ravi Kumar, Filip Radlinski,...

claim paper

Read More »

166

click to vote

TSP
2010

138views Artificial Intelligence» more TSP 2010»

Dynamic multichannel access with imperfect channel state detection

15 years 1 months ago

Download www.ece.ucdavis.edu

A restless multi-armed bandit problem that arises in multichannel opportunistic communications is considered, where channels are modeled as independent and identical Gilbert

Keqin Liu, Qing Zhao, Bhaskar Krishnamachari

claim paper

Read More »

« Prev « First page 3 / 14 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers