Search Sciweavers | Sciweavers

44 search results - page 3 / 9

» A structured multiarmed bandit problem and the greedy policy

click to vote

TSP
2010

138views Artificial Intelligence» more TSP 2010»

Dynamic multichannel access with imperfect channel state detection

13 years 2 months ago

Download www.ece.ucdavis.edu

A restless multi-armed bandit problem that arises in multichannel opportunistic communications is considered, where channels are modeled as independent and identical Gilbert

Keqin Liu, Qing Zhao, Bhaskar Krishnamachari

claim paper

Read More »

click to vote

ML
2002
ACM

133views Machine Learning» more ML 2002»

Finite-time Analysis of the Multiarmed Bandit Problem

13 years 7 months ago

Download homes.dsi.unimi.it

Reinforcement learning policies face the exploration versus exploitation dilemma, i.e. the search for a balance between exploring the environment to find profitable actions while t...

Peter Auer, Nicolò Cesa-Bianchi, Paul Fisch...

claim paper

Read More »

click to vote

ALT
2011
Springer

259views Machine Learning» more ALT 2011»

Deviations of Stochastic Bandit Regret

12 years 7 months ago

Download certis.enpc.fr

This paper studies the deviations of the regret in a stochastic multi-armed bandit problem. When the total number of plays n is known beforehand by the agent, Audibert et al. (2009...

Antoine Salomon, Jean-Yves Audibert

claim paper

Read More »

click to vote

CORR
2010
Springer

127views Education» more CORR 2010»

Online Algorithms for the Multi-Armed Bandit Problem with Markovian Rewards

13 years 7 months ago

Download wireless.cs.uh.edu

We consider the classical multi-armed bandit problem with Markovian rewards. When played an arm changes its state in a Markovian fashion while it remains frozen when not played. Th...

Cem Tekin, Mingyan Liu

claim paper

Read More »

click to vote

CORR
2011
Springer

210views Education» more CORR 2011»

Online Learning of Rested and Restless Bandits

13 years 2 months ago

Download www.eecs.umich.edu

In this paper we study the online learning problem involving rested and restless multiarmed bandits with multiple plays. The system consists of a single player/user and a set of K...

Cem Tekin, Mingyan Liu

claim paper

Read More »

« Prev « First page 3 / 9 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers