Search Sciweavers | Sciweavers

17 search results - page 2 / 4

» Distributed learning in multi-armed bandit with multiple pla...

213

click to vote

ICASSP
2010
IEEE

224views Signal Processing» more ICASSP 2010»

Distributed learning in cognitive radio networks: Multi-armed bandit with distributed multiple players

15 years 7 months ago

Download www.ece.ucdavis.edu

—We consider a cognitive radio network with distributed multiple secondary users, where each user independently searches for spectrum opportunities in multiple channels without e...

Keqin Liu, Qing Zhao

claim paper

Read More »

206

click to vote

TSP
2010

170views Artificial Intelligence» more TSP 2010»

Distributed learning in multi-armed bandit with multiple players

15 years 2 months ago

Download www.ece.ucdavis.edu

We formulate and study a decentralized multi-armed bandit (MAB) problem. There are distributed players competing for independent arms. Each arm, when played, offers i.i.d. reward a...

Keqin Liu, Qing Zhao

claim paper

Read More »

253

click to vote

CORR
2011
Springer

210views Education» more CORR 2011»

Online Learning of Rested and Restless Bandits

15 years 2 months ago

Download www.eecs.umich.edu

In this paper we study the online learning problem involving rested and restless multiarmed bandits with multiple plays. The system consists of a single player/user and a set of K...

Cem Tekin, Mingyan Liu

claim paper

Read More »

249

click to vote

COLT
2010
Springer

217views Machine Learning» more COLT 2010»

Optimal Algorithms for Online Convex Optimization with Multi-Point Bandit Feedback

15 years 5 months ago

Download www.eecs.berkeley.edu

Bandit convex optimization is a special case of online convex optimization with partial information. In this setting, a player attempts to minimize a sequence of adversarially gen...

Alekh Agarwal, Ofer Dekel, Lin Xiao

claim paper

Read More »

211

click to vote

COLT
2008
Springer

179views Machine Learning» more COLT 2008»

Adapting to a Changing Environment: the Brownian Restless Bandits

15 years 9 months ago

Download research.microsoft.com

In the multi-armed bandit (MAB) problem there are k distributions associated with the rewards of playing each of k strategies (slot machine arms). The reward distributions are ini...

Aleksandrs Slivkins, Eli Upfal

claim paper

Read More »

« Prev « First page 2 / 4 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers