Search Sciweavers | Sciweavers

74 search results - page 7 / 15

» Regret Bounds for Gaussian Process Bandit Problems

ICASSP
2010
IEEE

224 views | Signal Processing

Distributed learning in cognitive radio networks: Multi-armed bandit with distributed multiple players

15 years 6 months ago

Download www.ece.ucdavis.edu

We consider a cognitive radio network with distributed multiple secondary users, where each user independently searches for spectrum opportunities in multiple channels without e...

Keqin Liu, Qing Zhao

COLT
2004
Springer

78 views | Machine Learning

Online Geometric Optimization in the Bandit Setting Against an Adaptive Adversary

15 years 11 months ago

Download www.cs.cmu.edu

We give an algorithm for the bandit version of a very general online optimization problem considered by Kalai and Vempala [1], for the case of an adaptive adversary. In this proble...

H. Brendan McMahan, Avrim Blum

COLT
2010
Springer

149 views | Machine Learning

Open Loop Optimistic Planning

15 years 4 months ago

Download www.colt2010.org

We consider the problem of planning in a stochastic and discounted environment with a limited numerical budget. More precisely, we investigate strategies exploring the set of poss...

Sébastien Bubeck, Rémi Munos

COLT
2004
Springer

112 views | Machine Learning

Regret Bounds for Hierarchical Classification with Linear-Threshold Functions

15 years 9 months ago

Download eprints.pascal-network.org

We study the problem of classifying data in a given taxonomy when classifications associated with multiple and/or partial paths are allowed. We introduce an incremental algorithm u...

Nicolò Cesa-Bianchi, Alex Conconi, Claudio ...

SIAMCOMP
2002

124 views

The Nonstochastic Multiarmed Bandit Problem

15 years 5 months ago

Download homes.dsi.unimi.it

In the multiarmed bandit problem, a gambler must decide which arm of K nonidentical slot machines to play in a sequence of trials so as to maximize his reward. This class...

Peter Auer, Nicolò Cesa-Bianchi, Yoav Freun...
