bandit problem | Sciweavers

TMC
2011

Cognitive Medium Access: Exploration, Exploitation, and Competition

15 years 1 months ago

This paper establishes the equivalence between cognitive medium access and the competitive multi-armed bandit problem. First, the scenario in which a single cognitive user wish...

Lifeng Lai, Hesham El Gamal, Hai Jiang, H. Vincent...


COLT
2010
Springer

An Asymptotically Optimal Bandit Algorithm for Bounded Support Models

15 years 4 months ago

Download www.colt2010.org

The multiarmed bandit problem is a typical example of a dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...

Junya Honda, Akimichi Takemura


SIAMCOMP
2002

The Nonstochastic Multiarmed Bandit Problem

15 years 6 months ago

Download homes.dsi.unimi.it

Abstract. In the multiarmed bandit problem, a gambler must decide which arm of K nonidentical slot machines to play in a sequence of trials so as to maximize his reward. This class...

Peter Auer, Nicolò Cesa-Bianchi, Yoav Freun...


CORR
2008
Springer

Multi-Armed Bandits in Metric Spaces

15 years 6 months ago

Download www.cs.cornell.edu

In a multi-armed bandit problem, an online algorithm chooses from a set of strategies in a sequence of n trials so as to maximize the total payoff of the chosen strategies. While ...

Robert Kleinberg, Aleksandrs Slivkins, Eli Upfal


LION
2010
Springer

Algorithm Selection as a Bandit Problem with Unbounded Losses

15 years 10 months ago

Download como.vub.ac.be

Abstract. Algorithm selection is typically based on models of algorithm performance learned during a separate offline training sequence, which can be prohibitively expensive. In r...

Matteo Gagliolo, Jürgen Schmidhuber
