Search Sciweavers

11 search results - page 1 / 3


AAAI
2006

Intelligent Agents

An Asymptotically Optimal Algorithm for the Max k-Armed Bandit Problem

13 years 10 months ago

Download www.aaai.org

We present an asymptotically optimal algorithm for the max variant of the k-armed bandit problem. Given a set of k slot machines, each yielding payoff from a fixed (but unknown) d...

Matthew J. Streeter, Stephen F. Smith


CP
2006
Springer

Artificial Intelligence

A Simple Distribution-Free Approach to the Max k-Armed Bandit Problem

14 years 21 days ago

Download www.cs.cmu.edu

The max k-armed bandit problem is a recently-introduced online optimization problem with practical applications to heuristic search. Given a set of k slot machines, each yielding p...

Matthew J. Streeter, Stephen F. Smith


CORR
2011
Springer

Education

Online Learning of Rested and Restless Bandits

13 years 4 months ago

Download www.eecs.umich.edu

In this paper we study the online learning problem involving rested and restless multiarmed bandits with multiple plays. The system consists of a single player/user and a set of K...

Cem Tekin, Mingyan Liu


COLT
2010
Springer

Machine Learning

An Asymptotically Optimal Bandit Algorithm for Bounded Support Models

13 years 7 months ago

Download www.colt2010.org

The multiarmed bandit problem is a typical example of a dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...

Junya Honda, Akimichi Takemura


CORR
2012
Springer

Education

Fractional Moments on Bandit Problems

12 years 4 months ago

Download www.cse.iitm.ac.in

Reinforcement learning addresses the dilemma between exploration to find profitable actions and exploitation to act according to the best observations already made. Bandit proble...

Ananda Narayanan B., Balaraman Ravindran

