Sciweavers

AAAI
2006

An Asymptotically Optimal Algorithm for the Max k-Armed Bandit Problem

Download www.aaai.org

We present an asymptotically optimal algorithm for the max variant of the k-armed bandit problem. Given a set of k slot machines, each yielding payoff from a fixed (but unknown) distribution, we wish to allocate trials to the machines so as to maximize the expected maximum payoff received over a series of n trials. Subject to certain distributional assumptions, we show that $O\!\left(\frac{k \ln(k/\delta)\,\ln(n)^2}{\epsilon^2}\right)$ trials are sufficient to identify, with probability at least $1 - \delta$, a machine whose expected maximum payoff is within $\epsilon$ of optimal. This result leads to a strategy for solving the problem that is asymptotically optimal in the following sense: the gap between the expected maximum payoff obtained by using our strategy for n trials and that obtained by pulling the single best arm for all n trials approaches zero as $n \to \infty$.
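To make the objective in the abstract concrete, here is a minimal simulation sketch of the max k-armed bandit setting. It is not the authors' algorithm: the two Gaussian arms, the function names, and the fixed pull schedules are assumptions chosen only for illustration. It estimates the expected maximum payoff over n trials for a few schedules, and shows that the arm with the highest mean need not be the best arm under the max objective.

```python
import random

def max_payoff(arms, schedule, rng):
    """Pull arms in the order given by `schedule` and return the largest payoff seen."""
    return max(arms[i](rng) for i in schedule)

def expected_max(arms, schedule, runs=2000, seed=0):
    """Monte Carlo estimate of the expected maximum payoff for a fixed schedule."""
    rng = random.Random(seed)
    return sum(max_payoff(arms, schedule, rng) for _ in range(runs)) / runs

# Two hypothetical arms (assumed for illustration, not taken from the paper).
arms = [
    lambda rng: rng.gauss(1.0, 0.1),  # higher mean, narrow spread
    lambda rng: rng.gauss(0.0, 2.0),  # lower mean, wide spread
]

n = 100  # number of trials
only_arm0 = expected_max(arms, [0] * n)
only_arm1 = expected_max(arms, [1] * n)
round_robin = expected_max(arms, [t % 2 for t in range(n)])

print(f"always arm 0: {only_arm0:.2f}")
print(f"always arm 1: {only_arm1:.2f}")
print(f"round robin:  {round_robin:.2f}")
```

In this sketch the wide-spread arm wins under the max objective even though its mean is lower, and splitting trials evenly falls short of pulling that arm every time. That shortfall, relative to the single best arm, is the gap the abstract says approaches zero for the proposed strategy.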

Matthew J. Streeter, Stephen F. Smith

AAAI 2006 | Asymptotically Optimal Algorithm | Intelligent Agents | Maximum Payoff | Yielding Payoff

Related Content

» A Simple Distribution-Free Approach to the Max k-Armed Bandit Problem

» Online Learning of Rested and Restless Bandits

» An Asymptotically Optimal Bandit Algorithm for Bounded Support Models

» Fractional Moments on Bandit Problems

» Active Learning in Multi-armed Bandits

» Defensive Universal Learning with Experts

» Analysis of the (1+1) EA for a Noisy OneMax

» Optimal Inapproximability Results for MAX-CUT and Other 2-Variable CSPs

» Toward comparison-based adaptive operator selection

Post Info

Added	30 Oct 2010
Updated	30 Oct 2010
Type	Conference
Year	2006
Where	AAAI
Authors	Matthew J. Streeter, Stephen F. Smith
