Search Sciweavers | Sciweavers

COLT
2008
Springer

Competing in the Dark: An Efficient Algorithm for Bandit Linear Optimization

13 years 9 months ago

We introduce an efficient algorithm for the problem of online linear optimization in the bandit setting which achieves the optimal O*(√T) regret. The setting is a natural general...

Jacob Abernethy, Elad Hazan, Alexander Rakhlin


NIPS
2007

The Price of Bandit Information for Online Optimization

13 years 8 months ago

Download books.nips.cc

In the online linear optimization problem, a learner must choose, in each round, a decision from a set D ⊂ ℝⁿ in order to minimize an (unknown and changing) linear cost function...

Varsha Dani, Thomas P. Hayes, Sham Kakade


SDM
2007
SIAM

Bandits for Taxonomies: A Model-based Approach

13 years 8 months ago

Download www.cs.cmu.edu

We consider a novel problem of learning an optimal matching, in an online fashion, between two feature spaces that are organized as taxonomies. We formulate this as a multi-armed ...

Sandeep Pandey, Deepak Agarwal, Deepayan Chakrabar...


CP
2006
Springer

A Simple Distribution-Free Approach to the Max k-Armed Bandit Problem

13 years 11 months ago

Download www.cs.cmu.edu

The max k-armed bandit problem is a recently-introduced online optimization problem with practical applications to heuristic search. Given a set of k slot machines, each yielding p...

Matthew J. Streeter, Stephen F. Smith


CORR
2011
Springer

Online Learning of Rested and Restless Bandits

13 years 2 months ago

Download www.eecs.umich.edu

In this paper we study the online learning problem involving rested and restless multiarmed bandits with multiple plays. The system consists of a single player/user and a set of K...

Cem Tekin, Mingyan Liu
