Search Sciweavers | Sciweavers

6 search results - page 1 / 2

» Experience-efficient learning in associative bandit problems

192

Voted

ICML
2006
IEEE

108views Machine Learning» more ICML 2006»

Experience-efficient learning in associative bandit problems

16 years 8 months ago

Download paul.rutgers.edu

We formalize the associative bandit problem framework introduced by Kaelbling as a learning-theory problem. The learning environment is modeled as a k-armed bandit where arm payof...

Alexander L. Strehl, Chris Mesterharm, Michael L. ...

claim paper

Read More »

285

click to vote

CORR
2010
Springer

175views Education» more CORR 2010»

On the Combinatorial Multi-Armed Bandit Problem with Markovian Rewards

15 years 1 months ago

Download ceng.usc.edu

We consider a combinatorial generalization of the classical multi-armed bandit problem that is defined as follows. There is a given bipartite graph of M users and N M resources. F...

Yi Gai, Bhaskar Krishnamachari, Mingyan Liu

claim paper

Read More »

173

click to vote

ALT
2008
Springer

171views Machine Learning» more ALT 2008»

Active Learning in Multi-armed Bandits

16 years 4 months ago

Download www.sztaki.hu

In this paper we consider the problem of actively learning the mean values of distributions associated with a ﬁnite number of options (arms). The algorithms can select which opti...

András Antos, Varun Grover, Csaba Szepesv&a...

claim paper

Read More »

212

click to vote

COLT
2008
Springer

179views Machine Learning» more COLT 2008»

Adapting to a Changing Environment: the Brownian Restless Bandits

15 years 9 months ago

Download research.microsoft.com

In the multi-armed bandit (MAB) problem there are k distributions associated with the rewards of playing each of k strategies (slot machine arms). The reward distributions are ini...

Aleksandrs Slivkins, Eli Upfal

claim paper

Read More »

286

click to vote

Publication

466views

Multi-Armed Bandit Mechanisms for Multi-Slot Sponsored Search Auctions

16 years 6 months ago

Download arxiv.org

In pay-per click sponsored search auctions which are cur- rently extensively used by search engines, the auction for a keyword involves a certain number of advertisers (say k) c...

Akash Das Sarma, Sujit Gujar, Y. Narahari

posted by sujit

Read More »

« Prev « First page 1 / 2 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers