Search Sciweavers | Sciweavers

66 search results - page 9 / 14

» The Nonstochastic Multiarmed Bandit Problem

114

click to vote

ICDM
2007
IEEE

138views Data Mining» more ICDM 2007»

Bandit-Based Algorithms for Budgeted Learning

15 years 9 months ago

Download cse.unl.edu

We explore the problem of budgeted machine learning, in which the learning algorithm has free access to the training examples’ labels but has to pay for each attribute that is s...

Kun Deng, Chris Bourke, Stephen D. Scott, Julie Su...

claim paper

Read More »

123

click to vote

ICML
2010
IEEE

193views Machine Learning» more ICML 2010»

Efficient Selection of Multiple Bandit Arms: Theory and Practice

15 years 4 months ago

Download www.cs.utexas.edu

We consider the general, widely applicable problem of selecting from n real-valued random variables a subset of size m of those with the highest means, based on as few samples as ...

Shivaram Kalyanakrishnan, Peter Stone

claim paper

Read More »

134

click to vote

COLT
2010
Springer

207views Machine Learning» more COLT 2010»

An Asymptotically Optimal Bandit Algorithm for Bounded Support Models

15 years 1 months ago

Download www.colt2010.org

Multiarmed bandit problem is a typical example of a dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...

Junya Honda, Akimichi Takemura

claim paper

Read More »

139

click to vote

TMC
2011

137views more TMC 2011»

Cognitive Medium Access: Exploration, Exploitation, and Competition

14 years 10 months ago

Download www.ece.ubc.ca

— This paper establishes the equivalence between cognitive medium access and the competitive multi-armed bandit problem. First, the scenario in which a single cognitive user wish...

Lifeng Lai, Hesham El Gamal, Hai Jiang, H. Vincent...

claim paper

Read More »

102

click to vote

ALT
2007
Springer

134views Machine Learning» more ALT 2007»

Tuning Bandit Algorithms in Stochastic Environments

16 years 1 days ago

Download www.sztaki.hu

Algorithms based on upper-conﬁdence bounds for balancing exploration and exploitation are gaining popularity since they are easy to implement, eﬃcient and eﬀective. In this p...

Jean-Yves Audibert, Rémi Munos, Csaba Szepe...

claim paper

Read More »

« Prev « First page 9 / 14 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers