Search Sciweavers | Sciweavers

5 search results - page 1 / 1

» Lower Bounds on the Sample Complexity of Exploration in the ...

Voted

COLT
2003
Springer

121views Machine Learning» more COLT 2003»

Lower Bounds on the Sample Complexity of Exploration in the Multi-armed Bandit Problem

14 years 21 days ago

Download www.ece.mcgill.ca

We consider the Multi-armed bandit problem under the PAC (“probably approximately correct”) model. It was shown by Even-Dar et al. [5] that given n arms, it suﬃces to play th...

Shie Mannor, John N. Tsitsiklis

claim paper

Read More »

click to vote

CORR
2012
Springer

216views Education» more CORR 2012»

Fractional Moments on Bandit Problems

12 years 3 months ago

Download www.cse.iitm.ac.in

Reinforcement learning addresses the dilemma between exploration to ﬁnd profitable actions and exploitation to act according to the best observations already made. Bandit proble...

Ananda Narayanan B., Balaraman Ravindran

claim paper

Read More »

click to vote

COLT
2008
Springer

96views Machine Learning» more COLT 2008»

The True Sample Complexity of Active Learning

13 years 9 months ago

Download www.cs.cmu.edu

We describe and explore a new perspective on the sample complexity of active learning. In many situations where it was generally believed that active learning does not help, we sh...

Maria-Florina Balcan, Steve Hanneke, Jennifer Wort...

claim paper

Read More »

click to vote

ICML
2009
IEEE

155views Machine Learning» more ICML 2009»

Near-Bayesian exploration in polynomial time

14 years 8 months ago

Download ai.stanford.edu

We consider the exploration/exploitation problem in reinforcement learning (RL). The Bayesian approach to model-based RL offers an elegant solution to this problem, by considering...

J. Zico Kolter, Andrew Y. Ng

claim paper

Read More »

click to vote

ATAL
2008
Springer

140views Intelligent Agents» more ATAL 2008»

Approximating power indices

13 years 9 months ago

Download www.cs.huji.ac.il

Many multiagent domains where cooperation among agents is crucial to achieving a common goal can be modeled as coalitional games. However, in many of these domains, agents are une...

Yoram Bachrach, Evangelos Markakis, Ariel D. Proca...

claim paper

Read More »

« Prev « First page 1 / 1 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers