Search Sciweavers | Sciweavers

» Efficient Reinforcement Learning

IJCAI 2001, Artificial Intelligence

R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning

Download jmlr.csail.mit.edu

R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...

Ronen I. Brafman, Moshe Tennenholtz

Publication

Robust Bayesian reinforcement learning through tight lower bounds

Download arxiv.org

In the Bayesian approach to sequential decision making, exact calculation of the (subjective) utility is intractable. This extends to most special cases of interest, such as reinfo...

Christos Dimitrakakis

SLOGICA 2008

Emergence of Information Transfer by Inductive Learning

Download www.lps.uci.edu

We study a simple game theoretic model of information transfer which we consider to be a baseline model for capturing strategic aspects of epistemological questions. In particular,...

Simon M. Huttegger, Brian Skyrms

ECAI 2010, Springer, Artificial Intelligence

The Dynamics of Multi-Agent Reinforcement Learning

Download www.doc.ic.ac.uk

Infinite-horizon multi-agent control processes with nondeterminism and partial state knowledge have particularly interesting properties with respect to adaptive control, ...

Luke Dickens, Krysia Broda, Alessandra Russo

Publication

Bayesian multitask inverse reinforcement learning

Download arxiv.org

We generalise the problem of inverse reinforcement learning to multiple tasks, from multiple demonstrations. Each one may represent one expert trying to solve a different task, or ...

Christos Dimitrakakis, Constantin A. Rothkopf
