Search Sciweavers | Sciweavers

165 search results - page 18 / 33

» Exploration and apprenticeship learning in reinforcement lea...

NIPS
1996

117 views, Information Technology

Reinforcement Learning for Mixed Open-loop and Closed-loop Control

13 years 10 months ago

Download anytime.cs.umass.edu

Closed-loop control relies on sensory feedback that is usually assumed to be free. But if sensing incurs a cost, it may be cost-effective to take sequences of actions in open-loop m...

Eric A. Hansen, Andrew G. Barto, Shlomo Zilberstei...

IJCAI
2001

151 views, Artificial Intelligence

R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning

13 years 10 months ago

Download jmlr.csail.mit.edu

R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...

Ronen I. Brafman, Moshe Tennenholtz

ATAL
2008
Springer

136 views, Intelligent Agents

Efficient multi-agent reinforcement learning through automated supervision

13 years 11 months ago

Download www.cs.umass.edu

Multi-Agent Reinforcement Learning (MARL) algorithms suffer from slow convergence and even divergence, especially in large-scale systems. In this work, we develop a supervision fr...

Chongjie Zhang, Sherief Abdallah, Victor R. Lesser

ICML
2005
IEEE

196 views, Machine Learning

Bayesian sparse sampling for on-line reward optimization

14 years 9 months ago

Download www.cs.ualberta.ca

We present an efficient "sparse sampling" technique for approximating Bayes optimal decision making in reinforcement learning, addressing the well known exploration vers...

Tao Wang, Daniel J. Lizotte, Michael H. Bowling, D...

ICML
2002
IEEE

138 views, Machine Learning

Reinforcement Learning and Shaping: Encouraging Intended Behaviors

14 years 9 months ago

Download www.grappa.univ-lille3.fr

We explore dynamic shaping to integrate our prior beliefs of the final policy into a conventional reinforcement learning system. Shaping provides a positive or negative artificial...

Adam Laud, Gerald DeJong
