reinforcement learning

AAAI 2010

199 views, Intelligent Agents

Bayesian Policy Search for Multi-Agent Role Discovery

14 years 1 months ago

Bayesian inference is an appealing approach for leveraging prior knowledge in reinforcement learning (RL). In this paper we describe an algorithm for discovering different classes...

Aaron Wilson, Alan Fern, Prasad Tadepalli

AAAI 1992

128 views, Intelligent Agents

Automatic Programming of Robots Using Genetic Programming

14 years 1 months ago

Download www.genetic-programming.com

The goal in automatic programming is to get a computer to perform a task by telling it what needs to be done, rather than by explicitly programming it. This paper considers the ta...

John R. Koza, James Rice

NIPS 1996

117 views, Information Technology

Reinforcement Learning for Mixed Open-loop and Closed-loop Control

14 years 1 months ago

Download anytime.cs.umass.edu

Closed-loop control relies on sensory feedback that is usually assumed to be free. But if sensing incurs a cost, it may be cost-effective to take sequences of actions in open-loop m...

Eric A. Hansen, Andrew G. Barto, Shlomo Zilberstei...

NIPS 1996

192 views, Information Technology

Multidimensional Triangulation and Interpolation for Reinforcement Learning

14 years 1 months ago

Download www.cs.cmu.edu

Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...

Scott Davies

NIPS 1994

152 views, Information Technology

Finding Structure in Reinforcement Learning

14 years 1 months ago

Download www.ri.cmu.edu

Reinforcement learning addresses the problem of learning to select actions in order to maximize one's performance in unknown environments. To scale reinforcement learning to com...

Sebastian Thrun, Anton Schwartz

NIPS 1993

86 views, Information Technology

Robust Reinforcement Learning in Motion Planning

14 years 1 months ago

Download www.cs.cmu.edu

While exploring to find better solutions, an agent performing online reinforcement learning (RL) can perform worse than is acceptable. In some cases, exploration might have unsafe, ...

Satinder P. Singh, Andrew G. Barto, Roderic A. Gru...

AAAI 1994

185 views, Intelligent Agents

Learning to Coordinate without Sharing Information

14 years 1 months ago

Download www.agent.ai

Researchers in the field of Distributed Artificial Intelligence (DAI) have been developing efficient mechanisms to coordinate the activities of multiple autonomous agents. The need for...

Sandip Sen, Mahendra Sekaran, John Hale

AAAI 1993

107 views, Intelligent Agents

Complexity Analysis of Real-Time Reinforcement Learning

14 years 1 months ago

Download www.ri.cmu.edu

This paper analyzes the complexity of on-line reinforcement learning algorithms, namely asynchronous real-time versions of Q-learning and value-iteration, applied to the problem of...

Sven Koenig, Reid G. Simmons

NIPS 1997

94 views, Information Technology

Reinforcement Learning with Hierarchies of Machines

14 years 1 months ago

Download www.cs.berkeley.edu

We present a new approach to reinforcement learning in which the policies considered by the learning process are constrained by hierarchies of partially specified machines. This ...

Ronald Parr, Stuart J. Russell

NIPS 2000

112 views, Information Technology

Balancing Multiple Sources of Reward in Reinforcement Learning

14 years 1 months ago

Download www.cc.gatech.edu

For many problems which would be natural for reinforcement learning, the reward signal is not a single scalar value but has multiple scalar components. Examples of such problems i...

Christian R. Shelton

Sciweavers
