Search Sciweavers | Sciweavers

178 search results - page 13 / 36

» Probabilistic policy reuse in a reinforcement learning agent

ATAL
2008
Springer

123 views, Intelligent Agents

Sigma point policy iteration

13 years 10 months ago

Download web.mit.edu

In reinforcement learning, least-squares temporal difference methods (e.g., LSTD and LSPI) are effective, data-efficient techniques for policy evaluation and control with linear v...

Michael H. Bowling, Alborz Geramifard, David Winga...

AGI
2011

231 views, Artificial Intelligence

Reinforcement Learning and the Bayesian Control Rule

13 years 7 days ago

Download metatip.com

We present an actor-critic scheme for reinforcement learning in complex domains. The main contribution is to show that planning and I/O dynamics can be separated such that an intra...

Pedro Alejandro Ortega, Daniel Alexander Braun, Si...

NIPS
2004

120 views, Information Technology

Multi-agent Cooperation in Diverse Population Games

13 years 10 months ago

Download books.nips.cc

We consider multi-agent systems whose agents compete for resources by striving to be in the minority group. The agents adapt to the environment by reinforcement learning of the pr...

K. Y. Michael Wong, S. W. Lim, Zhuo Gao

NIPS
1993

86 views, Information Technology

Robust Reinforcement Learning in Motion Planning

13 years 10 months ago

Download www.cs.cmu.edu

While exploring to find better solutions, an agent performing online reinforcement learning (RL) can perform worse than is acceptable. In some cases, exploration might have unsafe, ...

Satinder P. Singh, Andrew G. Barto, Roderic A. Gru...

ICML
2006
IEEE

142 views, Machine Learning

An intrinsic reward mechanism for efficient exploration

14 years 9 months ago

Download www-anw.cs.umass.edu

How should a reinforcement learning agent act if its sole purpose is to efficiently learn an optimal policy for later use? In other words, how should it explore, to be able to exp...

Özgür Simsek, Andrew G. Barto
