Sciweavers

1236 search results - page 8 / 248
» Opposition-Based Reinforcement Learning
Sort
View
ICCCI
2011
Springer
12 years 8 months ago
Evolving Equilibrium Policies for a Multiagent Reinforcement Learning Problem with State Attractors
Multiagent reinforcement learning problems are especially difficult because of their dynamism and the size of joint state space. In this paper a new benchmark problem is proposed, ...
Florin Leon
NECO
2010
97views more  NECO 2010»
13 years 7 months ago
Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning
Most conventional Policy Gradient Reinforcement Learning (PGRL) algorithms neglect (or do not explicitly make use of) a term in the average reward gradient with respect to the pol...
Tetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto...
AGI
2011
13 years 5 days ago
Reinforcement Learning and the Bayesian Control Rule
We present an actor-critic scheme for reinforcement learning in complex domains. The main contribution is to show that planning and I/O dynamics can be separated such that an intra...
Pedro Alejandro Ortega, Daniel Alexander Braun, Si...
JAIR
2000
131views more  JAIR 2000»
13 years 8 months ago
An Application of Reinforcement Learning to Dialogue Strategy Selection in a Spoken Dialogue System for Email
This paper describes a novel method by which a spoken dialogue system can learn to choose an optimal dialogue strategy from its experience interacting with human users. The method...
Marilyn A. Walker
GECCO
2011
Springer
276views Optimization» more  GECCO 2011»
13 years 1 days ago
Evolution of reward functions for reinforcement learning
The reward functions that drive reinforcement learning systems are generally derived directly from the descriptions of the problems that the systems are being used to solve. In so...
Scott Niekum, Lee Spector, Andrew G. Barto