Sciweavers

1236 search results - page 8 / 248
» Opposition-Based Reinforcement Learning
Sort
View
209
Voted
ICCCI
2011
Springer
14 years 2 months ago
Evolving Equilibrium Policies for a Multiagent Reinforcement Learning Problem with State Attractors
Multiagent reinforcement learning problems are especially difficult because of their dynamism and the size of joint state space. In this paper a new benchmark problem is proposed, ...
Florin Leon
126
Voted
NECO
2010
97views more  NECO 2010»
15 years 1 months ago
Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning
Most conventional Policy Gradient Reinforcement Learning (PGRL) algorithms neglect (or do not explicitly make use of) a term in the average reward gradient with respect to the pol...
Tetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto...
149
Voted
AGI
2011
14 years 6 months ago
Reinforcement Learning and the Bayesian Control Rule
We present an actor-critic scheme for reinforcement learning in complex domains. The main contribution is to show that planning and I/O dynamics can be separated such that an intra...
Pedro Alejandro Ortega, Daniel Alexander Braun, Si...
136
Voted
JAIR
2000
131views more  JAIR 2000»
15 years 2 months ago
An Application of Reinforcement Learning to Dialogue Strategy Selection in a Spoken Dialogue System for Email
This paper describes a novel method by which a spoken dialogue system can learn to choose an optimal dialogue strategy from its experience interacting with human users. The method...
Marilyn A. Walker
197
Voted
GECCO
2011
Springer
276views Optimization» more  GECCO 2011»
14 years 6 months ago
Evolution of reward functions for reinforcement learning
The reward functions that drive reinforcement learning systems are generally derived directly from the descriptions of the problems that the systems are being used to solve. In so...
Scott Niekum, Lee Spector, Andrew G. Barto