Sciweavers

1233 search results - page 134 / 247
» Feudal Reinforcement Learning
ICML
1994
IEEE
13 years 11 months ago
Markov Games as a Framework for Multi-Agent Reinforcement Learning
In the Markov decision process (MDP) formalization of reinforcement learning, a single adaptive agent interacts with an environment defined by a probabilistic transition function....
Michael L. Littman
UAI
2001
13 years 9 months ago
The Optimal Reward Baseline for Gradient-Based Reinforcement Learning
There exist a number of reinforcement learning algorithms which learn by climbing the gradient of expected reward. Their long-run convergence has been proved, even in partially ob...
Lex Weaver, Nigel Tao
ECAI
2010
Springer
13 years 9 months ago
The Dynamics of Multi-Agent Reinforcement Learning
Infinite-horizon multi-agent control processes with nondeterminism and partial state knowledge have particularly interesting properties with respect to adaptive control, ...
Luke Dickens, Krysia Broda, Alessandra Russo
IJRR
2008
13 years 8 months ago
Trajectory Optimization using Reinforcement Learning for Map Exploration
Automatically building maps from sensor data is a necessary and fundamental skill for mobile robots; as a result, considerable research attention has focused on the technical chall...
Thomas Kollar, Nicholas Roy
ESOA
2006
13 years 11 months ago
Reinforcement Learning for Online Control of Evolutionary Algorithms
The research reported in this paper is concerned with assessing the usefulness of reinforcement learning (RL) for on-line calibration of parameters in evolutionary algorithms (EA). ...
A. E. Eiben, Mark Horvath, Wojtek Kowalczyk, Marti...