Sciweavers

1512 search results - page 81 / 303
» Qualitative reinforcement learning
Sort
View
ICML
1994
IEEE
13 years 11 months ago
Markov Games as a Framework for Multi-Agent Reinforcement Learning
In the Markov decision process (MDP) formalization of reinforcement learning, a single adaptive agent interacts with an environment defined by a probabilistic transition function....
Michael L. Littman
UAI
2001
13 years 9 months ago
The Optimal Reward Baseline for Gradient-Based Reinforcement Learning
There exist a number of reinforcement learning algorithms which learn by climbing the gradient of expected reward. Their long-run convergence has been proved, even in partially ob...
Lex Weaver, Nigel Tao
ECAI
2010
Springer
13 years 9 months ago
The Dynamics of Multi-Agent Reinforcement Learning
Abstract. Infinite-horizon multi-agent control processes with nondeterminism and partial state knowledge have particularly interesting properties with respect to adaptive control, ...
Luke Dickens, Krysia Broda, Alessandra Russo
IJRR
2008
151views more  IJRR 2008»
13 years 7 months ago
Trajectory Optimization using Reinforcement Learning for Map Exploration
Automatically building maps from sensor data is a necessary and fundamental skill for mobile robots; as a result, considerable research attention has focused on the technical chall...
Thomas Kollar, Nicholas Roy
AROBOTS
1999
104views more  AROBOTS 1999»
13 years 7 months ago
Reinforcement Learning Soccer Teams with Incomplete World Models
We use reinforcement learning (RL) to compute strategies for multiagent soccer teams. RL may pro t signi cantly from world models (WMs) estimating state transition probabilities an...
Marco Wiering, Rafal Salustowicz, Jürgen Schm...