Sciweavers

226 search results - page 19 / 46
» A Convergent Reinforcement Learning Algorithm in the Continu...
Sort
View
AROBOTS
1999
104views more  AROBOTS 1999»
13 years 6 months ago
Reinforcement Learning Soccer Teams with Incomplete World Models
We use reinforcement learning (RL) to compute strategies for multiagent soccer teams. RL may pro t signi cantly from world models (WMs) estimating state transition probabilities an...
Marco Wiering, Rafal Salustowicz, Jürgen Schm...
ICML
1999
IEEE
14 years 7 months ago
Implicit Imitation in Multiagent Reinforcement Learning
Imitation is actively being studied as an effective means of learning in multi-agent environments. It allows an agent to learn how to act well (perhaps optimally) by passively obs...
Bob Price, Craig Boutilier
ICML
2000
IEEE
14 years 7 months ago
Reinforcement Learning in POMDP's via Direct Gradient Ascent
This paper discusses theoretical and experimental aspects of gradient-based approaches to the direct optimization of policy performance in controlled ??? ?s. We introduce ??? ?, a...
Jonathan Baxter, Peter L. Bartlett
ECML
2004
Springer
14 years 4 days ago
Batch Reinforcement Learning with State Importance
Abstract. We investigate the problem of using function approximation in reinforcement learning where the agent’s policy is represented as a classifier mapping states to actions....
Lihong Li, Vadim Bulitko, Russell Greiner
NECO
2010
103views more  NECO 2010»
13 years 5 months ago
Posterior Weighted Reinforcement Learning with State Uncertainty
Reinforcement learning models generally assume that a stimulus is presented that allows a learner to unambiguously identify the state of nature, and the reward received is drawn f...
Tobias Larsen, David S. Leslie, Edmund J. Collins,...