Sciweavers

827 search results - page 7 / 166
» Variational methods for Reinforcement Learning
Sort
View
134
Voted
IJCAI
2001
15 years 3 months ago
Exploiting Multiple Secondary Reinforcers in Policy Gradient Reinforcement Learning
Most formulations of Reinforcement Learning depend on a single reinforcement reward value to guide the search for the optimal policy solution. If observation of this reward is rar...
Gregory Z. Grudic, Lyle H. Ungar
113
Voted
NIPS
2001
15 years 3 months ago
Reinforcement Learning with Long Short-Term Memory
This paper presents reinforcement learning with a Long ShortTerm Memory recurrent neural network: RL-LSTM. Model-free RL-LSTM using Advantage learning and directed exploration can...
Bram Bakker
122
Voted
ICMLA
2003
15 years 3 months ago
A Distributed Reinforcement Learning Approach to Pattern Inference in Go
— This paper shows that the distributed representation found in Learning Vector Quantization (LVQ) enables reinforcement learning methods to cope with a large decision search spa...
Myriam Abramson, Harry Wechsler
116
Voted
PKDD
2009
Springer
181views Data Mining» more  PKDD 2009»
15 years 9 months ago
Active Learning for Reward Estimation in Inverse Reinforcement Learning
Abstract. Inverse reinforcement learning addresses the general problem of recovering a reward function from samples of a policy provided by an expert/demonstrator. In this paper, w...
Manuel Lopes, Francisco S. Melo, Luis Montesano
KES
2004
Springer
15 years 7 months ago
Coordination in Multiagent Reinforcement Learning Systems
This paper presents a novel method for on-line coordination in multiagent reinforcement learning systems. In this method a reinforcement-learning agent learns to select its action ...
M. A. S. Kamal, Junichi Murata