Sciweavers

1233 search results - page 3 / 247
» Reinforcement learning
Sort
View
CORR
1998
Springer
164views Education» more  CORR 1998»
13 years 7 months ago
Training Reinforcement Neurocontrollers Using the Polytope Algorithm
A new training algorithm is presented for delayed reinforcement learning problems that does not assume the existence of a critic model and employs the polytope optimization algorit...
Aristidis Likas, Isaac E. Lagaris
IAT
2003
IEEE
14 years 22 days ago
Asymmetric Multiagent Reinforcement Learning
A gradient-based method for both symmetric and asymmetric multiagent reinforcement learning is introduced in this paper. Symmetric multiagent reinforcement learning addresses the ...
Ville Könönen
NECO
2002
105views more  NECO 2002»
13 years 7 months ago
Multiple Model-Based Reinforcement Learning
We propose a modular reinforcement learning architecture for non-linear, nonstationary control tasks, which we call multiple model-based reinforcement learning (MMRL). The basic i...
Kenji Doya, Kazuyuki Samejima, Ken-ichi Katagiri, ...
ICML
2000
IEEE
14 years 8 months ago
Algorithms for Inverse Reinforcement Learning
Andrew Y. Ng, Stuart J. Russell