Sciweavers

3274 search results - page 47 / 655
» Using Learning in a Control Agent
Sort
View
ATAL
2005
Springer
15 years 7 months ago
Improving reinforcement learning function approximators via neuroevolution
Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...
Shimon Whiteson
139
Voted
IJCAI
2001
15 years 3 months ago
R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning
R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...
Ronen I. Brafman, Moshe Tennenholtz
AI
1999
Springer
15 years 1 months ago
Building Agent Teams Using an Explicit Teamwork Model and Learning
Milind Tambe, Jafar Adibi, Y. Alonaizon, Ali Erdem...
127
Voted
IROS
2008
IEEE
125views Robotics» more  IROS 2008»
15 years 8 months ago
Dynamic correlation matrix based multi-Q learning for a multi-robot system
—Multi-robot reinforcement learning is a very challenging area due to several issues, such as large state spaces, difficulty in reward assignment, nondeterministic action selecti...
Hongliang Guo, Yan Meng
DAGSTUHL
2003
15 years 3 months ago
Toward a Cognitive System Algebra: Application to Facial Expression Learning and Imitation
In this paper, we try to demonstrate the capability of a very simple architecture to learn to recognize and reproduce facial expressions without the innate capability to recognize ...
Philippe Gaussier, Ken Prepin, Jacqueline Nadel