Sciweavers

458 search results - page 63 / 92
» Q-Decomposition for Reinforcement Learning Agents
AAAI
2010
13 years 9 months ago
Multi-Agent Learning with Policy Prediction
Due to the non-stationary environment, learning in multi-agent systems is a challenging problem. This paper first introduces a new gradient-based learning algorithm, augmenting th...
Chongjie Zhang, Victor R. Lesser
ICML
2003
IEEE
14 years 29 days ago
The Significance of Temporal-Difference Learning in Self-Play Training: TD-Rummy versus EVO-rummy
Reinforcement learning has been used for training game playing agents. The value function for a complex game must be approximated with a continuous function because the number of ...
Clifford Kotnik, Jugal K. Kalita
ATAL
2008
Springer
13 years 9 months ago
A new perspective to the keepaway soccer: the takers
Keepaway is a sub-problem of RoboCup Soccer Simulator in which 'the keepers' try to maintain the possession of the ball, while 'the takers' try to steal the ba...
Atil Iscen, Umut Erogul
AAAI
1992
13 years 8 months ago
Automatic Programming of Robots Using Genetic Programming
The goal in automatic programming is to get a computer to perform a task by telling it what needs to be done, rather than by explicitly programming it. This paper considers the ta...
John R. Koza, James Rice
SOCROB
2010
13 years 6 months ago
Using the Interaction Rhythm as a Natural Reinforcement Signal for Social Robots: A Matter of Belief
In this paper, we present the results of a pilot study of a human robot interaction experiment where the rhythm of the interaction is used as a reinforcement signal to le...
Antoine Hiolle, Lola Cañamero, Pierre Andry...