Sciweavers

4544 search results - page 160 / 909
» Reinforcement Learning with Time
Sort
View
ICMLA
2010
13 years 8 months ago
Multimodal Parameter-exploring Policy Gradients
Abstract-- Policy Gradients with Parameter-based Exploration (PGPE) is a novel model-free reinforcement learning method that alleviates the problem of high-variance gradient estima...
Frank Sehnke, Alex Graves, Christian Osendorfer, J...
ITNG
2007
IEEE
14 years 4 months ago
Input Fuzzy Modeling for the Recognition of Handwritten Hindi Numerals
This paper presents the recognition of Handwritten Hindi Numerals based on the modified exponential membership function fitted to the fuzzy sets derived from normalized distance f...
Madasu Hanmandlu, J. Grover, Vamsi Krishna Madasu,...
ATAL
2003
Springer
14 years 3 months ago
A selection-mutation model for q-learning in multi-agent systems
Although well understood in the single-agent framework, the use of traditional reinforcement learning (RL) algorithms in multi-agent systems (MAS) is not always justified. The fe...
Karl Tuyls, Katja Verbeeck, Tom Lenaerts
ATAL
2008
Springer
14 years 5 days ago
A new perspective to the keepaway soccer: the takers
Keepaway is a sub-problem of RoboCup Soccer Simulator in which 'the keepers' try to maintain the possession of the ball, while 'the takers' try to steal the ba...
Atil Iscen, Umut Erogul
AAAI
1992
13 years 11 months ago
Automatic Programming of Robots Using Genetic Programming
The goal in automatic programming is to get a computer to perform a task by telling it what needs to be done, rather than by explicitly programming it. This paper considers the ta...
John R. Koza, James Rice