Sciweavers

1799 search results - page 5 / 360
» Filtered Reinforcement Learning
Sort
View
ICML
1998
IEEE
14 years 8 months ago
Multi-criteria Reinforcement Learning
Csaba Szepesvári, Zoltán Gábo...
ICML
1996
IEEE
14 years 8 months ago
On-Line Adaptation of a Signal Predistorter through Dual Reinforcement Learning
Patrick Goetz, Shailesh Kumar, Risto Miikkulainen
AROBOTS
2008
131views more  AROBOTS 2008»
13 years 7 months ago
Active audition using the parameter-less self-organising map
This paper presents a novel method for enabling a robot to determine the position of a sound source in three dimensions using just two microphones and interaction with its environm...
Erik Berglund, Joaquin Sitte, Gordon Wyeth
CORR
1998
Springer
164views Education» more  CORR 1998»
13 years 7 months ago
Training Reinforcement Neurocontrollers Using the Polytope Algorithm
A new training algorithm is presented for delayed reinforcement learning problems that does not assume the existence of a critic model and employs the polytope optimization algorit...
Aristidis Likas, Isaac E. Lagaris