Sciweavers

2011 search results - page 5 / 403
» Universal Reinforcement Learning
Sort
View
ICML
1998
IEEE
14 years 8 months ago
Multi-criteria Reinforcement Learning
Csaba Szepesvári, Zoltán Gábo...
ICML
1996
IEEE
14 years 8 months ago
On-Line Adaptation of a Signal Predistorter through Dual Reinforcement Learning
Patrick Goetz, Shailesh Kumar, Risto Miikkulainen
CORR
1998
Springer
164views Education» more  CORR 1998»
13 years 6 months ago
Training Reinforcement Neurocontrollers Using the Polytope Algorithm
A new training algorithm is presented for delayed reinforcement learning problems that does not assume the existence of a critic model and employs the polytope optimization algorit...
Aristidis Likas, Isaac E. Lagaris
ESANN
2006
13 years 8 months ago
Reducing policy degradation in neuro-dynamic programming
We focus on neuro-dynamic programming methods to learn state-action value functions and outline some of the inherent problems to be faced, when performing reinforcement learning in...
Thomas Gabel, Martin Riedmiller