Sciweavers

340 search results - page 25 / 68
ICML
2003
IEEE
16 years 3 months ago
TD(0) Converges Provably Faster than the Residual Gradient Algorithm
In Reinforcement Learning (RL) there has been some experimental evidence that the residual gradient algorithm converges slower than the TD(0) algorithm. In this paper, we use the ...
Ralf Schoknecht, Artur Merke
NIPS
2004
15 years 3 months ago
Brain Inspired Reinforcement Learning
Successful application of reinforcement learning algorithms often involves considerable hand-crafting of the necessary non-linear features to reduce the complexity of the value fu...
François Rivest, Yoshua Bengio, John Kalask...
IROS
2006
IEEE
15 years 8 months ago
Q-RAN: A Constructive Reinforcement Learning Approach for Robot Behavior Learning
This paper presents a learning system that uses Q-learning with a resource allocating network (RAN) for behavior learning in mobile robotics. The RAN is used as a functi...
Jun Li, Achim J. Lilienthal, Tomás Martí...
NIPS
1998
15 years 3 months ago
Risk Sensitive Reinforcement Learning
In this paper, we consider Markov Decision Processes (MDPs) with error states. Error states are those states entering which is undesirable or dangerous. We define the risk with re...
Ralph Neuneier, Oliver Mihatsch
ICML
2010
IEEE
15 years 3 months ago
Toward Off-Policy Learning Control with Function Approximation
We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...
Hamid Reza Maei, Csaba Szepesvári, Shalabh ...