Sciweavers

121 search results - page 11 / 25
» Toward Off-Policy Learning Control with Function Approximati...
Sort
View
FBIT
2007
IEEE
14 years 3 months ago
Learning to Drive a Real Car in 20 Minutes
The paper describes our first experiments on Reinforcement Learning to steer a real robot car. The applied method, Neural Fitted Q Iteration (NFQ) is purely data-driven based on ...
Martin Riedmiller, Michael Montemerlo, Hendrik Dah...
IROS
2006
IEEE
190views Robotics» more  IROS 2006»
14 years 3 months ago
Q-RAN: A Constructive Reinforcement Learning Approach for Robot Behavior Learning
Abstract— This paper presents a learning system that uses Qlearning with a resource allocating network (RAN) for behavior learning in mobile robotics. The RAN is used as a functi...
Jun Li, Achim J. Lilienthal, Tomás Mart&iac...
NIPS
2008
13 years 10 months ago
Regularized Policy Iteration
In this paper we consider approximate policy-iteration-based reinforcement learning algorithms. In order to implement a flexible function approximation scheme we propose the use o...
Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csab...
GECCO
2008
Springer
363views Optimization» more  GECCO 2008»
13 years 10 months ago
Towards high speed multiobjective evolutionary optimizers
One of the major difficulties when applying Multiobjective Evolutionary Algorithms (MOEA) to real world problems is the large number of objective function evaluations. Approximate...
A. K. M. Khaled Ahsan Talukder
IJON
2006
90views more  IJON 2006»
13 years 9 months ago
Reinforcement learning of a simple control task using the spike response model
In this work, we propose a variation of a direct reinforcement learning algorithm, suitable for usage with spiking neurons based on the spike response model (SRM). The SRM is a bi...
Murilo Saraiva de Queiroz, Roberto Coelho de Berr&...