Sciweavers

171 search results - page 14 / 35
» Principled Methods for Advising Reinforcement Learning Agent...
Sort
View
IROS
2007
IEEE
157views Robotics» more  IROS 2007»
14 years 2 months ago
Autonomous blimp control using model-free reinforcement learning in a continuous state and action space
— In this paper, we present an approach that applies the reinforcement learning principle to the problem of learning height control policies for aerial blimps. In contrast to pre...
Axel Rottmann, Christian Plagemann, Peter Hilgers,...
PRIMA
2009
Springer
14 years 2 months ago
Recursive Adaptation of Stepsize Parameter for Non-stationary Environments
In this article, we propose a method to adapt stepsize parameters used in reinforcement learning for dynamic environments. In general reinforcement learning situations, a stepsize...
Itsuki Noda
IAT
2008
IEEE
13 years 7 months ago
Scaling Up Multi-agent Reinforcement Learning in Complex Domains
TD-FALCON (Temporal Difference - Fusion Architecture for Learning, COgnition, and Navigation) is a class of self-organizing neural networks that incorporates Temporal Difference (...
Dan Xiao, Ah-Hwee Tan
NN
2007
Springer
105views Neural Networks» more  NN 2007»
13 years 7 months ago
Guiding exploration by pre-existing knowledge without modifying reward
Reinforcement learning is based on exploration of the environment and receiving reward that indicates which actions taken by the agent are good and which ones are bad. In many app...
Kary Främling
ISDA
2009
IEEE
14 years 2 months ago
Postponed Updates for Temporal-Difference Reinforcement Learning
This paper presents postponed updates, a new strategy for TD methods that can improve sample efficiency without incurring the computational and space requirements of model-based ...
Harm van Seijen, Shimon Whiteson