Sciweavers

59 search results - page 4 / 12
» Reinforcement learning of a simple control task using the sp...
Sort
View
ICRA
2009
IEEE
111views Robotics» more  ICRA 2009»
14 years 2 months ago
Model-based and model-free reinforcement learning for visual servoing
— To address the difficulty of designing a controller for complex visual-servoing tasks, two learning-based uncalibrated approaches are introduced. The first method starts by b...
Amir Massoud Farahmand, Azad Shademan, Martin J&au...
JMLR
2002
133views more  JMLR 2002»
13 years 7 months ago
Learning Precise Timing with LSTM Recurrent Networks
The temporal distance between events conveys information essential for numerous sequential tasks such as motor control and rhythm detection. While Hidden Markov Models tend to ign...
Felix A. Gers, Nicol N. Schraudolph, Jürgen S...
IJCAI
2001
13 years 9 months ago
R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning
R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...
Ronen I. Brafman, Moshe Tennenholtz
FLAIRS
2006
13 years 9 months ago
Refining Human Behavior Models in a Context-based Architecture
This paper describes an investigation into the refinement of context-based human behavior models through the use of experiential learning. Specifically, a tactical agent was endow...
David Aihe, Avelino J. Gonzalez
NIPS
1998
13 years 9 months ago
Risk Sensitive Reinforcement Learning
In this paper, we consider Markov Decision Processes (MDPs) with error states. Error states are those states entering which is undesirable or dangerous. We define the risk with re...
Ralph Neuneier, Oliver Mihatsch