Sciweavers

181 search results - page 20 / 37
» On Policy Learning in Restricted Policy Spaces
Sort
View
ICRA
2002
IEEE
141views Robotics» more  ICRA 2002»
14 years 17 days ago
Movement Imitation with Nonlinear Dynamical Systems in Humanoid Robots
This article presents a new approach to movement planning, on-line trajectory modiļ¬cation, and imitation learning by representing movement plans based on a set of nonlinear diļ¬...
Auke Jan Ijspeert, Jun Nakanishi, Stefan Schaal
ATAL
2010
Springer
13 years 8 months ago
Strategy exploration in empirical games
Empirical analyses of complex games necessarily focus on a restricted set of strategies, and thus the value of empirical game models depends on effective methods for selectively e...
Patrick R. Jordan, L. Julian Schvartzman, Michael ...
EWRL
2008
13 years 9 months ago
Regularized Fitted Q-Iteration: Application to Planning
We consider planning in a Markovian decision problem, i.e., the problem of finding a good policy given access to a generative model of the environment. We propose to use fitted Q-i...
Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csab...
NIPS
1998
13 years 9 months ago
Risk Sensitive Reinforcement Learning
In this paper, we consider Markov Decision Processes (MDPs) with error states. Error states are those states entering which is undesirable or dangerous. We define the risk with re...
Ralph Neuneier, Oliver Mihatsch
AAAI
1998
13 years 9 months ago
Tree Based Discretization for Continuous State Space Reinforcement Learning
Reinforcement learning is an effective technique for learning action policies in discrete stochastic environments, but its efficiency can decay exponentially with the size of the ...
William T. B. Uther, Manuela M. Veloso