Sciweavers

113 search results - page 17 / 23
» Model Approximation for HEXQ Hierarchical Reinforcement Lear...
Sort
View
ECML
2005
Springer
14 years 1 months ago
Model-Based Online Learning of POMDPs
Abstract. Learning to act in an unknown partially observable domain is a difficult variant of the reinforcement learning paradigm. Research in the area has focused on model-free m...
Guy Shani, Ronen I. Brafman, Solomon Eyal Shimony
JMLR
2010
119views more  JMLR 2010»
13 years 2 months ago
A Convergent Online Single Time Scale Actor Critic Algorithm
Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...
Dotan Di Castro, Ron Meir
IROS
2007
IEEE
168views Robotics» more  IROS 2007»
14 years 1 months ago
Improving humanoid locomotive performance with learnt approximated dynamics via Gaussian processes for regression
Abstract— We propose to improve the locomotive performance of humanoid robots by using approximated biped stepping and walking dynamics with reinforcement learning (RL). Although...
Jun Morimoto, Christopher G. Atkeson, Gen Endo, Go...
SMC
2007
IEEE
118views Control Systems» more  SMC 2007»
14 years 1 months ago
One-class learning with multi-objective genetic programming
One-class classification naturally only provides one class of exemplars on which to construct the classification model. In this work, multiobjective genetic programming (GP) all...
Robert Curry, Malcolm I. Heywood
ATAL
2005
Springer
14 years 1 months ago
An integrated framework for adaptive reasoning about conversation patterns
We present an integrated approach for reasoning about and learning conversation patterns in multiagent communication. The approach is based on the assumption that information abou...
Michael Rovatsos, Felix A. Fischer, Gerhard Wei&sz...