Sciweavers

» Reinforcement Using Supervised Learning for Policy Generaliz...
ML
2002
ACM
13 years 7 months ago
Near-Optimal Reinforcement Learning in Polynomial Time
We present new algorithms for reinforcement learning, and prove that they have polynomial bounds on the resources required to achieve near-optimal return in general Markov decisio...
Michael J. Kearns, Satinder P. Singh
ECML
2005
Springer
14 years 1 months ago
Natural Actor-Critic
This paper investigates a novel model-free reinforcement learning architecture, the Natural Actor-Critic. The actor updates are based on stochastic policy gradients employing Amari...
Jan Peters, Sethu Vijayakumar, Stefan Schaal
CSL
2012
Springer
12 years 3 months ago
Reinforcement learning for parameter estimation in statistical spoken dialogue systems
Reinforcement techniques have been successfully used to maximise the expected cumulative reward of statistical dialogue systems. Typically, reinforcement learning is used to estim...
Filip Jurčíček, Blaise Thomson, Steve Young
FLAIRS
2003
13 years 9 months ago
Learning from Reinforcement and Advice Using Composite Reward Functions
Reinforcement learning has become a widely used methodology for creating intelligent agents in a wide range of applications. However, its performance deteriorates in tasks with s...
Vinay N. Papudesi, Manfred Huber
ICRA
2006
IEEE
14 years 1 months ago
Using Reinforcement Learning to Improve Exploration Trajectories for Error Minimization
The mapping and localization problems have received considerable attention in robotics recently. The exploration problem that drives mapping has started to generate sim...
Thomas Kollar, Nicholas Roy