Sciweavers

2108 search results - page 140 / 422
» Tracking in Reinforcement Learning
ICML
1995
IEEE
14 years 11 months ago
Tracking the Best Expert
Mark Herbster, Manfred K. Warmuth
COLT
2006
Springer
14 years 1 months ago
Tracking the Best Hyperplane with a Simple Budget Perceptron
Nicolò Cesa-Bianchi, Claudio Gentile
SOCROB
2010
13 years 8 months ago
Using the Interaction Rhythm as a Natural Reinforcement Signal for Social Robots: A Matter of Belief
Abstract. In this paper, we present the results of a pilot study of a human robot interaction experiment where the rhythm of the interaction is used as a reinforcement signal to le...
Antoine Hiolle, Lola Cañamero, Pierre Andry...
ICRA
2009
IEEE
14 years 4 months ago
Least absolute policy iteration for robust value function approximation
Abstract. Least-squares policy iteration is a useful reinforcement learning method in robotics due to its computational efficiency. However, it tends to be sensitive to outliers...
Masashi Sugiyama, Hirotaka Hachiya, Hisashi Kashim...
COLING
2000
13 years 11 months ago
Automatic Optimization of Dialogue Management
Designing the dialogue strategy of a spoken dialogue system involves many nontrivial choices. This paper presents a reinforcement learning approach for automatically optimizing di...
Diane J. Litman, Michael S. Kearns, Satinder P. Si...