Sciweavers

2108 search results - page 140 / 422
» Tracking in Reinforcement Learning
Sort
View
94
Voted
ICML
1995
IEEE
16 years 3 months ago
Tracking the Best Expert
Mark Herbster, Manfred K. Warmuth
93
Voted
COLT
2006
Springer
15 years 6 months ago
Tracking the Best Hyperplane with a Simple Budget Perceptron
Nicolò Cesa-Bianchi, Claudio Gentile
126
Voted
SOCROB
2010
126views Robotics» more  SOCROB 2010»
15 years 1 months ago
Using the Interaction Rhythm as a Natural Reinforcement Signal for Social Robots: A Matter of Belief
Abstract. In this paper, we present the results of a pilot study of a human robot interaction experiment where the rhythm of the interaction is used as a reinforcement signal to le...
Antoine Hiolle, Lola Cañamero, Pierre Andry...
119
Voted
ICRA
2009
IEEE
143views Robotics» more  ICRA 2009»
15 years 9 months ago
Least absolute policy iteration for robust value function approximation
Abstract— Least-squares policy iteration is a useful reinforcement learning method in robotics due to its computational efficiency. However, it tends to be sensitive to outliers...
Masashi Sugiyama, Hirotaka Hachiya, Hisashi Kashim...
148
Voted
COLING
2000
15 years 4 months ago
Automatic Optimization of Dialogue Management
Designing the dialogue strategy of a spoken dialogue system involves many nontrivial choices. This paper presents a reinforcement learning approach for automatically optimizing di...
Diane J. Litman, Michael S. Kearns, Satinder P. Si...