Sciweavers

Qualitative reinforcement learning
SOCROB
2010
Robotics
Using the Interaction Rhythm as a Natural Reinforcement Signal for Social Robots: A Matter of Belief
Abstract: In this paper, we present the results of a pilot study of a human-robot interaction experiment where the rhythm of the interaction is used as a reinforcement signal to le...
Antoine Hiolle, Lola Cañamero, Pierre Andry...
ICRA
2009
IEEE
Robotics
Least absolute policy iteration for robust value function approximation
Abstract: Least-squares policy iteration is a useful reinforcement learning method in robotics due to its computational efficiency. However, it tends to be sensitive to outliers...
Masashi Sugiyama, Hirotaka Hachiya, Hisashi Kashim...
COLING
2000
Automatic Optimization of Dialogue Management
Designing the dialogue strategy of a spoken dialogue system involves many nontrivial choices. This paper presents a reinforcement learning approach for automatically optimizing di...
Diane J. Litman, Michael S. Kearns, Satinder P. Si...
NIPS
2008
Goal-directed decision making in prefrontal cortex: a computational framework
Research in animal learning and behavioral neuroscience has distinguished between two forms of action control: a habit-based form, which relies on stored action values, and a goal...
Matthew Botvinick, James An
ATAL
2006
Springer
Learning the required number of agents for complex tasks
Coordinating agents in a complex environment is a hard problem, but it can become even harder when certain characteristics of the tasks, like the required number of agents, are un...
Sébastien Paquet, Brahim Chaib-draa