Sciweavers

584 search results - page 69 / 117
» Reinforcement Learning Task Clustering
ICML
2009
IEEE
16 years 3 months ago
Regularization and feature selection in least-squares temporal difference learning
We consider the task of reinforcement learning with linear value function approximation. Temporal difference algorithms, and in particular the Least-Squares Temporal Difference (L...
J. Zico Kolter, Andrew Y. Ng
ATAL
2004
Springer
15 years 7 months ago
Learning User Preferences for Wireless Services Provisioning
The problem of interest is how to dynamically allocate wireless access services in a competitive market which implements a take-it-or-leave-it allocation mechanism. In this paper ...
George Lee, Steven Bauer, Peyman Faratin, John Wro...
ICASSP
2008
IEEE
15 years 8 months ago
Using dialogue acts to learn better repair strategies for spoken dialogue systems
Repair or error-recovery strategies are an important design issue in Spoken Dialogue Systems (SDSs) - how to conduct the dialogue when there is no progress (e.g. due to repeated A...
Matthew Frampton, Oliver Lemon
IROS
2008
IEEE
15 years 8 months ago
Dynamic correlation matrix based multi-Q learning for a multi-robot system
Multi-robot reinforcement learning is a very challenging area due to several issues, such as large state spaces, difficulty in reward assignment, nondeterministic action selecti...
Hongliang Guo, Yan Meng
ECAI
2004
Springer
15 years 7 months ago
Piece-Wise Model Fitting Using Local Data Patterns
In this paper we propose a novel classification algorithm that fits models of different complexity on separate regions of the input space. The goal is to achieve a balance betwee...
Ricardo Vilalta, Murali-Krishna Achari, Christoph ...