Search Sciweavers | Sciweavers

584 search results - page 69 / 117

» Reinforcement Learning Task Clustering

ICML
2009
IEEE

186 views, Machine Learning

Regularization and feature selection in least-squares temporal difference learning

16 years 7 months ago

Download ai.stanford.edu

We consider the task of reinforcement learning with linear value function approximation. Temporal difference algorithms, and in particular the Least-Squares Temporal Difference (L...

J. Zico Kolter, Andrew Y. Ng

ATAL
2004
Springer

149 views, Intelligent Agents

Learning User Preferences for Wireless Services Provisioning

16 years 13 days ago

Download people.csail.mit.edu

The problem of interest is how to dynamically allocate wireless access services in a competitive market which implements a take-it-or-leave-it allocation mechanism. In this paper ...

George Lee, Steven Bauer, Peyman Faratin, John Wro...

ICASSP
2008
IEEE

121 views, Signal Processing

Using dialogue acts to learn better repair strategies for spoken dialogue systems

16 years 1 months ago

Download www.stanford.edu

Repair or error-recovery strategies are an important design issue in Spoken Dialogue Systems (SDSs) - how to conduct the dialogue when there is no progress (e.g. due to repeated A...

Matthew Frampton, Oliver Lemon

IROS
2008
IEEE

125 views, Robotics

Dynamic correlation matrix based multi-Q learning for a multi-robot system

16 years 1 months ago

Download www.ece.stevens-tech.edu

Multi-robot reinforcement learning is a very challenging area due to several issues, such as large state spaces, difficulty in reward assignment, nondeterministic action selecti...

Hongliang Guo, Yan Meng

ECAI
2004
Springer

104 views, Artificial Intelligence

Piece-Wise Model Fitting Using Local Data Patterns

16 years 13 days ago

Download www2.cs.uh.edu

In this paper we propose a novel classification algorithm that fits models of different complexity on separate regions of the input space. The goal is to achieve a balance betwee...

Ricardo Vilalta, Murali-Krishna Achari, Christoph ...
