Search Sciweavers | Sciweavers

259 search results - page 15 / 52

» Reinforcement Learning with the Use of Costly Features

194

click to vote

CORR
2010
Springer

204views Education» more CORR 2010»

Predictive State Temporal Difference Learning

15 years 4 months ago

Download www.cs.cmu.edu

We propose a new approach to value function approximation which combines linear temporal difference reinforcement learning with subspace identiﬁcation. In practical applications...

Byron Boots, Geoffrey J. Gordon

claim paper

Read More »

188

click to vote

AGENTS
1998
Springer

175views Security Privacy» more AGENTS 1998»

Learning Situation-Dependent Costs: Improving Planning from Probabilistic Robot Execution

15 years 10 months ago

Download www.cs.cmu.edu

Physical domains are notoriously hard to model completely and correctly, especially to capture the dynamics of the environment. Moreover, since environments change, it is even mor...

Karen Zita Haigh, Manuela M. Veloso

claim paper

Read More »

154

click to vote

RAS
2010

164views more RAS 2010»

Bridging the gap between feature- and grid-based SLAM

15 years 4 months ago

Download www.informatik.uni-freiburg.de

One important design decision for the development of autonomously navigating mobile robots is the choice of the representation of the environment. This includes the question which...

Kai M. Wurm, Cyrill Stachniss, Giorgio Grisetti

claim paper

Read More »

154

click to vote

ECML
2006
Springer

141views Machine Learning» more ECML 2006»

Approximate Policy Iteration for Closed-Loop Learning of Visual Tasks

15 years 9 months ago

Download www.montefiore.ulg.ac.be

Abstract. Approximate Policy Iteration (API) is a reinforcement learning paradigm that is able to solve high-dimensional, continuous control problems. We propose to exploit API for...

Sébastien Jodogne, Cyril Briquet, Justus H....

claim paper

Read More »

152

click to vote

ICASSP
2008
IEEE

121views Signal Processing» more ICASSP 2008»

Using dialogue acts to learn better repair strategies for spoken dialogue systems

16 years 7 days ago

Download www.stanford.edu

Repair or error-recovery strategies are an important design issue in Spoken Dialogue Systems (SDSs) - how to conduct the dialogue when there is no progress (e.g. due to repeated A...

Matthew Frampton, Oliver Lemon

claim paper

Read More »

« Prev « First page 15 / 52 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers