Sciweavers

259 search results - page 15 / 52
» Reinforcement Learning with the Use of Costly Features
Sort
View
CORR
2010
Springer
204views Education» more  CORR 2010»
13 years 8 months ago
Predictive State Temporal Difference Learning
We propose a new approach to value function approximation which combines linear temporal difference reinforcement learning with subspace identification. In practical applications...
Byron Boots, Geoffrey J. Gordon
AGENTS
1998
Springer
14 years 2 months ago
Learning Situation-Dependent Costs: Improving Planning from Probabilistic Robot Execution
Physical domains are notoriously hard to model completely and correctly, especially to capture the dynamics of the environment. Moreover, since environments change, it is even mor...
Karen Zita Haigh, Manuela M. Veloso
RAS
2010
164views more  RAS 2010»
13 years 8 months ago
Bridging the gap between feature- and grid-based SLAM
One important design decision for the development of autonomously navigating mobile robots is the choice of the representation of the environment. This includes the question which...
Kai M. Wurm, Cyrill Stachniss, Giorgio Grisetti
ECML
2006
Springer
14 years 1 months ago
Approximate Policy Iteration for Closed-Loop Learning of Visual Tasks
Abstract. Approximate Policy Iteration (API) is a reinforcement learning paradigm that is able to solve high-dimensional, continuous control problems. We propose to exploit API for...
Sébastien Jodogne, Cyril Briquet, Justus H....
ICASSP
2008
IEEE
14 years 4 months ago
Using dialogue acts to learn better repair strategies for spoken dialogue systems
Repair or error-recovery strategies are an important design issue in Spoken Dialogue Systems (SDSs) - how to conduct the dialogue when there is no progress (e.g. due to repeated A...
Matthew Frampton, Oliver Lemon