Sciweavers

» Qualitative reinforcement learning
ICCBR
2010
Springer
14 years 1 month ago
Reducing the Memory Footprint of Temporal Difference Learning over Finitely Many States by Using Case-Based Generalization
In this paper we present an approach for reducing the memory footprint requirement of temporal difference methods in which the set of states is finite. We use case-based generaliza...
Matt Dilts, Héctor Muñoz-Avila
ECAI
2008
Springer
13 years 11 months ago
Learning to Select Object Recognition Methods for Autonomous Mobile Robots
Selecting which algorithms should be used by a mobile robot computer vision system is a decision that is usually made a priori by the system developer, based on past experience and...
Reinaldo A. C. Bianchi, Arnau Ramisa, Ramon Ló...
AOIS
2004
13 years 10 months ago
Market-Based Recommender Systems: Learning Users' Interests by Quality Classification
Recommender systems are widely used to cope with the problem of information overload and, consequently, many recommendation methods have been developed. However, no one technique i...
Yan Zheng Wei, Luc Moreau, Nicholas R. Jennings
NIPS
1993
13 years 10 months ago
Temporal Difference Learning of Position Evaluation in the Game of Go
The game of Go has a high branching factor that defeats the tree search approach used in computer chess, and long-range spatiotemporal interactions that make position evaluation e...
Nicol N. Schraudolph, Peter Dayan, Terrence J. Sej...
JAIR
2007
13 years 9 months ago
Closed-Loop Learning of Visual Control Policies
In this paper we present a general, flexible framework for learning mappings from images to actions by interacting with the environment. The basic idea is to introduce a feature-...
Sébastien Jodogne, Justus H. Piater