Sciweavers

4544 search results - page 203 / 909
» Reinforcement Learning with Time
Sort
View
IJCAI
2003
13 years 11 months ago
Use of Off-line Dynamic Programming for Efficient Image Interpretation
An interpretation system finds the likely mappings from portions of an image to real-world objects. An interpretation policy specifies when to apply which imaging operator, to whi...
Ramana Isukapalli, Russell Greiner
NIPS
1998
13 years 11 months ago
Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms
In this paper, we address two issues of long-standing interest in the reinforcement learning literature. First, what kinds of performance guarantees can be made for Q-learning aft...
Michael J. Kearns, Satinder P. Singh
LAWEB
2008
IEEE
14 years 4 months ago
Evolution of the Chilean Web: A Larger Study
In this paper we extend our previous and only study on the dynamics of the Chilean Web. This new study doubles the time period and to the best of our knowledge is the only study o...
Eduardo Graells, Ricardo A. Baeza-Yates
GIS
2009
ACM
14 years 2 months ago
Machine learning approach to report prioritization with an application to travel time dissemination
This paper looks at the problem of data prioritization, commonly found in mobile ad-hoc networks. The proposed general solution uses a machine learning approach in order to learn ...
Piotr Szczurek, Bo Xu, Jie Lin, Ouri Wolfson
ICRA
2009
IEEE
125views Robotics» more  ICRA 2009»
14 years 5 months ago
Learning motor primitives for robotics
— The acquisition and self-improvement of novel motor skills is among the most important problems in robotics. Motor primitives offer one of the most promising frameworks for the...
Jens Kober, Jan Peters