Sciweavers

1340 search results - page 11 / 268
» Kalman Temporal Differences
Sort
View
ICML
2001
IEEE
14 years 8 months ago
Off-Policy Temporal Difference Learning with Function Approximation
We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...
Doina Precup, Richard S. Sutton, Sanjoy Dasgupta
ICIP
1997
IEEE
14 years 9 months ago
An 8x8-Block Based Motion Estimation Using Kalman Filter
It is now quite common in the pel-recursive approaches for motion estimation, to find applications of the Kalman filtering technique both in time and frequency domains. In the blo...
V. Ruiz, Vassilis E. Fotopoulos, Athanassios N. Sk...
ICML
2003
IEEE
14 years 8 months ago
Bayes Meets Bellman: The Gaussian Process Approach to Temporal Difference Learning
We present a novel Bayesian approach to the problem of value function estimation in continuous state spaces. We define a probabilistic generative model for the value function by i...
Yaakov Engel, Shie Mannor, Ron Meir
ICML
2000
IEEE
14 years 8 months ago
Relative Loss Bounds for Temporal-Difference Learning
Jürgen Forster, Manfred K. Warmuth