JAIR
2010

Kalman Temporal Differences

Download www.cs.uwaterloo.ca

This paper deals with value (and Q-) function approximation in deterministic Markovian decision processes (MDPs). A general statistical framework based on the Kalman filtering paradigm is introduced. Its principle is to adopt a parametric representation of the value function, to model the associated parameter vector as a random variable, and to minimize the mean-squared error of the parameters conditioned on past observed transitions. From this general framework, called Kalman Temporal Differences (KTD), a family of algorithms is derived using an approximation scheme known as the unscented transform. Unlike most function approximation schemes, this framework inherently provides uncertainty information over the value function, which can be notably useful for handling the exploration/exploitation dilemma.
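The principle described in the abstract can be illustrated with a minimal sketch, under several assumptions that are not taken from the paper itself. The value function is assumed linear in its parameters, V(s) = φ(s)·θ, so the reward observation r = V(s) − γV(s') + noise is linear in θ and a standard Kalman update applies without the unscented transform (which the nonlinear case would need). The parameters are assumed to follow a random walk. The class name `LinearKTD`, the helper functions, the noise settings, and the three-state cycle are all hypothetical choices made for illustration.

```python
def dot(a, b):
    """Inner product of two equal-length lists."""
    return sum(x * y for x, y in zip(a, b))


def mat_vec(M, v):
    """Matrix-vector product for a list-of-rows matrix."""
    return [dot(row, v) for row in M]


class LinearKTD:
    """Kalman-filter value estimation for a linear value function (illustrative)."""

    def __init__(self, n, gamma=0.9, prior_var=10.0,
                 process_noise=0.0, obs_noise=1.0):
        self.gamma = gamma
        self.theta = [0.0] * n                      # parameter mean
        self.P = [[prior_var if i == j else 0.0     # parameter covariance
                   for j in range(n)] for i in range(n)]
        self.process_noise = process_noise          # assumed random-walk noise
        self.obs_noise = obs_noise                  # assumed reward noise

    def update(self, phi, phi_next, reward):
        n = len(self.theta)
        # Prediction step: random walk keeps the mean, inflates the covariance.
        P = [[self.P[i][j] + (self.process_noise if i == j else 0.0)
              for j in range(n)] for i in range(n)]
        # Observation vector: reward = (phi - gamma * phi_next) . theta + noise.
        h = [a - self.gamma * b for a, b in zip(phi, phi_next)]
        Ph = mat_vec(P, h)
        innov_var = dot(h, Ph) + self.obs_noise     # variance of predicted reward
        gain = [x / innov_var for x in Ph]          # Kalman gain
        td_error = reward - dot(h, self.theta)      # innovation (a TD error)
        # Correction step for mean and covariance.
        self.theta = [t + k * td_error for t, k in zip(self.theta, gain)]
        self.P = [[P[i][j] - innov_var * gain[i] * gain[j]
                   for j in range(n)] for i in range(n)]

    def value(self, phi):
        return dot(phi, self.theta)

    def variance(self, phi):
        """Variance of the value estimate along feature vector phi."""
        return dot(phi, mat_vec(self.P, phi))


def run_cycle_demo(n_cycles=5000):
    """Deterministic 3-state cycle 0 -> 1 -> 2 -> 0, reward 1 on leaving state 2."""
    rewards = [0.0, 0.0, 1.0]
    features = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]  # tabular
    agent = LinearKTD(3, gamma=0.9)
    for _ in range(n_cycles):
        for s in range(3):
            agent.update(features[s], features[(s + 1) % 3], rewards[s])
    return agent
```

Calling `run_cycle_demo()` returns an agent whose `theta` can be compared with the values obtained by solving the Bellman equations for this cycle with γ = 0.9, about 2.99, 3.32 and 3.69. The `variance` method exposes the kind of uncertainty information the abstract refers to: it starts at the prior variance and shrinks as transitions are observed.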

Matthieu Geist, Olivier Pietquin

Approximation Schemes | Deterministic Markovian Decision | JAIR 2010 | Value Function

Related Content

» A Generalized Kalman Filter for Fixed Point Approximation and Efficient Temporal Differenc...

» Gaussian Mixture GM Passive Localization using Time Difference of Arrival TDOA

» Bayesian Reward Filtering

» Tuning the Temporal Characteristics of a KalmanFilter Method for EndtoEnd Bandwidth Estima...

» A Dynamic Model for RealTime Tracking of Hands in Bimanual Movements

» Visual Motion Estimation and Prediction A Probabilistic Network Model for Temporal Coheren...

» SpatialTemporal Junction Extraction and Semantic Interpretation

» Resolution Improvement from Stereo Images with 3D Pose Differences

» Combined feature evaluation for adaptive visual object tracking

Post Info

Added	28 Jan 2011
Updated	28 Jan 2011
Type	Journal
Year	2010
Where	JAIR
Authors	Matthieu Geist, Olivier Pietquin