Sciweavers

ICML
1995
IEEE

Stable Function Approximation in Dynamic Programming

Download www.ri.cmu.edu

The success of reinforcement learning in practical problems depends on the ability to combine function approximation with temporal difference methods such as value iteration. Experiments in this area have produced mixed results; there have been both notable successes and notable disappointments. Theory has been scarce, mostly due to the difficulty of reasoning about function approximators that generalize beyond the observed data. We provide a proof of convergence for a wide class of temporal difference methods involving function approximators such as k-nearest-neighbor, and show experimentally that these methods can be useful. The proof is based on a view of function approximators as expansion or contraction mappings. In addition, we present a novel view of fitted value iteration: an approximate algorithm for one environment turns out to be an exact algorithm for a different environment.
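The convergence idea in the abstract can be illustrated with a minimal sketch. It is not taken from the paper: the chain environment, the sample states, the discount factor, and every function name below are assumptions chosen for illustration. The sketch alternates a Bellman backup at a few sample states with a k-nearest-neighbor averaging step that extends those values to all states. Because each fitted value is a fixed convex combination of the backed-up targets, the fitting step cannot increase max-norm distances, so composing it with the discounted backup still gives a contraction.

```python
# Sketch of fitted value iteration with a k-nearest-neighbor averager.
# The chain MDP, sample set, and all names here are illustrative assumptions.

GAMMA = 0.9                      # discount factor (assumed)
N_STATES = 11                    # states 0..10 on a line; state 10 is the goal
SAMPLES = [0, 2, 5, 8, 10]       # states where backups are actually computed
K = 2                            # neighbors averaged by the approximator


def step(state, action):
    """Deterministic move left (-1) or right (+1); reward -1 per step."""
    nxt = min(max(state + action, 0), N_STATES - 1)
    return nxt, -1.0


def knn_fit(targets):
    """Extend values at SAMPLES to every state by averaging the K nearest samples.

    Each output is a convex combination of the targets with weights that do
    not depend on the targets, so this map is a non-expansion in max norm.
    """
    values = []
    for s in range(N_STATES):
        # Ties in distance are broken toward the smaller sample index.
        nearest = sorted(SAMPLES, key=lambda x: (abs(x - s), x))[:K]
        values.append(sum(targets[x] for x in nearest) / K)
    return values


def backup(values):
    """One Bellman backup, computed at the sample states only."""
    targets = {}
    for s in SAMPLES:
        if s == N_STATES - 1:    # goal is treated as absorbing with value 0
            targets[s] = 0.0
            continue
        targets[s] = max(r + GAMMA * values[n]
                         for n, r in (step(s, a) for a in (-1, 1)))
    return targets


def fitted_value_iteration(tol=1e-10, max_iters=10_000):
    """Iterate fit(backup(.)) until the max-norm change drops below tol."""
    values = [0.0] * N_STATES
    for i in range(max_iters):
        new_values = knn_fit(backup(values))
        change = max(abs(a - b) for a, b in zip(values, new_values))
        values = new_values
        if change < tol:
            return values, i + 1
    return values, max_iters
```

Calling `fitted_value_iteration()` returns the fitted values and the number of sweeps used. The fitted values generally differ from the true optimal values of the chain (the goal state, for instance, is averaged with its neighbor), which is the sense in which the method is approximate for this environment while still being guaranteed to settle on a fixed point.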

Geoffrey J. Gordon

Difference Methods | ICML 1995 | Machine Learning | Methods Involving Function | Temporal Difference

Related Content

» Stable Dual Dynamic Programming

» Approximate robust dynamic programming and robustly stable MPC

» A Constraint Generation Approach to Learning Stable Linear Dynamical Systems

» Approximate dynamic programming using fluid and diffusion approximations with applications...

» Automatic basis function construction for approximate dynamic programming and reinforcemen...

» Universal stabilization using control Lyapunov functions adaptive derivative feedback and ...

» Design of Asymptotically Stable Walking for a 5Link Planar Biped Walker via Optimization

» Dynamic Policy Programming

» Generalization in Reinforcement Learning Safely Approximating the Value Function

Post Info

Added	17 Nov 2009
Updated	17 Nov 2009
Type	Conference
Year	1995
Where	ICML
Authors	Geoffrey J. Gordon
