Stable Dual Dynamic Programming

15 years 8 months ago

Download webdocs.cs.ualberta.ca

Recently, we have introduced a novel approach to dynamic programming and reinforcement learning that is based on maintaining explicit representations of stationary distributions instead of value functions. In this paper, we investigate the convergence properties of these dual algorithms both theoretically and empirically, and show how they can be scaled up by incorporating function approximation.

Tao Wang, Daniel J. Lizotte, Michael H. Bowling, D

Real-time Traffic

Dynamic Programming | Explicit Representations | Information Technology | NIPS 2007 | Stationary Distributions |

claim paper

» Stable Function Approximation in Dynamic Programming

» Approximate robust dynamic programming and robustly stable MPC

» Programming and coordinating Grid environments and applications

» Reliable DualBand Based Contour Detection A Double Dynamic Programming Approach

» On the convergence of stochastic dual dynamic programming and related methods

» Dual reciprocity BEM and dynamic programming filter for inverse elastodynamic problems

» Analysis of stochastic dual dynamic programming method

» Semantics for Dynamic Logic Programming A PrincipleBased Approach

Post Info
More Details (n/a)

Added	30 Oct 2010
Updated	30 Oct 2010
Type	Conference
Year	2007
Where	NIPS
Authors	Tao Wang, Daniel J. Lizotte, Michael H. Bowling, Dale Schuurmans

Comments (0)

Sciweavers

Stable Dual Dynamic Programming

Dynamic Programming | Explicit Representations | Information Technology | NIPS 2007 | Stationary Distributions |

Explore & Download

Productivity Tools

Sciweavers