Sciweavers

NIPS
2007

Stable Dual Dynamic Programming

14 years 29 days ago
Stable Dual Dynamic Programming
Recently, we have introduced a novel approach to dynamic programming and reinforcement learning that is based on maintaining explicit representations of stationary distributions instead of value functions. In this paper, we investigate the convergence properties of these dual algorithms both theoretically and empirically, and show how they can be scaled up by incorporating function approximation.
Tao Wang, Daniel J. Lizotte, Michael H. Bowling, D
Added 30 Oct 2010
Updated 30 Oct 2010
Type Conference
Year 2007
Where NIPS
Authors Tao Wang, Daniel J. Lizotte, Michael H. Bowling, Dale Schuurmans
Comments (0)