APRICODD: Approximate Policy Construction Using Decision Diagrams

Download www.cs.ubc.ca

We propose a method of approximate dynamic programming for Markov decision processes (MDPs) using algebraic decision diagrams (ADDs). We produce near-optimal value functions and policies with much lower time and space requirements than exact dynamic programming. Our method reduces the sizes of the intermediate value functions generated during value iteration by replacing the values at the terminals of the ADD with ranges of values. Our method is demonstrated on a class of large MDPs (with up to 34 billion states), and we compare the results with the optimal value functions.
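To illustrate the idea of replacing terminal values with ranges, here is a minimal sketch. It is not the authors' implementation: it works on a flat list of terminal values rather than a real ADD, and the function name `merge_terminals`, the tolerance parameter `tol`, and the greedy grouping rule are all assumptions made for illustration. The point is only that nearby values can share one range terminal, so the diagram has fewer distinct leaves.

```python
from typing import Dict, List, Tuple

Range = Tuple[float, float]


def merge_terminals(values: List[float], tol: float) -> Dict[float, Range]:
    """Greedily group terminal values so that each group spans at most `tol`.

    Returns a map from each original terminal value to the (lo, hi) range
    that would replace it. Hypothetical helper, not taken from the paper.
    """
    mapping: Dict[float, Range] = {}
    group: List[float] = []
    for v in sorted(set(values)):
        # Close the current group once adding v would exceed the tolerance.
        if group and v - group[0] > tol:
            rng = (group[0], group[-1])
            for g in group:
                mapping[g] = rng
            group = []
        group.append(v)
    if group:
        rng = (group[0], group[-1])
        for g in group:
            mapping[g] = rng
    return mapping


if __name__ == "__main__":
    terminals = [0.0, 0.1, 0.15, 5.0, 5.05, 9.0]
    merged = merge_terminals(terminals, tol=0.2)
    for value in terminals:
        print(value, "->", merged[value])
    print("distinct terminals:", len(set(merged.values())))
```

In this sketch a larger `tol` yields fewer distinct terminals at the cost of wider ranges, which mirrors the size-versus-accuracy trade-off the abstract describes.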

Robert St-Aubin, Jesse Hoey, Craig Boutilier

Dynamic Programming | Near-optimal Value Functions | NIPS 2000 | NIPS 2007 | Value Functions

Post Info

Added	01 Nov 2010
Updated	01 Nov 2010
Type	Conference
Year	2000
Where	NIPS
Authors	Robert St-Aubin, Jesse Hoey, Craig Boutilier
