Abstract. We describe a way to improve the performance of MDP planners by modifying them to use lower and upper bounds to eliminate non-optimal actions during their search. First, we discuss a particular abstraction formulation of MDP planning problems and how to use that formulation to compute bounds on the Q-functions of those planning problems. Then, we describe how to incorporate those bounds into a large class of MDP planning algorithms to control their search during planning. We provide theorems establishing the correctness of this technique and an experimental evaluation demonstrating its effectiveness. We incorporated our ideas into two MDP planners: the Real Time Dynamic Programming (RTDP) algorithm [1] and the Adaptive Multistage (AMS) sampling algorithm [2], taken from the automated planning and operations research communities, respectively. Our experiments on an Unmanned Aerial Vehicle (UAV) path-planning problem demonstrate that our action-elimination technique provides significant performance improvements.
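
The abstract states the pruning idea only in prose; the following is a minimal sketch, not taken from the paper, of the kind of bound-based action elimination it alludes to. It assumes lower- and upper-bound functions `q_lower(s, a)` and `q_upper(s, a)` on the Q-function are already available (e.g., computed from the abstraction the abstract mentions); all names are hypothetical. For a reward-maximizing MDP, an action a at state s cannot be optimal if its upper bound falls below the best lower bound over all actions at s.

```python
def eliminate_nonoptimal_actions(state, actions, q_lower, q_upper):
    """Return the actions at `state` that might still be optimal.

    Pruning rule (maximization MDP): eliminate action a whenever
        q_upper(state, a) < max over a' of q_lower(state, a').
    Provably non-optimal actions are dropped; ties are kept.
    """
    best_lower = max(q_lower(state, a) for a in actions)
    return [a for a in actions if q_upper(state, a) >= best_lower]


# Toy usage with hand-made bounds on a single state:
q_lo = {("s0", "a"): 4.0, ("s0", "b"): 1.0}
q_hi = {("s0", "a"): 6.0, ("s0", "b"): 3.0}
surviving = eliminate_nonoptimal_actions(
    "s0", ["a", "b"],
    q_lower=lambda s, a: q_lo[(s, a)],
    q_upper=lambda s, a: q_hi[(s, a)],
)
# surviving == ["a"]: action "b" is pruned because its upper
# bound (3.0) is below the best lower bound (4.0).
```

In this style of integration, a planner such as RTDP or AMS would apply such a filter at each state it visits, restricting its Bellman backups or sampling to the surviving actions.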