Improving Anytime Point-Based Value Iteration Using Principled Point Selections

Planning in partially-observable dynamical systems (such as POMDPs and PSRs) is a computationally challenging task. Popular approximation techniques that have proved successful are point-based planning methods, including point-based value iteration (PBVI), which works by approximating the solution at a finite set of points. These point-based methods are typically anytime algorithms, whereby an initial solution is obtained using a small set of points, and the solution may be incrementally improved by including additional points. We introduce a family of anytime PBVI algorithms that use the information present in the current solution for identifying and adding new points that have the potential to best improve the next solution. We motivate and present two different methods for choosing points and evaluate their performance empirically, demonstrating that high-quality solutions can be obtained with significantly fewer points than previous PBVI approaches.
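To make "approximating the solution at a finite set of points" concrete, here is a minimal sketch of the standard point-based backup for a discrete POMDP. It is not the authors' code and does not show their point-selection methods, which the abstract does not detail. The names `point_backup` and `pbvi_sweep` and the nested-list layouts `T[a][s][s2]` (transition), `Z[a][s2][o]` (observation) and `R[a][s]` (reward) are assumptions made for illustration.

```python
def dot(u, v):
    """Inner product of two equal-length sequences."""
    return sum(x * y for x, y in zip(u, v))


def point_backup(b, alphas, T, Z, R, gamma):
    """Return the backed-up alpha vector that is best at belief point b.

    Assumed (hypothetical) layout: T[a][s][s2], Z[a][s2][o], R[a][s].
    """
    n_states = len(b)
    n_obs = len(Z[0][0])
    best_vec, best_val = None, float("-inf")
    for a in range(len(T)):
        g_a = list(R[a])  # start from the immediate reward for action a
        for o in range(n_obs):
            # Project every current alpha vector back through (a, o).
            projected = [
                [
                    sum(T[a][s][s2] * Z[a][s2][o] * alpha[s2]
                        for s2 in range(n_states))
                    for s in range(n_states)
                ]
                for alpha in alphas
            ]
            # Keep only the projection that scores highest at this belief.
            best = max(projected, key=lambda g: dot(b, g))
            g_a = [g_a[s] + gamma * best[s] for s in range(n_states)]
        val = dot(b, g_a)
        if val > best_val:
            best_vec, best_val = g_a, val
    return best_vec


def pbvi_sweep(beliefs, alphas, T, Z, R, gamma):
    """One value-iteration sweep: one backed-up vector per belief point."""
    return [point_backup(b, alphas, T, Z, R, gamma) for b in beliefs]
```

In an anytime scheme of the kind the abstract describes, sweeps like this would alternate with a step that adds new belief points to `beliefs`; how those points are chosen is the paper's contribution and is left out of this sketch.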

Michael R. James, Michael E. Samples, Dmitri A. Do

Artificial Intelligence | IJCAI 2007 | Partially-observable Dynamical Systems | Point-based Planning Methods | Solutions |

Post Info

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2007
Where	IJCAI
Authors	Michael R. James, Michael E. Samples, Dmitri A. Dolgov
