Memory-Bounded Dynamic Programming for DEC-POMDPs

14 years 2 months ago

Download anytime.cs.umass.edu

Decentralized decision making under uncertainty has been shown to be intractable when each agent has different partial information about the domain. Thus, improving the applicability and scalability of planning algorithms is an important challenge. We present the ﬁrst memory-bounded dynamic programming algorithm for ﬁnite-horizon decentralized POMDPs. A set of heuristics is used to identify relevant points of the inﬁnitely large belief space. Using these belief points, the algorithm successively selects the best joint policies for each horizon. The algorithm is extremely efﬁcient, having linear time and space complexity with respect to the horizon length. Experimental results show that it can handle horizons that are multiple orders of magnitude larger than what was previously possible, while achieving the same or better solution quality. These results signiﬁcantly increase the applicability of decentralized decision-making techniques.

Sven Seuken, Shlomo Zilberstein

Real-time Traffic

Algorithm | Artificial Intelligence | Decentralized Decision Making | IJCAI 2007 | ﬁnite-horizon Decentralized Pomdps |

claim paper

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2007
Where	IJCAI
Authors	Sven Seuken, Shlomo Zilberstein

Comments (0)

Sciweavers

Memory-Bounded Dynamic Programming for DEC-POMDPs

Algorithm | Artificial Intelligence | Decentralized Decision Making | IJCAI 2007 | ﬁnite-horizon Decentralized Pomdps |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers