Lossless clustering of histories in decentralized POMDPs

15 years 8 months ago

Download www.science.uva.nl

Decentralized partially observable Markov decision processes (Dec-POMDPs) constitute a generic and expressive framework for multiagent planning under uncertainty. However, planning optimally is diﬃcult because solutions map local observation histories to actions, and the number of such histories grows exponentially in the planning horizon. In this work, we identify a criterion that allows for lossless clustering of observation histories: i.e., we prove that when two histories satisfy the criterion, they have the same optimal value and thus can be treated as one. We show how this result can be exploited in optimal policy search and demonstrate empirically that it can provide a speed-up of multiple orders of magnitude, allowing the optimal solution of signiﬁcantly larger problems. We also perform an empirical analysis of the generality of our clustering method, which suggests that it may also be useful in other (approximate) Dec-POMDP solution methods. Categories and Subject Descrip...

Frans A. Oliehoek, Shimon Whiteson, Matthijs T. J.

Real-time Traffic

Artificial Intelligence | ATAL 2009 | Local Observation Histories | Observable Markov Decision | Observation Histories |

claim paper

Post Info
More Details (n/a)

Added	26 May 2010
Updated	26 May 2010
Type	Conference
Year	2009
Where	ATAL
Authors	Frans A. Oliehoek, Shimon Whiteson, Matthijs T. J. Spaan

Comments (0)

Sciweavers

Lossless clustering of histories in decentralized POMDPs

Artificial Intelligence | ATAL 2009 | Local Observation Histories | Observable Markov Decision | Observation Histories |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers