Taming Decentralized POMDPs: Towards Efficient Policy Computation for Multiagent Settings

15 years 3 months ago

Download dli.iiit.ac.in

The problem of deriving joint policies for a group of agents that maximize some joint reward function can be modeled as a decentralized partially observable Markov decision process (POMDP). Yet, despite the growing importance and applications of decentralized POMDP models in the multiagents arena, few algorithms have been developed for efficiently deriving joint policies for these models. This paper presents a new class of locally optimal algorithms called "Joint Equilibriumbased search for policies (JESP)". We first describe an exhaustive version of JESP and subsequently a novel dynamic programming approach to JESP. Our complexity analysis reveals the potential for exponential speedups due to the dynamic programming approach. These theoretical results are verified via empirical comparisons of the two JESP versions with each other and with a globally optimal brute-force search algorithm. Finally, we prove piece-wise linear and convexity (PWLC) properties, thus taking steps t...

Ranjit Nair, Milind Tambe, Makoto Yokoo, David V.

Real-time Traffic

Dynamic Programming Approach | IJCAI 2003 | IJCAI 2007 | Joint Policies | Partially Observable Markov Decision Process |

claim paper

» Multiagent Planning Under Uncertainty with Stochastic Communication Delays

» Constraintbased dynamic programming for decentralized POMDPs with structured interactions

» Mixed Integer Linear Programming for Exact FiniteHorizon Planning in Decentralized Pomdps

» Towards a unifying characterization for quantifying weak coupling in decPOMDPs

» TrialBased Dynamic Programming for MultiAgent Planning

» Pointbased incremental pruning heuristic for solving finitehorizon DECPOMDPs

Post Info
More Details (n/a)

Added	31 Oct 2010
Updated	31 Oct 2010
Type	Conference
Year	2003
Where	IJCAI
Authors	Ranjit Nair, Milind Tambe, Makoto Yokoo, David V. Pynadath, Stacy Marsella

Comments (0)

Sciweavers

Taming Decentralized POMDPs: Towards Efficient Policy Computation for Multiagent Settings

Dynamic Programming Approach | IJCAI 2003 | IJCAI 2007 | Joint Policies | Partially Observable Markov Decision Process |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers