Optimizing fixed-size stochastic controllers for POMDPs and decentralized POMDPs

Download anytime.cs.umass.edu

POMDPs and their decentralized multiagent counterparts, DEC-POMDPs, offer a rich framework for sequential decision making under uncertainty. Their computational complexity, however, presents an important research challenge. One approach that effectively addresses the intractable memory requirements of current algorithms is based on representing agent policies as finite-state controllers. In this paper, we propose a new approach that uses this representation and formulates the problem as a nonlinear program (NLP). The NLP defines an optimal policy of a desired size for each agent. This new representation allows a wide range of powerful nonlinear programming algorithms to be used to solve POMDPs and DEC-POMDPs. Although solving the NLP optimally is often intractable, the results we obtain using an off-the-shelf optimization method are competitive with state-of-the-art POMDP algorithms and outperform state-of-the-art DEC-POMDP algorithms. Our approach is easy to implement and it opens up ...
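To make the finite-state controller representation concrete, here is a minimal Python sketch of how a fixed stochastic controller can be evaluated on a single-agent POMDP. It is not the paper's NLP formulation or solver; it only computes the value of a controller whose parameters are already given, using the standard linear Bellman system. The function name `evaluate_controller`, the array layouts, and the parameter names `act` and `trans` are assumptions made for this illustration.

```python
import numpy as np

def evaluate_controller(T, O, R, act, trans, gamma):
    """Return V[q, s]: value of starting in controller node q and state s.

    Array layouts (all hypothetical, chosen for this sketch):
      T[s, a, s2]         P(s2 | s, a)      state transition
      O[a, s2, o]         P(o | s2, a)      observation
      R[s, a]             immediate reward
      act[q, a]           P(a | q)          action distribution of node q
      trans[q, a, o, q2]  P(q2 | q, a, o)   node transition distribution
    """
    n_q, n_s = act.shape[0], T.shape[0]
    # Expected immediate reward for each (node, state) pair.
    r = np.einsum("qa,sa->qs", act, R)
    # Joint transition (q, s) -> (q2, s2), summing out actions and observations.
    P = np.einsum("qa,sat,ato,qaor->qsrt", act, T, O, trans)
    P = P.reshape(n_q * n_s, n_q * n_s)
    # Solve (I - gamma * P) v = r, the Bellman equations for a fixed controller.
    v = np.linalg.solve(np.eye(n_q * n_s) - gamma * P, r.reshape(-1))
    return v.reshape(n_q, n_s)
```

Under these assumptions, the value of a controller from an initial belief `b0` and start node `q0` is `b0 @ V[q0]`. An optimizer of the kind the abstract describes would search over `act` and `trans` for a chosen number of nodes, rather than holding them fixed as this sketch does.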

Christopher Amato, Daniel S. Bernstein, Shlomo Zilberstein

AAMAS 2010 | Algorithms | Intelligent Agents | Intractable Memory Requirements | Nonlinear Programming |

» Quasi deterministic POMDPs and DecPOMDPs

» Mixed Integer Linear Programming for Exact Finite-Horizon Planning in Decentralized Pomdps

» Stochastic Local Search for POMDP Controllers

» Multiagent Planning Under Uncertainty with Stochastic Communication Delays

» A Framework of Stochastic Power Management Using Hidden Markov Model

» Reinforcement Learning in POMDPs via Direct Gradient Ascent

» A POMDP approach to P300-based brain-computer interfaces

» Decentralized planning under uncertainty for teams of communicating agents

Post Info

Added	08 Dec 2010
Updated	08 Dec 2010
Type	Journal
Year	2010
Where	AAMAS
Authors	Christopher Amato, Daniel S. Bernstein, Shlomo Zilberstein
