Trial-Based Dynamic Programming for Multi-Agent Planning

15 years 3 months ago

Download rbr.cs.umass.edu

Trial-based approaches offer an efficient way to solve singleagent MDPs and POMDPs. These approaches allow agents to focus their computations on regions of the environment they encounter during the trials, leading to significant computational savings. We present a novel trial-based dynamic programming (TBDP) algorithm for DEC-POMDPs that extends these benefits to multi-agent settings. The algorithm uses trial-based methods for both belief generation and policy evaluation. Policy improvement is implemented efficiently using linear programming and a sub-policy reuse technique that helps bound the amount of memory. The results show that TBDP can produce significant value improvements and is much faster than the best existing planning algorithms.

Feng Wu, Shlomo Zilberstein, Xiaoping Chen

Real-time Traffic

AAAI 2010 | Approaches Allow Agents | Intelligent Agents | Significant Computational Savings | Trial-based Dynamic Programming |

claim paper

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2010
Where	AAAI
Authors	Feng Wu, Shlomo Zilberstein, Xiaoping Chen

Sciweavers

Trial-Based Dynamic Programming for Multi-Agent Planning

AAAI 2010 | Approaches Allow Agents | Intelligent Agents | Significant Computational Savings | Trial-based Dynamic Programming |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers