On opportunistic techniques for solving decentralized Markov decision processes with temporal constraints


Decentralized Markov Decision Processes (DEC-MDPs) are a popular model of agent-coordination problems in domains with uncertainty and time constraints, but they are very difficult to solve. In this paper, we improve a state-of-the-art heuristic solution method for DEC-MDPs, called OC-DEC-MDP, that has recently been shown to scale up to larger DEC-MDPs. Our heuristic solution method, called Value Function Propagation (VFP), combines two orthogonal improvements of OC-DEC-MDP. First, it speeds up OC-DEC-MDP by an order of magnitude by maintaining and manipulating a value function for each state (as a function of time) rather than a separate value for each pair of state and time interval. Second, it achieves better solution quality than OC-DEC-MDP because, as our analytical results show, it does not overestimate the expected total reward as OC-DEC-MDP does. We test both improvements independently in a crisis-management domain as well as for other types of domains. Our experimental results demon...
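To make the idea of "a value function for each state (as a function of time)" concrete, here is a minimal Python sketch. It is not the VFP algorithm from the paper. It only illustrates, under assumed inputs, how one state's value can be held as a single function of the start time. The function name `value_over_time`, the discrete duration distribution, the deadline, and the reward are all hypothetical choices made for this illustration.

```python
from typing import Dict, List


def value_over_time(
    reward: float,
    deadline: int,
    duration_pmf: Dict[int, float],
    horizon: int,
) -> List[float]:
    """Expected reward of starting one method at each time t in 0..horizon.

    Assumption (not from the paper): the reward is earned only if the
    method finishes by `deadline`, and its duration follows the discrete
    distribution `duration_pmf` (duration -> probability).
    """
    values = []
    for t in range(horizon + 1):
        # Probability that a method started at time t finishes on time.
        p_on_time = sum(p for d, p in duration_pmf.items() if t + d <= deadline)
        values.append(reward * p_on_time)
    return values


# Hypothetical example: reward 10, deadline 5, duration 1 or 3 with equal odds.
values = value_over_time(
    reward=10.0, deadline=5, duration_pmf={1: 0.5, 3: 0.5}, horizon=6
)
print(values)
```

The result is one list indexed by start time, so the whole time profile of a state can be read or updated as a single object instead of being looked up one (state, time interval) pair at a time.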

Janusz Marecki, Milind Tambe


ATAL 2007 | Heuristic Solution Method | Markov Decision | Value Function


Post Info

Added	07 Jun 2010
Updated	07 Jun 2010
Type	Conference
Year	2007
Where	ATAL
Authors	Janusz Marecki, Milind Tambe
