There has been substantial progress with formal models for sequential decision making by individual agents using the Markov decision process (MDP). However, similar treatment of multi-agent systems is lacking. A recent complexity result, showing that solving decentralized MDPs is NEXP-hard, provides a partial explanation. To overcome this complexity barrier, we identify a general class of transition-independent decentralized MDPs that is widely applicable. The class consists of independent collaborating agents that are tied together through a global reward function that depends upon both of their histories. We present a novel algorithm for solving this class of problems and examine its properties. The result is the first effective technique to optimally solve a class of decentralized MDPs. This lays the foundation for further work in this area on both exact and approximate solutions.

Categories and Subject Descriptors
I.2.11 [Artificial Intelligence]: Distributed Artificial Intelligence
Raphen Becker, Shlomo Zilberstein, Victor R. Lesser
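A rough formalization of the class described in the abstract may help fix ideas. The sketch below is illustrative only; the symbols $P_i$, $R_i$, $h_i$, and $\rho$ are notation assumed here for two agents, not necessarily the paper's own.

% Sketch: transition independence for two agents (assumed notation).
% Each agent's local transitions depend only on its own state and
% action, so the joint transition function factors:
\[
  P\big((s_1', s_2') \,\big|\, (s_1, s_2), (a_1, a_2)\big)
    = P_1(s_1' \mid s_1, a_1)\; P_2(s_2' \mid s_2, a_2)
\]
% The agents are otherwise independent; they are coupled only through
% a global reward defined over their histories h_1, h_2 of states and
% actions:
\[
  R(h_1, h_2) = R_1(h_1) + R_2(h_2) + \rho(h_1, h_2)
\]
% Here \rho is the joint component of the reward that ties the two
% agents' policies together; with \rho = 0 the problem would decompose
% into two independent MDPs.

Under this structure, each agent's dynamics can be planned for locally, while the coupling term $\rho$ is what makes the joint optimization nontrivial.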