Multiagent Planning with Trembling-Hand Perfect Equilibrium in Multiagent POMDPs

Multiagent Partially Observable Markov Decision Processes are a popular model of multiagent systems with uncertainty. Since the computational cost of finding an optimal joint policy is prohibitive, a Joint Equilibrium-based Search for Policies with Nash Equilibrium (JESP-NE) has been proposed that finds a locally optimal joint policy in which each policy is a best response to the other policies; i.e., the joint policy is a Nash equilibrium. One limitation of JESP-NE is that the quality of the obtained joint policy depends on the predefined default policy. More specifically, when finding a best response, if some observations have zero probability, JESP-NE uses this default policy. If the default policy is poor, JESP-NE tends to converge to a sub-optimal joint policy. In this paper, we propose a method that finds a locally optimal joint policy based on a concept called Trembling-hand Perfect Equilibrium (TPE). In finding a TPE, we assume that an agent might make a mistake in selectin...
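
The following is a minimal sketch of the general idea behind the abstract, not the authors' algorithm. It assumes a one-shot, two-agent common-payoff matrix game instead of a multiagent POMDP. All names (`tremble`, `best_response`, `alternating_best_response`) and the payoff matrix are hypothetical. Keeping the current action on ties is an assumed stand-in for falling back to a default policy. With no trembles, alternating best responses can stay at a poor Nash equilibrium; with a small tremble probability `eps`, every action of the other agent has positive probability and the search can leave it.

```python
def tremble(action, n_actions, eps):
    """Mixed strategy: play `action`, but with probability eps pick uniformly at random."""
    return [(1 - eps) * (a == action) + eps / n_actions for a in range(n_actions)]


def best_response(payoff, other_mix, current, as_row):
    """Best action against the other agent's mixed strategy.

    Ties keep `current` (assumption: this plays the role of a default choice).
    """
    n_actions = len(payoff) if as_row else len(payoff[0])

    def value(a):
        if as_row:
            return sum(p * payoff[a][b] for b, p in enumerate(other_mix))
        return sum(p * payoff[b][a] for b, p in enumerate(other_mix))

    best = current
    for a in range(n_actions):
        if value(a) > value(best) + 1e-12:
            best = a
    return best


def alternating_best_response(payoff, start, eps=0.0, max_iters=100):
    """Alternate best responses until neither agent changes its action."""
    row, col = start
    n_rows, n_cols = len(payoff), len(payoff[0])
    for _ in range(max_iters):
        new_row = best_response(payoff, tremble(col, n_cols, eps), row, True)
        new_col = best_response(payoff, tremble(new_row, n_rows, eps), col, False)
        if (new_row, new_col) == (row, col):
            break
        row, col = new_row, new_col
    return row, col


# Hypothetical common payoff: only the joint action (0, 0) yields reward.
PAYOFF = [[1.0, 0.0],
          [0.0, 0.0]]

no_tremble = alternating_best_response(PAYOFF, start=(1, 1), eps=0.0)
with_tremble = alternating_best_response(PAYOFF, start=(1, 1), eps=0.1)
print(no_tremble, with_tremble)
```

In this toy game the joint action (1, 1) is a Nash equilibrium with payoff 0, so the run with `eps=0.0` stays there. With `eps=0.1` the run moves to (0, 0), which is the only equilibrium that survives small mistakes by the other agent.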

Yuichi Yabu, Makoto Yokoo, Atsushi Iwasaki

Default Policy | Joint Policy | Optimal Joint Policy | PRIMA 2007

Added	09 Jun 2010
Updated	09 Jun 2010
Type	Conference
Year	2007
Where	PRIMA
Authors	Yuichi Yabu, Makoto Yokoo, Atsushi Iwasaki
