Sciweavers

PRIMA
2007
Springer

Multiagent Planning with Trembling-Hand Perfect Equilibrium in Multiagent POMDPs

14 years 6 months ago
Multiagent Planning with Trembling-Hand Perfect Equilibrium in Multiagent POMDPs
Multiagent Partially Observable Markov Decision Processes are a popular model of multiagent systems with uncertainty. Since the computational cost for finding an optimal joint policy is prohibitive, a Joint Equilibrium-based Search for Policies with Nash Equilibrium (JESP-NE) is proposed that finds a locally optimal joint policy in which each policy is a best response to other policies; i.e., the joint policy is a Nash equilibrium. One limitation of JESP-NE is that the quality of the obtained joint policy depends on the predefined default policy. More specifically, when finding a best response, if some observation have zero probabilities, JESP-NE uses this default policy. If the default policy is quite bad, JESP-NE tends to converge to a sub-optimal joint policy. In this paper, we propose a method that finds a locally optimal joint policy based on a concept called Trembling-hand Perfect Equilibrium (TPE). In finding a TPE, we assume that an agent might make a mistake in selectin...
Yuichi Yabu, Makoto Yokoo, Atsushi Iwasaki
Added 09 Jun 2010
Updated 09 Jun 2010
Type Conference
Year 2007
Where PRIMA
Authors Yuichi Yabu, Makoto Yokoo, Atsushi Iwasaki
Comments (0)