Rationality of Reward Sharing in Multi-agent Reinforcement Learning

14 years 6 months ago

Download svrrd2.niad.ac.jp

Abstract. In multi-agent reinforcement learning systems, it is important to share a reward among all agents. We focus on the Rationality Theorem of Proﬁt Sharing [5] and analyze how to share a reward among all proﬁt sharing agents. When an agent gets a direct reward R (R > 0), an indirect reward µR (µ ≥ 0) is given to the other agents. We have derived the necessary and suﬃcient condition to preserve the rationality as follows; µ < M − 1 MW (1 − ( 1 M )W0 )(n − 1)L , where M and L are the maximum number of conﬂicting all rules and rational rules in the same sensory input, W and W0 are the maximum episode length of a direct and an indirect-reward agents, and n is the number of agents. This theory is derived by avoiding the least desirable situation whose expected reward per an action is zero. Therefore, if we use this theorem, we can experience several eﬃcient aspects of reward sharing. Through numerical examples, we conﬁrm the eﬀectiveness of this theorem....

Kazuteru Miyazaki, Shigenobu Kobayashi

Real-time Traffic

Artificial Intelligence | Indirect Reward µR | PRICAI 1999 | Proﬁt Sharing Agents | Reinforcement Learning Systems |

claim paper

Post Info
More Details (n/a)

Added	04 Aug 2010
Updated	04 Aug 2010
Type	Conference
Year	1999
Where	PRICAI
Authors	Kazuteru Miyazaki, Shigenobu Kobayashi

Comments (0)

Sciweavers

Rationality of Reward Sharing in Multi-agent Reinforcement Learning

Artificial Intelligence | Indirect Reward µR | PRICAI 1999 | Proﬁt Sharing Agents | Reinforcement Learning Systems |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers