Reward shaping for valuing communications during multi-agent coordination

14 years 7 months ago

Download eprints.ecs.soton.ac.uk

Decentralised coordination in multi-agent systems is typically achieved using communication. However, in many cases, communication is expensive to utilise because there is limited bandwidth, it may be dangerous to communicate, or communication may simply be unavailable at times. In this context, we argue for a rational approach to communication — if it has a cost, the agents should be able to calculate a value of communicating. By doing this, the agents can balance the need to communicate with the cost of doing so. In this research, we present a novel model of rational communication, that uses reward shaping to value communications, and employ this valuation in decentralised POMDP policy generation. In this context, reward shaping is the process by which expectations over joint actions are adjusted based on how coordinated the agent team is. An empirical evaluation of the beneﬁts of this approach is presented in two domains. First, in the context of an idealised benchmark problem,...

Simon A. Williamson, Enrico H. Gerding, Nicholas R

Real-time Traffic

Artificial Intelligence | ATAL 2009 | Communication | Decentralised | Rational Communication |

claim paper

Post Info
More Details (n/a)

Added	26 May 2010
Updated	26 May 2010
Type	Conference
Year	2009
Where	ATAL
Authors	Simon A. Williamson, Enrico H. Gerding, Nicholas R. Jennings

Comments (0)

Sciweavers

Reward shaping for valuing communications during multi-agent coordination

Artificial Intelligence | ATAL 2009 | Communication | Decentralised | Rational Communication |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers