Advice taking in multiagent reinforcement learning

14 years 8 months ago

Download homepages.inf.ed.ac.uk

This paper proposes the β-WoLF algorithm for multiagent reinforcement learning (MARL) in the stochastic games framework that uses an additional “advice” signal to inform agents about mutually beneﬁcial forms of behaviour. β-WoLF is an extension of the WoLF-PHC algorithm that allows agents to assess whether the advice obtained through this additional reward signal is (i) useful for the learning agent itself and (ii) currently being followed by other agents in the system. With this, agents are able to decide autonomously whether to follow the advice or not, safeguarding themselves against malicious or unreliable advice which, if followed, might lead them to sacriﬁce their own future rewards, as well as unilateral cooperation that could be exploited by other agents in the system. We report on experimental results obtained with this novel algorithm which indicate that it enables cooperation in scenarios in which the need to defend oneself against exploitation results in poor coo...

Michael Rovatsos, Alexandros Belesiotis

Real-time Traffic

ATAL 2007 | MARL Algorithms | Multiagent Reinforcement | Reinforcement Learning |

claim paper

Post Info
More Details (n/a)

Added	07 Jun 2010
Updated	07 Jun 2010
Type	Conference
Year	2007
Where	ATAL
Authors	Michael Rovatsos, Alexandros Belesiotis

Comments (0)

Sciweavers

Advice taking in multiagent reinforcement learning

ATAL 2007 | MARL Algorithms | Multiagent Reinforcement | Reinforcement Learning |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers