Team-Partitioned, Opaque-Transition Reinforcement Learning

15 years 10 months ago

Download www.cs.ucf.edu

In this paper, we present a novel multi-agent learning paradigm called team-partitioned, opaque-transition reinforcement learning (TPOT-RL). TPOT-RL introduces the concept of using action-dependent features to generalize the state space. In our work, we use a learned action-dependent feature space. TPOT-RL is an effective technique to allow a team of agents to learn to cooperate towards the achievement of a speciﬁc goal. It is an adaptation of traditional RL methods that is applicable in complex, non-Markovian, multi-agent domains with large state spaces and limited training opportunities. Multi-agent scenarios are opaque-transition, as team members are not always in full communication with one another and adversaries may affect the environment. Hence, each learner cannot rely on having knowledge of future state transitions after acting in the world. TPOT-RL enables teams of agents to learn effective policies with very few training examples even in the face of a large state space wi...

Peter Stone, Manuela M. Veloso

Real-time Traffic

Action-dependent Feature | AGENTS 1999 | Multi-agent | Security Privacy | State Space |

claim paper

Post Info
More Details (n/a)

Added	03 Aug 2010
Updated	03 Aug 2010
Type	Conference
Year	1999
Where	AGENTS
Authors	Peter Stone, Manuela M. Veloso

Comments (0)

Sciweavers

Team-Partitioned, Opaque-Transition Reinforcement Learning

Action-dependent Feature | AGENTS 1999 | Multi-agent | Security Privacy | State Space |

Explore & Download

Productivity Tools

Sciweavers