Sciweavers

RAS
2010

Probabilistic Policy Reuse for inter-task transfer learning

13 years 9 months ago
Probabilistic Policy Reuse for inter-task transfer learning
Policy Reuse is a reinforcement learning technique that efficiently learns a new policy by using past similar learned policies. The Policy Reuse learner improves its exploration by probabilistically including the exploitation of those past policies. Policy Reuse was introduced and previously demonstrated its effectiveness in problems with different reward functions in the same state and action spaces. In this article, we contribute Policy Reuse as transfer learning among different domains. We introduce extended MDPs to include domains and tasks, where domains have different state and action spaces, and task are problems with different rewards within a domain. We show how Policy Reuse can be applied among domains by defining and using a mapping between their state and action spaces. We use several domains, as versions of a simulated RoboCup Keepaway problem, where we show that Policy Reuse can be used as a mechanism of transfer learning significantly outperforming a basic policy...
Fernando Fernández, Javier García, M
Added 30 Jan 2011
Updated 30 Jan 2011
Type Journal
Year 2010
Where RAS
Authors Fernando Fernández, Javier García, Manuela M. Veloso
Comments (0)