Apprenticeship learning via soft local homomorphisms

15 years 5 months ago

Download damas.ift.ulaval.ca

Abstract— We consider the problem of apprenticeship learning when the expert’s demonstration covers only a small part of a large state space. Inverse Reinforcement Learning (IRL) provides an efﬁcient solution to this problem based on the assumption that the expert is optimally acting in a Markov Decision Process (MDP). However, past work on IRL requires an accurate estimate of the frequency of encountering each feature of the states when the robot follows the expert’s policy. Given that the complete policy of the expert is unknown, the features frequencies can only be empirically estimated from the demonstrated trajectories. In this paper, we propose to use a transfer method, known as soft homomorphism, in order to generalize the expert’s policy to unvisited regions of the state space. The generalized policy can be used either as the robot’s ﬁnal policy, or to calculate the features frequencies within an IRL algorithm. Empirical results show that our approach is able to l...

Abdeslam Boularias, Brahim Chaib-draa

Real-time Traffic

Expert’s Policy | ICRA 2010 | Policy | Robotics | State Space |

claim paper

Added	26 Jan 2011
Updated	26 Jan 2011
Type	Journal
Year	2010
Where	ICRA
Authors	Abdeslam Boularias, Brahim Chaib-draa

Sciweavers

Apprenticeship learning via soft local homomorphisms

Expert’s Policy | ICRA 2010 | Policy | Robotics | State Space |

Explore & Download

Productivity Tools

Sciweavers