Learning by demonstration can be a powerful and natural tool for developing robot control policies: instead of tedious hand-coding, a robot may learn a control policy by interacting with a teacher. In this work we present an algorithm for learning by demonstration in which the teacher operates in two phases. The teacher first demonstrates the task to the learner, and then critiques the learner's performance of the task; this critique is used by the learner to update its control policy. Our implementation uses a 1-Nearest Neighbor technique that incorporates both the training dataset and the teacher's critique. Because the teacher critiques performance only, they need not guess at an effective critique for the underlying algorithm. We argue that this method is particularly well-suited to human teachers, who are generally better at assigning credit to performances than to algorithms. We have applied this algorithm to the simulated task of a robot intercepting a ball. Our ...
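To illustrate the flavor of a 1-Nearest Neighbor policy that folds in teacher critique, here is a minimal sketch. It is not the paper's implementation: the state representation, the scalar critique scores in [0, 1], and the choice to down-weight poorly rated demonstration points during neighbor selection are all assumptions made for the example.

```python
import numpy as np

class CritiquedNearestNeighbor:
    """Hypothetical sketch: a 1-NN control policy whose neighbor
    selection is steered by teacher critique of past performance."""

    def __init__(self):
        self.states = []   # demonstrated states (feature vectors)
        self.actions = []  # actions the teacher took in those states
        self.weights = []  # per-point credit assigned by critique

    def add_demonstration(self, state, action):
        self.states.append(np.asarray(state, dtype=float))
        self.actions.append(action)
        self.weights.append(1.0)  # neutral credit until critiqued

    def apply_critique(self, index, score):
        # The teacher rates the *performance* tied to this point;
        # the learner maps that rating back onto the datapoint.
        self.weights[index] = score

    def select_action(self, query):
        query = np.asarray(query, dtype=float)
        # Inflate the distance of low-credit points: one simple way
        # to let critique influence which demonstration is reused.
        best_action, best_cost = None, float("inf")
        for s, a, w in zip(self.states, self.actions, self.weights):
            cost = np.linalg.norm(query - s) / max(w, 1e-6)
            if cost < best_cost:
                best_action, best_cost = a, cost
        return best_action
```

Usage: after demonstrations are recorded, a negative critique on one datapoint can flip which neighbor the policy selects near that region of the state space.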
Brenna Argall, Brett Browning, Manuela M. Veloso