Search Sciweavers | Sciweavers

21

ICRA
2010
IEEE

133views Robotics» more ICRA 2010»

Generalized model learning for Reinforcement Learning on a humanoid robot

13 years 6 months ago

— Reinforcement learning (RL) algorithms have long been promising methods for enabling an autonomous robot to improve its behavior on sequential decision-making tasks. The obviou...

Todd Hester, Michael Quinlan, Peter Stone

claim paper

Read More »

24

click to vote

HRI
2007
ACM

127views Human Computer Interaction» more HRI 2007»

Learning by demonstration with critique from a human teacher

13 years 11 months ago

Download www.cs.cmu.edu

Learning by demonstration can be a powerful and natural tool for developing robot control policies. That is, instead of tedious hand-coding, a robot may learn a control policy by ...

Brenna Argall, Brett Browning, Manuela M. Veloso

claim paper

Read More »

29

click to vote

AAAI
2008

144views Intelligent Agents» more AAAI 2008»

A Variance Analysis for POMDP Policy Evaluation

13 years 10 months ago

Download www.cs.mcgill.ca

Partially Observable Markov Decision Processes have been studied widely as a model for decision making under uncertainty, and a number of methods have been developed to find the s...

Mahdi Milani Fard, Joelle Pineau, Peng Sun

claim paper

Read More »

30

click to vote

GECCO
2009
Springer

162views Optimization» more GECCO 2009»

Uncertainty handling CMA-ES for reinforcement learning

13 years 5 months ago

Download www.neuroinformatik.ruhr-uni-bochum.de

The covariance matrix adaptation evolution strategy (CMAES) has proven to be a powerful method for reinforcement learning (RL). Recently, the CMA-ES has been augmented with an ada...

Verena Heidrich-Meisner, Christian Igel

claim paper

Read More »

23

click to vote

ICRA
2008
IEEE

143views Robotics» more ICRA 2008»

Adaptive workspace biasing for sampling-based planners

14 years 2 months ago

Download www.ri.cmu.edu

Abstract— The widespread success of sampling-based planning algorithms stems from their ability to rapidly discover the connectivity of a conﬁguration space. Past research has ...

Matthew Zucker, James Kuffner, James A. Bagnell

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers