Search Sciweavers | Sciweavers

5 search results - page 1 / 1

» Sparse incremental learning for interactive robot control po...

175

click to vote

ICRA
2008
IEEE

169views Robotics» more ICRA 2008»

Sparse incremental learning for interactive robot control policy estimation

16 years 1 months ago

Download www.cs.brown.edu

— We are interested in transferring control policies for arbitrary tasks from a human to a robot. Using interactive demonstration via teloperation as our transfer scenario, we ca...

Daniel H. Grollman, Odest Chadwicke Jenkins

claim paper

Read More »

170

click to vote

AR
2002

157views more AR 2002»

Acquiring state from control dynamics to learn grasping policies for robot hands

15 years 6 months ago

Download www.mit.edu

Abstract--A prominent emerging theory of sensorimotor development in biological systems proposes that control knowledge is encoded in the dynamics of physical interaction with the ...

Roderic A. Grupen, Jefferson A. Coelho Jr.

claim paper

Read More »

186

click to vote

NN
2010
Springer

125views Neural Networks» more NN 2010»

Parameter-exploring policy gradients

15 years 5 months ago

Download www.kyb.mpg.de

We present a model-free reinforcement learning method for partially observable Markov decision problems. Our method estimates a likelihood gradient by sampling directly in paramet...

Frank Sehnke, Christian Osendorfer, Thomas Rü...

claim paper

Read More »

183

click to vote

HRI
2007
ACM

133views Human Computer Interaction» more HRI 2007»

Efficient model learning for dialog management

15 years 10 months ago

Download www.eecs.ucf.edu

Intelligent planning algorithms such as the Partially Observable Markov Decision Process (POMDP) have succeeded in dialog management applications [10, 11, 12] because of their rob...

Finale Doshi, Nicholas Roy

claim paper

Read More »

208

Voted

UAI
2008

242views Artificial Intelligence» more UAI 2008»

Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping

15 years 8 months ago

Download uai2008.cs.helsinki.fi

We consider the problem of efficiently learning optimal control policies and value functions over large state spaces in an online setting in which estimates must be available afte...

Richard S. Sutton, Csaba Szepesvári, Alborz...

claim paper

Read More »

« Prev « First page 1 / 1 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers