Search Sciweavers | Sciweavers

163 search results - page 29 / 33

» Policy Gradient Methods for Robotics

212

click to vote

ATAL
2005
Springer

181views Intelligent Agents» more ATAL 2005»

Improving reinforcement learning function approximators via neuroevolution

16 years 5 days ago

Download www.aaai.org

Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...

Shimon Whiteson

claim paper

Read More »

169

click to vote

ICRA
2009
IEEE

169views Robotics» more ICRA 2009»

Manipulation planning with Workspace Goal Regions

16 years 1 months ago

Download www.ri.cmu.edu

— We present an approach to path planning for manipulators that uses Workspace Goal Regions (WGRs) to specify goal end-effector poses. Instead of specifying a discrete set of goa...

Dmitry Berenson, Siddhartha S. Srinivasa, Dave Fer...

claim paper

Read More »

176

click to vote

ICRA
2007
IEEE

178views Robotics» more ICRA 2007»

Behavior Based Adaptive Control for Autonomous Oceanographic Sampling

16 years 29 days ago

Download oceanai.mit.edu

Abstract— This paper describes an investigation into the adaptive control of autonomous mobile sensor platforms for providing oceanographic sampling. Mobile sensor platforms prov...

Donald P. Eickstedt, Michael R. Benjamin, Ding Wan...

claim paper

Read More »

180

click to vote

ML
2006
ACM

99views Machine Learning» more ML 2006»

Universal parameter optimisation in games based on SPSA

15 years 6 months ago

Download www.jhuapl.edu

Most game programs have a large number of parameters that are crucial for their performance. While tuning these parameters by hand is rather difficult, efficient and easy to use ge...

Levente Kocsis, Csaba Szepesvári

claim paper

Read More »

181

click to vote

NIPS
2008

188views Information Technology» more NIPS 2008»

Bayesian Kernel Shaping for Learning Control

15 years 8 months ago

Download eprints.pascal-network.org

In kernel-based regression learning, optimizing each kernel individually is useful when the data density, curvature of regression surfaces (or decision boundaries) or magnitude of...

Jo-Anne Ting, Mrinal Kalakrishnan, Sethu Vijayakum...

claim paper

Read More »

« Prev « First page 29 / 33 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers