Search Sciweavers | Sciweavers

95 search results - page 16 / 19

» Policy Gradients for Cryptanalysis

226

click to vote

GECCO
2009
Springer

162views Optimization» more GECCO 2009»

Uncertainty handling CMA-ES for reinforcement learning

15 years 4 months ago

Download www.neuroinformatik.ruhr-uni-bochum.de

The covariance matrix adaptation evolution strategy (CMAES) has proven to be a powerful method for reinforcement learning (RL). Recently, the CMA-ES has been augmented with an ada...

Verena Heidrich-Meisner, Christian Igel

claim paper

Read More »

194

click to vote

CDC
2010
IEEE

136views Control Systems» more CDC 2010»

Pathologies of temporal difference methods in approximate dynamic programming

15 years 1 months ago

Download web.mit.edu

Approximate policy iteration methods based on temporal differences are popular in practice, and have been tested extensively, dating to the early nineties, but the associated conve...

Dimitri P. Bertsekas

claim paper

Read More »

177

click to vote

PKDD
2009
Springer

181views Data Mining» more PKDD 2009»

Active Learning for Reward Estimation in Inverse Reinforcement Learning

16 years 1 months ago

Download users.isr.ist.utl.pt

Abstract. Inverse reinforcement learning addresses the general problem of recovering a reward function from samples of a policy provided by an expert/demonstrator. In this paper, w...

Manuel Lopes, Francisco S. Melo, Luis Montesano

claim paper

Read More »

164

click to vote

ICRA
2008
IEEE

129views Robotics» more ICRA 2008»

Compliant manipulation for peg-in-hole: Is passive compliance a key to learn contact motion?

16 years 1 months ago

Download groups.csail.mit.edu

— We examine the usefulness of passive compliance in a manipulator that learns contact motion. Based on the notice that humans outperforms robots with the contact motion, we foll...

Seung-kook Yun

claim paper

Read More »

211

click to vote

JMLR
2010

227views more JMLR 2010»

PyBrain

15 years 5 months ago

Download www.idsia.ch

PyBrain is a versatile machine learning library for Python. Its goal is to provide ﬂexible, easyto-use yet still powerful algorithms for machine learning tasks, including a vari...

Tom Schaul, Justin Bayer, Daan Wierstra, Yi Sun, M...

claim paper

Read More »

« Prev « First page 16 / 19 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers