Search Sciweavers | Sciweavers

311 search results - page 9 / 63

» Gradient Convergence in Gradient methods with Errors

IJCAI
2001

Artificial Intelligence

Exploiting Multiple Secondary Reinforcers in Policy Gradient Reinforcement Learning

15 years 4 months ago

Download www.cs.colorado.edu

Most formulations of Reinforcement Learning depend on a single reinforcement reward value to guide the search for the optimal policy solution. If observation of this reward is rar...

Gregory Z. Grudic, Lyle H. Ungar

MVA
2002

Computer Vision

Global Motion Estimation Based on the Constrained Spatio-temporal Gradient Method in Model-Based Coding

15 years 3 months ago

Download www.cvl.iis.u-tokyo.ac.jp

For global motion estimation in model-based coding, this paper proposes a constrained spatio-temporal gradient method using contour information. To overcome the local minimum prob...

Young Wook Sohn, Doo-Hyun Kim, Dong-O Kim, Rae-Hon...

ICMLA
2010

Machine Learning

Multimodal Parameter-exploring Policy Gradients

15 years 1 months ago

Download www6.in.tum.de

Policy Gradients with Parameter-based Exploration (PGPE) is a novel model-free reinforcement learning method that alleviates the problem of high-variance gradient estima...

Frank Sehnke, Alex Graves, Christian Osendorfer, J...

FOCM
2008

Online Gradient Descent Learning Algorithms

15 years 3 months ago

Download www.cs.ucl.ac.uk

This paper considers the least-square online gradient descent algorithm in a reproducing kernel Hilbert space (RKHS) without explicit regularization. We present a novel capacity i...

Yiming Ying, Massimiliano Pontil

IROS
2006
IEEE

Robotics

Policy Gradient Methods for Robotics

15 years 9 months ago

Download www.cs.utah.edu

The acquisition and improvement of motor skills and control policies for robotics from trial and error is of essential importance if robots should ever leave precisely pre-struc...

Jan Peters, Stefan Schaal
