Search Sciweavers | Sciweavers

311 search results - page 15 / 63

» Gradient Convergence in Gradient methods with Errors

122

click to vote

ICML
2003
IEEE

151views Machine Learning» more ICML 2003»

Hierarchical Policy Gradient Algorithms

16 years 4 months ago

Download www.hpl.hp.com

Hierarchical reinforcement learning is a general framework which attempts to accelerate policy learning in large domains. On the other hand, policy gradient reinforcement learning...

Mohammad Ghavamzadeh, Sridhar Mahadevan

claim paper

Read More »

129

click to vote

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 5 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

133

click to vote

EOR
2007

117views more EOR 2007»

Simultaneous perturbation stochastic approximation of nonsmooth functions

15 years 3 months ago

Download www.jhuapl.edu

A simultaneous perturbation stochastic approximation (SPSA) method has been developed in this paper, using the operators of perturbation with the Lipschitz density function. This ...

Vaida Bartkute, Leonidas Sakalauskas

claim paper

Read More »

158

click to vote

TVCG
2011

119views more TVCG 2011»

Toward High-Quality Gradient Estimation on Regular Lattices

14 years 10 months ago

Download www.cs.sfu.ca

—In this paper, we present two methods for accurate gradient estimation from scalar field data sampled on regular lattices. The first method is based on the multidimensional Tayl...

Zahid Hossain, Usman R. Alim, Torsten Möller

claim paper

Read More »

122

click to vote

ICASSP
2010
IEEE

189views Signal Processing» more ICASSP 2010»

A new method for kurtosis maximization and source separation

15 years 4 months ago

Download www-public.it-sudparis.eu

This paper introduces a new method to maximize kurtosisbased contrast functions. Such contrast functions appear in the problem of blind source separation of convolutively mixed so...

Marc Castella, Eric Moreau

claim paper

Read More »

« Prev « First page 15 / 63 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers