Sciweavers

311 search results - page 15 / 63
» Gradient Convergence in Gradient methods with Errors
Sort
View
ICML
2003
IEEE
14 years 8 months ago
Hierarchical Policy Gradient Algorithms
Hierarchical reinforcement learning is a general framework which attempts to accelerate policy learning in large domains. On the other hand, policy gradient reinforcement learning...
Mohammad Ghavamzadeh, Sridhar Mahadevan
NIPS
2001
13 years 9 months ago
Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning
Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...
Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...
EOR
2007
117views more  EOR 2007»
13 years 7 months ago
Simultaneous perturbation stochastic approximation of nonsmooth functions
A simultaneous perturbation stochastic approximation (SPSA) method has been developed in this paper, using the operators of perturbation with the Lipschitz density function. This ...
Vaida Bartkute, Leonidas Sakalauskas
TVCG
2011
119views more  TVCG 2011»
13 years 2 months ago
Toward High-Quality Gradient Estimation on Regular Lattices
—In this paper, we present two methods for accurate gradient estimation from scalar field data sampled on regular lattices. The first method is based on the multidimensional Tayl...
Zahid Hossain, Usman R. Alim, Torsten Möller
ICASSP
2010
IEEE
13 years 7 months ago
A new method for kurtosis maximization and source separation
This paper introduces a new method to maximize kurtosisbased contrast functions. Such contrast functions appear in the problem of blind source separation of convolutively mixed so...
Marc Castella, Eric Moreau