Search Sciweavers | Sciweavers

382 search results - page 6 / 77

» Gradient estimation in global optimization algorithms

UAI
2001

129 views, Artificial Intelligence

The Optimal Reward Baseline for Gradient-Based Reinforcement Learning

15 years 8 months ago

Download cs.anu.edu.au

There exist a number of reinforcement learning algorithms which learn by climbing the gradient of expected reward. Their long-run convergence has been proved, even in partially ob...

Lex Weaver, Nigel Tao

SIP
2003

150 views, Image Processing

Time Domain Optimization Techniques for Blind Separation of Non-stationary Convolutive Mixed Signals

15 years 8 months ago

Download www.isip.uni-luebeck.de

This paper aims to solve the problem of Blind Signal Separation (BSS) in a convolutive environment based on output correlation matrix diagonalization. Firstly an extension of the ...

Iain Russell, Alfred Mertins, Jiangtao Xi

CEC
2011
IEEE

221 views, Artificial Intelligence

Stochastic Natural Gradient Descent by estimation of empirical covariances

14 years 6 months ago

Download chrome.ws.dei.polimi.it

Stochastic relaxation aims at finding the minimum of a fitness function by identifying a proper sequence of distributions, in a given model, that minimize the expected value o...

Luigi Malagò, Matteo Matteucci, Giovanni Pi...

ICCD
2001
IEEE

154 views, Hardware

Performance Optimization By Wire and Buffer Sizing Under The Transmission Line Model

16 years 3 months ago

Download cc.ee.ntu.edu.tw

As the operating frequency increases to Giga Hertz and the rise time of a signal is less than or comparable to the time-of-flight delay of a line, it is necessary to consider the...

Tai-Chen Chen, Song-Ra Pan, Yao-Wen Chang

WSC
2001

119 views, Modeling And Simulation

Global random optimization by simultaneous perturbation stochastic approximation

15 years 8 months ago

Download www.jhuapl.edu

We examine the theoretical and numerical global convergence properties of a certain "gradient free" stochastic approximation algorithm called the "simultaneous pertu...

John L. Maryak, Daniel C. Chin
