Search Sciweavers | Sciweavers

248 search results - page 19 / 50

» Rate of Convergence for Constrained Stochastic Approximation...

181

click to vote

JAT
2010

71views more JAT 2010»

Functions with prescribed best linear approximations

15 years 5 months ago

Download www.ann.jussieu.fr

A common problem in applied mathematics is that of ﬁnding a function in a Hilbert space with prescribed best approximations from a ﬁnite number of closed vector subspaces. In ...

Patrick L. Combettes, Noli N. Reyes

claim paper

Read More »

200

click to vote

WSC
2008

154views Modeling And Simulation» more WSC 2008»

On step sizes, stochastic shortest paths, and survival probabilities in Reinforcement Learning

15 years 9 months ago

Download www.informs-sim.org

Reinforcement Learning (RL) is a simulation-based technique useful in solving Markov decision processes if their transition probabilities are not easily obtainable or if the probl...

Abhijit Gosavi

claim paper

Read More »

173

click to vote

JAT
2006

64views more JAT 2006»

Nonlinear function approximation: Computing smooth solutions with an adaptive greedy algorithm

15 years 7 months ago

Download www.sfb013.uni-linz.ac.at

Opposed to linear schemes, nonlinear function approximation allows to obtain a dimension independent rate of convergence. Unfortunately, in the presence of data noise typical algo...

Andreas Hofinger

claim paper

Read More »

194

click to vote

TNN
1998

111views more TNN 1998»

Asymptotic distributions associated to Oja's learning equation for neural networks

15 years 7 months ago

Download www-public.int-evry.fr

— In this paper, we perform a complete asymptotic performance analysis of the stochastic approximation algorithm (denoted subspace network learning algorithm) derived from Oja’...

Jean Pierre Delmas, Jean-Francois Cardos

claim paper

Read More »

195

click to vote

NIPS
2007

164views Information Technology» more NIPS 2007»

Incremental Natural Actor-Critic Algorithms

15 years 8 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

claim paper

Read More »

« Prev « First page 19 / 50 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers