Search Sciweavers | Sciweavers

313 search results - page 1 / 63

» Consistent Approximations and Approximate Functions and Grad...

194

click to vote

SIAMCO
2002

121views more SIAMCO 2002»

Consistent Approximations and Approximate Functions and Gradients in Optimal Control

15 years 6 months ago

Download www.ann.jussieu.fr

As shown in [7], optimal control problems with either ODE or PDE dynamics can be solved efficiently using a setting of consistent approximations obtained by numerical discretizati...

Olivier Pironneau, Elijah Polak

claim paper

Read More »

176

click to vote

JMLR
2006

143views more JMLR 2006»

Geometric Variance Reduction in Markov Chains: Application to Value Function and Gradient Estimation

15 years 6 months ago

Download www.aaai.org

We study a sequential variance reduction technique for Monte Carlo estimation of functionals in Markov Chains. The method is based on designing sequential control variates using s...

Rémi Munos

claim paper

Read More »

206

click to vote

ICML
2010
IEEE

231views Machine Learning» more ICML 2010»

Toward Off-Policy Learning Control with Function Approximation

15 years 8 months ago

Download www.sztaki.hu

We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...

Hamid Reza Maei, Csaba Szepesvári, Shalabh ...

claim paper

Read More »

191

click to vote

GECCO
2010
Springer

239views Optimization» more GECCO 2010»

Benchmarking SPSA on BBOB-2010 noiseless function testbed

15 years 10 months ago

Download sci2s.ugr.es

This paper presents the result for Simultaneous Perturbation Stochastic Approximation (SPSA) on the BBOB 2010 noiseless testbed. SPSA is a stochastic gradient approximation strate...

Steffen Finck, Hans-Georg Beyer

claim paper

Read More »

190

click to vote

ICML
1995
IEEE

184views Machine Learning» more ICML 1995»

Residual Algorithms: Reinforcement Learning with Function Approximation

16 years 7 months ago

Download www.leemon.com

A number of reinforcement learning algorithms have been developed that are guaranteed to converge to the optimal solution when used with lookup tables. It is shown, however, that ...

Leemon C. Baird III

claim paper

Read More »

« Prev « First page 1 / 63 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers