Search Sciweavers | Sciweavers

154 search results - page 8 / 31

» Robust snake convergence based on dynamic programming

click to vote

CDC
2010
IEEE

136views Control Systems» more CDC 2010»

Pathologies of temporal difference methods in approximate dynamic programming

13 years 2 months ago

Download web.mit.edu

Approximate policy iteration methods based on temporal differences are popular in practice, and have been tested extensively, dating to the early nineties, but the associated conve...

Dimitri P. Bertsekas

claim paper

Read More »

click to vote

NIPS
1994

178views Information Technology» more NIPS 1994»

Generalization in Reinforcement Learning: Safely Approximating the Value Function

13 years 9 months ago

Download www.ri.cmu.edu

To appear in: G. Tesauro, D. S. Touretzky and T. K. Leen, eds., Advances in Neural Information Processing Systems 7, MIT Press, Cambridge MA, 1995. A straightforward approach to t...

Justin A. Boyan, Andrew W. Moore

claim paper

Read More »

click to vote

AMC
2007

115views more AMC 2007»

Evolutionary programming based on non-uniform mutation

13 years 7 months ago

Download www.mmrc.iss.ac.cn

Abstract–A new evolutionary programming algorithm (NEP) using the non-uniform mutation operator instead of Gaussian or Cauchy mutation operators is proposed. NEP has the merits o...

Xinchao Zhao, Xiao-Shan Gao, Ze-Chun Hu

claim paper

Read More »

click to vote

GLVLSI
2006
IEEE

185views VLSI» more GLVLSI 2006»

Application of fast SOCP based statistical sizing in the microprocessor design flow

14 years 1 months ago

Download www.cerc.utexas.edu

In this paper we have applied statistical sizing in an industrial setting. Efficient implementation of the statistical sizing algorithm is achieved by utilizing a dedicated interi...

Murari Mani, Mahesh Sharma, Michael Orshansky

claim paper

Read More »

click to vote

NIPS
2007

146views Information Technology» more NIPS 2007»

Receding Horizon Differential Dynamic Programming

13 years 9 months ago

Download books.nips.cc

The control of high-dimensional, continuous, non-linear dynamical systems is a key problem in reinforcement learning and control. Local, trajectory-based methods, using techniques...

Yuval Tassa, Tom Erez, William D. Smart

claim paper

Read More »

« Prev « First page 8 / 31 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers