Search Sciweavers | Sciweavers

91 search results - page 8 / 19

» Event-based control using quadratic approximate value functi...

190

click to vote

ICML
2000
IEEE

169views Machine Learning» more ICML 2000»

Rates of Convergence for Variable Resolution Schemes in Optimal Control

16 years 7 months ago

Download sequel.futurs.inria.fr

This paper presents a general method to derive tight rates of convergence for numerical approximations in optimal control when we consider variable resolution grids. We study the ...

Andrew W. Moore, Rémi Munos

claim paper

Read More »

229

click to vote

UAI
2008

242views Artificial Intelligence» more UAI 2008»

Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping

15 years 8 months ago

Download uai2008.cs.helsinki.fi

We consider the problem of efficiently learning optimal control policies and value functions over large state spaces in an online setting in which estimates must be available afte...

Richard S. Sutton, Csaba Szepesvári, Alborz...

claim paper

Read More »

251

click to vote

FOGA
2011

254views Artificial Intelligence» more FOGA 2011»

Approximating the distribution of fitness over hamming regions

14 years 10 months ago

Download www.cs.colostate.edu

The distribution of ﬁtness values across a set of states sharply inﬂuences the dynamics of evolutionary processes and heuristic search in combinatorial optimization. In this p...

Andrew M. Sutton, Darrell Whitley, Adele E. Howe

claim paper

Read More »

146

click to vote

ICPR
2006
IEEE

260views computer vision» more ICPR 2006»

Control Double Inverted Pendulum by Reinforcement Learning with Double CMAC Network

16 years 8 months ago

Download ee2.chit.edu.tw

To accelerate the learning of reinforcement learning, many types of function approximation are used to represent state value. However function approximation reduces the accuracy o...

Siwei Luo, Yu Zheng, Ziang Lv

claim paper

Read More »

216

click to vote

ICML
1996
IEEE

162views Machine Learning» more ICML 1996»

Learning Evaluation Functions for Large Acyclic Domains

16 years 7 months ago

Download www.ri.cmu.edu

Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...

Justin A. Boyan, Andrew W. Moore

claim paper

Read More »

« Prev « First page 8 / 19 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers