Search Sciweavers | Sciweavers

121 search results - page 10 / 25

» Toward Off-Policy Learning Control with Function Approximati...

196

click to vote

ICML
1996
IEEE

162views Machine Learning» more ICML 1996»

Learning Evaluation Functions for Large Acyclic Domains

16 years 7 months ago

Download www.ri.cmu.edu

Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...

Justin A. Boyan, Andrew W. Moore

claim paper

Read More »

196

click to vote

JCP
2007

143views more JCP 2007»

Noisy K Best-Paths for Approximate Dynamic Programming with Application to Portfolio Optimization

15 years 6 months ago

Download www.academypublisher.com

Abstract— We describe a general method to transform a non-Markovian sequential decision problem into a supervised learning problem using a K-bestpaths algorithm. We consider an a...

Nicolas Chapados, Yoshua Bengio

claim paper

Read More »

175

click to vote

ICML
2000
IEEE

169views Machine Learning» more ICML 2000»

Rates of Convergence for Variable Resolution Schemes in Optimal Control

16 years 7 months ago

Download sequel.futurs.inria.fr

This paper presents a general method to derive tight rates of convergence for numerical approximations in optimal control when we consider variable resolution grids. We study the ...

Andrew W. Moore, Rémi Munos

claim paper

Read More »

204

click to vote

ASC
2007

176views Artificial Intelligence» more ASC 2007»

An approximate stability analysis of nonlinear systems described by Universal Learning Networks

15 years 6 months ago

Download www.knu.edu.tw

Stability is one of the most important subjects in control systems. As for the stability of nonlinear dynamical systems, Lyapunov’s direct method and linearized stability analys...

Kotaro Hirasawa, Shingo Mabu, Shinji Eto, Jinglu H...

claim paper

Read More »

175

click to vote

TFS
2008

94views more TFS 2008»

Hierarchical Fuzzy CMAC for Nonlinear Systems Modeling

15 years 6 months ago

Download www.ctrl.cinvestav.mx

Abstract--Since the fuzzy cerebellar model articulation controller (FCMAC) uses linguistic variables, it is highly intuitive and easily comprehended. Despite the FCMAC's good ...

Wen Yu, Floriberto Ortiz Rodriguez, Marco A. Moren...

claim paper

Read More »

« Prev « First page 10 / 25 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers