Sciweavers

121 search results - page 10 / 25
» Toward Off-Policy Learning Control with Function Approximati...
Sort
View
ICML
1996
IEEE
14 years 10 months ago
Learning Evaluation Functions for Large Acyclic Domains
Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...
Justin A. Boyan, Andrew W. Moore
JCP
2007
143views more  JCP 2007»
13 years 9 months ago
Noisy K Best-Paths for Approximate Dynamic Programming with Application to Portfolio Optimization
Abstract— We describe a general method to transform a non-Markovian sequential decision problem into a supervised learning problem using a K-bestpaths algorithm. We consider an a...
Nicolas Chapados, Yoshua Bengio
ICML
2000
IEEE
14 years 10 months ago
Rates of Convergence for Variable Resolution Schemes in Optimal Control
This paper presents a general method to derive tight rates of convergence for numerical approximations in optimal control when we consider variable resolution grids. We study the ...
Andrew W. Moore, Rémi Munos
ASC
2007
13 years 9 months ago
An approximate stability analysis of nonlinear systems described by Universal Learning Networks
Stability is one of the most important subjects in control systems. As for the stability of nonlinear dynamical systems, Lyapunov’s direct method and linearized stability analys...
Kotaro Hirasawa, Shingo Mabu, Shinji Eto, Jinglu H...
TFS
2008
94views more  TFS 2008»
13 years 8 months ago
Hierarchical Fuzzy CMAC for Nonlinear Systems Modeling
Abstract--Since the fuzzy cerebellar model articulation controller (FCMAC) uses linguistic variables, it is highly intuitive and easily comprehended. Despite the FCMAC's good ...
Wen Yu, Floriberto Ortiz Rodriguez, Marco A. Moren...