Search Sciweavers | Sciweavers

141 search results - page 24 / 29

» Fuzzy Kanerva-based function approximation for reinforcement...

186

click to vote

TFS
2008

94views more TFS 2008»

Hierarchical Fuzzy CMAC for Nonlinear Systems Modeling

15 years 6 months ago

Download www.ctrl.cinvestav.mx

Abstract--Since the fuzzy cerebellar model articulation controller (FCMAC) uses linguistic variables, it is highly intuitive and easily comprehended. Despite the FCMAC's good ...

Wen Yu, Floriberto Ortiz Rodriguez, Marco A. Moren...

claim paper

Read More »

236

click to vote

Publication

222views

Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration

16 years 4 months ago

Download arxiv.org

Abstract: Several approximate policy iteration schemes without value functions, which focus on policy representation using classifiers and address policy learning as a supervis...

Christos Dimitrakakis, Michail G. Lagoudakis

posted by olethros

Read More »

186

click to vote

GECCO
2009
Springer

82views Optimization» more GECCO 2009»

On the scalability of XCS(F)

16 years 1 months ago

Download www.coboslab.psychologie.uni-wuerzburg.de

Many successful applications have proven the potential of Learning Classiﬁer Systems and the XCS classiﬁer system in particular in datamining, reinforcement learning, and func...

Patrick O. Stalph, Martin V. Butz, David E. Goldbe...

claim paper

Read More »

217

click to vote

ICML
1996
IEEE

162views Machine Learning» more ICML 1996»

Learning Evaluation Functions for Large Acyclic Domains

16 years 7 months ago

Download www.ri.cmu.edu

Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...

Justin A. Boyan, Andrew W. Moore

claim paper

Read More »

215

click to vote

ECML
2005
Springer

193views Machine Learning» more ECML 2005»

Natural Actor-Critic

16 years 17 days ago

Download www-clmc.usc.edu

This paper investigates a novel model-free reinforcement learning architecture, the Natural Actor-Critic. The actor updates are based on stochastic policy gradients employing Amari...

Jan Peters, Sethu Vijayakumar, Stefan Schaal

claim paper

Read More »

« Prev « First page 24 / 29 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers