Search Sciweavers | Sciweavers

272 search results - page 28 / 55

» Parallel Reinforcement Learning with Linear Function Approxi...

176

click to vote

ATAL
2009
Springer

146views Intelligent Agents» more ATAL 2009»

Online exploration in least-squares policy iteration

16 years 1 months ago

Download www.aamas-conference.org

One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...

Lihong Li, Michael L. Littman, Christopher R. Mans...

claim paper

Read More »

170

click to vote

CEC
2005
IEEE

99views Artificial Intelligence» more CEC 2005»

XCS with computed prediction for the learning of Boolean functions

16 years 14 days ago

Download www.eskimo.com

Computed prediction represents a major shift in learning classiﬁer system research. XCS with computed prediction, based on linear approximators, has been applied so far to functi...

Pier Luca Lanzi, Daniele Loiacono, Stewart W. Wils...

claim paper

Read More »

216

click to vote

TNN
2010

216views Management» more TNN 2010»

Simplifying mixture models through function approximation

15 years 1 months ago

Download books.nips.cc

Finite mixture model is a powerful tool in many statistical learning problems. In this paper, we propose a general, structure-preserving approach to reduce its model complexity, w...

Kai Zhang, James T. Kwok

claim paper

Read More »

196

click to vote

IROS
2007
IEEE

168views Robotics» more IROS 2007»

Improving humanoid locomotive performance with learnt approximated dynamics via Gaussian processes for regression

16 years 1 months ago

Download www.cs.cmu.edu

Abstract— We propose to improve the locomotive performance of humanoid robots by using approximated biped stepping and walking dynamics with reinforcement learning (RL). Although...

Jun Morimoto, Christopher G. Atkeson, Gen Endo, Go...

claim paper

Read More »

188

click to vote

TEC
2008

115views more TEC 2008»

Function Approximation With XCS: Hyperellipsoidal Conditions, Recursive Least Squares, and Compaction

15 years 6 months ago

Download www.coboslab.psychologie.uni-wuerzburg.de

An important strength of learning classifier systems (LCSs) lies in the combination of genetic optimization techniques with gradient-based approximation techniques. The chosen app...

Martin V. Butz, Pier Luca Lanzi, Stewart W. Wilson

claim paper

Read More »

« Prev « First page 28 / 55 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers