Search Sciweavers | Sciweavers

61 search results - page 9 / 13

» Convergence of synchronous reinforcement learning with linea...

ICML
2000
IEEE

169 views, Machine Learning

Rates of Convergence for Variable Resolution Schemes in Optimal Control

16 years 8 months ago

Download sequel.futurs.inria.fr

This paper presents a general method to derive tight rates of convergence for numerical approximations in optimal control when we consider variable resolution grids. We study the ...

Andrew W. Moore, Rémi Munos

GECCO
2006
Springer

159 views, Optimization

Standard and averaging reinforcement learning in XCS

15 years 11 months ago

Download www.cs.bham.ac.uk

This paper investigates reinforcement learning (RL) in XCS. First, it formally shows that XCS implements a method of generalized RL based on linear approximators, in which the usu...

Pier Luca Lanzi, Daniele Loiacono

AIIDE
2006

123 views, Artificial Intelligence

The Self Organization of Context for Learning in MultiAgent Games

15 years 8 months ago

Download www.aaai.org

Reinforcement learning is an effective machine learning paradigm in domains represented by compact and discrete state-action spaces. In high-dimensional and continuous domains, ti...

Christopher D. White, Dave Brogan

ATAL
2008
Springer

123 views, Intelligent Agents

Sigma point policy iteration

15 years 9 months ago

Download web.mit.edu

In reinforcement learning, least-squares temporal difference methods (e.g., LSTD and LSPI) are effective, data-efficient techniques for policy evaluation and control with linear v...

Michael H. Bowling, Alborz Geramifard, David Winga...

CEC
2005
IEEE

99 views, Artificial Intelligence

XCS with computed prediction for the learning of Boolean functions

16 years 29 days ago

Download www.eskimo.com

Computed prediction represents a major shift in learning classifier system research. XCS with computed prediction, based on linear approximators, has been applied so far to functi...

Pier Luca Lanzi, Daniele Loiacono, Stewart W. Wils...
