Search Sciweavers | Sciweavers

223 search results - page 28 / 45

» Least-Squares Temporal Difference Learning

IJCAI
2007

143 views, Artificial Intelligence

Direct Code Access in Self-Organizing Neural Networks for Reinforcement Learning

15 years 8 months ago

Download www.aaai.org

TD-FALCON is a self-organizing neural network that incorporates Temporal Difference (TD) methods for reinforcement learning. Despite the advantages of fast and stable learning, TD...

Ah-Hwee Tan

ICML
2007
IEEE

180 views, Machine Learning

Bayesian actor-critic algorithms

16 years 8 months ago

Download www.machinelearning.org

We present a new actor-critic learning model in which a Bayesian class of non-parametric critics, based on Gaussian process temporal difference learning, is used. Such critics model ...

Mohammad Ghavamzadeh, Yaakov Engel

ECAI
2008
Springer

158 views, Artificial Intelligence

A Simulation-based Approach for Solving Generalized Semi-Markov Decision Processes

15 years 9 months ago

Download emmanuel.rachelson.free.fr

Time is a crucial variable in planning and often requires special attention since it introduces a specific structure along with additional complexity, especially in the case of dec...

Emmanuel Rachelson, Gauthier Quesnel, Fréd...

ESANN
2008

278 views, Neural Networks

Learning to play Tetris applying reinforcement learning methods

15 years 8 months ago

Download www.dice.ucl.ac.be

In this paper the application of reinforcement learning to Tetris is investigated; in particular, temporal difference learning is applied to estimate the state value funct...

Alexander Groß, Jan Friedland, Friedhelm Sch...

EPS
1995
Springer

173 views, Artificial Intelligence

PANIC: A Parallel Evolutionary Rule Based System

15 years 11 months ago

Download www.cs.bris.ac.uk

PANIC (Parallelism And Neural networks In Classifier systems) is a parallel system to evolve behavioral strategies codified by sets of rules. It integrates several adaptive techni...

Antonella Giani, Fabrizio Baiardi, Antonina Starit...
