Search Sciweavers | Sciweavers

16

ICML
2006
IEEE

131views Machine Learning» more ICML 2006»

14 years 8 months ago

We introduce relational temporal difference learning as an effective approach to solving multi-agent Markov decision problems with large state spaces. Our algorithm uses temporal ...

Nima Asgharbeygi, David J. Stracuzzi, Pat Langley

claim paper

Read More »

27

click to vote

CEC
2010
IEEE

216views Artificial Intelligence» more CEC 2010»

Coevolutionary Temporal Difference Learning for small-board Go

13 years 7 months ago

Download www.cs.put.poznan.pl

—In this paper we apply Coevolutionary Temporal Difference Learning (CTDL), a hybrid of coevolutionary search and reinforcement learning proposed in our former study, to evolve s...

Krzysztof Krawiec, Marcin Szubert

posted by mszubert

Read More »

20

click to vote

GECCO
2009
Springer

142views Optimization» more GECCO 2009»

A stopping criterion based on Kalman estimation techniques with several progress indicators

14 years 1 months ago

Download www.giaa.inf.uc3m.es

The need for a stopping criterion in MOEA’s is a repeatedly mentioned matter in the domain of MOOP’s, even though it is usually left aside as secondary, while stopping criteri...

José Luis Guerrero, Jesús Garc&iacut...

claim paper

Read More »

24

click to vote

NIPS
2008

130views Information Technology» more NIPS 2008»

Temporal Difference Based Actor Critic Learning - Convergence and Neural Implementation

13 years 8 months ago

Download eprints.pascal-network.org

Actor-critic algorithms for reinforcement learning are achieving renewed popularity due to their good convergence properties in situations where other approaches often fail (e.g.,...

Dotan Di Castro, Dmitry Volkinshtein, Ron Meir

claim paper

Read More »

22

click to vote

ICML
2009
IEEE

186views Machine Learning» more ICML 2009»

Regularization and feature selection in least-squares temporal difference learning

14 years 8 months ago

Download ai.stanford.edu

We consider the task of reinforcement learning with linear value function approximation. Temporal difference algorithms, and in particular the Least-Squares Temporal Difference (L...

J. Zico Kolter, Andrew Y. Ng

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers