Sciweavers

27 search results (page 4 of 6) for "Policy Gradient Method for Team Markov Games"
CDC 2010 (IEEE)
Pathologies of temporal difference methods in approximate dynamic programming
Approximate policy iteration methods based on temporal differences are popular in practice, and have been tested extensively, dating to the early nineties, but the associated conve...
Dimitri P. Bertsekas
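
A minimal sketch of the temporal-difference evaluation step these approximate policy iteration methods build on: semi-gradient TD(0) with linear function approximation, run here on a toy random-walk chain (the environment and feature map are illustrative, not from the paper):

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy 5-state random-walk chain; reward +1 for exiting on the right.
N_STATES, GAMMA, ALPHA = 5, 0.9, 0.05
phi = np.eye(N_STATES)   # illustrative feature map (here: one-hot, i.e. tabular)
w = np.zeros(N_STATES)   # linear value estimate: V(s) ~= w @ phi[s]

for episode in range(2000):
    s = N_STATES // 2
    while True:
        s2 = s + rng.choice([-1, 1])
        done = s2 < 0 or s2 >= N_STATES
        r = 1.0 if s2 >= N_STATES else 0.0
        # Semi-gradient TD(0): delta = r + gamma * V(s') - V(s)
        v_next = 0.0 if done else w @ phi[s2]
        delta = r + GAMMA * v_next - w @ phi[s]
        w += ALPHA * delta * phi[s]
        if done:
            break
        s = s2

print(np.round(w, 3))  # approximate state values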
IROS 2007 (IEEE)
Hysteretic Q-learning: an algorithm for decentralized reinforcement learning in cooperative multi-agent teams
Multi-agent systems (MAS) are a field of study of growing interest in a variety of domains such as robotics and distributed control. The article focuses on decentralized reinf...
Laëtitia Matignon, Guillaume J. Laurent, Nadi...
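
The defining idea of hysteretic Q-learning is an asymmetric update: a larger learning rate when the TD error is positive and a smaller one when it is negative, so an agent stays optimistic about bad outcomes caused by its teammates' exploration. A minimal sketch of that update rule (the class interface is illustrative):

```python
import numpy as np

class HystereticQ:
    """Q-learning with two learning rates: alpha for increases and
    beta < alpha for decreases, so losses likely caused by teammates'
    exploration are discounted (the hysteresis)."""

    def __init__(self, n_states, n_actions, alpha=0.1, beta=0.01, gamma=0.95):
        self.q = np.zeros((n_states, n_actions))
        self.alpha, self.beta, self.gamma = alpha, beta, gamma

    def update(self, s, a, r, s2, done):
        target = r + (0.0 if done else self.gamma * self.q[s2].max())
        delta = target - self.q[s, a]
        lr = self.alpha if delta >= 0 else self.beta  # asymmetric step size
        self.q[s, a] += lr * delta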
ML 2006 (ACM)
Universal parameter optimisation in games based on SPSA
Most game programs have a large number of parameters that are crucial for their performance. While tuning these parameters by hand is rather difficult, efficient and easy to use ge...
Levente Kocsis, Csaba Szepesvári
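
SPSA (simultaneous perturbation stochastic approximation) estimates a gradient from just two noisy evaluations of the objective per step, perturbing every parameter at once with a random sign vector. A minimal sketch, with a noisy quadratic standing in for a game-strength measurement:

```python
import numpy as np

rng = np.random.default_rng(0)

def spsa_minimize(f, theta, iters=1000, a=0.1, c=0.1):
    """SPSA: two evaluations of f per step, regardless of dimension."""
    for k in range(1, iters + 1):
        ak, ck = a / k**0.602, c / k**0.101          # standard gain schedules
        delta = rng.choice([-1.0, 1.0], size=theta.shape)  # Rademacher perturbation
        # Two-sided finite difference along the single random direction.
        g_hat = (f(theta + ck * delta) - f(theta - ck * delta)) / (2 * ck * delta)
        theta = theta - ak * g_hat
    return theta

# Illustrative noisy objective standing in for measured game performance.
f = lambda th: np.sum(th**2) + rng.normal(scale=0.01)
print(np.round(spsa_minimize(f, theta=np.ones(10)), 3))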
ICML 2001 (IEEE)
Off-Policy Temporal-Difference Learning with Function Approximation
We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...
Doina Precup, Richard S. Sutton, Sanjoy Dasgupta
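
Off-policy TD corrects the mismatch between the behaviour policy generating the data and the target policy being evaluated with importance-sampling ratios. A minimal sketch of a generic importance-weighted TD(0) update with linear features (the basic idea only, not the paper's full per-decision TD(λ) algorithm):

```python
import numpy as np

def off_policy_td0_step(w, phi_s, phi_s2, r, done, pi_a, b_a,
                        alpha=0.01, gamma=0.99):
    """One importance-weighted TD(0) update.
    pi_a, b_a: probabilities of the taken action under the target
    policy pi and the behaviour policy b."""
    rho = pi_a / b_a                        # importance-sampling ratio
    v_next = 0.0 if done else w @ phi_s2
    delta = r + gamma * v_next - w @ phi_s
    return w + alpha * rho * delta * phi_s

# Illustrative single update with one-hot features.
w = np.zeros(4)
w = off_policy_td0_step(w, phi_s=np.eye(4)[0], phi_s2=np.eye(4)[1],
                        r=1.0, done=False, pi_a=0.9, b_a=0.5)
print(w)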
GECCO 2009 (Springer)
Uncertainty handling CMA-ES for reinforcement learning
The covariance matrix adaptation evolution strategy (CMA-ES) has proven to be a powerful method for reinforcement learning (RL). Recently, the CMA-ES has been augmented with an ada...
Verena Heidrich-Meisner, Christian Igel
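
The uncertainty-handling augmentation reevaluates candidates and increases the evaluation effort when noise reorders the ranking the strategy selects on. A simplified illustration of that principle on a plain evolution strategy (the paper's UH-CMA-ES additionally adapts step size and the covariance matrix; everything here is a sketch):

```python
import numpy as np

rng = np.random.default_rng(0)

def noisy_fitness(x, n_eval):
    """Average several noisy rollouts; stands in for an RL return estimate."""
    return np.mean([np.sum(x**2) + rng.normal(scale=0.5) for _ in range(n_eval)])

def simple_uh_es(dim=5, pop=10, parents=3, sigma=0.5, iters=100):
    mean, n_eval = np.zeros(dim), 1
    for _ in range(iters):
        xs = [mean + sigma * rng.standard_normal(dim) for _ in range(pop)]
        f1 = np.array([noisy_fitness(x, n_eval) for x in xs])
        f2 = np.array([noisy_fitness(x, n_eval) for x in xs])  # reevaluate
        # If reevaluation reshuffles the ranking, the fitness signal is too
        # noisy: spend more evaluations per candidate (uncertainty handling).
        rank_changes = np.mean(
            np.argsort(np.argsort(f1)) != np.argsort(np.argsort(f2)))
        n_eval = min(n_eval + 1, 50) if rank_changes > 0.3 else max(n_eval - 1, 1)
        best = np.argsort((f1 + f2) / 2)[:parents]
        mean = np.mean([xs[i] for i in best], axis=0)
    return mean, n_eval

print(simple_uh_es())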