Search Sciweavers | Sciweavers

1262 search results - page 166 / 253

» Reinforcement Learning: An Introduction

124

click to vote

NIPS
1996

134views Information Technology» more NIPS 1996»

Why did TD-Gammon Work?

15 years 4 months ago

Download www.cse.unsw.edu.au

Although TD-Gammon is one of the major successes in machine learning, it has not led to similar impressive breakthroughs in temporal difference learning for other applications or ...

Jordan B. Pollack, Alan D. Blair

claim paper

Read More »

152

Voted

GECCO
2008
Springer

144views Optimization» more GECCO 2008»

Self-adaptive constructivism in Neural XCS and XCSF

15 years 4 months ago

Download www.cems.uwe.ac.uk

For artificial entities to achieve high degrees of autonomy they will need to display appropriate adaptability. In this sense adaptability includes representational flexibility gu...

Gerard David Howard, Larry Bull, Pier Luca Lanzi

claim paper

Read More »

114

click to vote

ICML
2010
IEEE

167views Machine Learning» more ICML 2010»

Finite-Sample Analysis of LSTD

15 years 4 months ago

Download hal.inria.fr

In this paper we consider the problem of policy evaluation in reinforcement learning, i.e., learning the value function of a fixed policy, using the least-squares temporal-differe...

Alessandro Lazaric, Mohammad Ghavamzadeh, Ré...

claim paper

Read More »

135

click to vote

ROBOCUP
2000
Springer

104views Robotics» more ROBOCUP 2000»

Essex Wizards 2000 Team Description

15 years 6 months ago

Download cswww.essex.ac.uk

: This article gives an overview of the Essex Wizards 2000 team participated in the RoboCup 2000 simulator league. A brief description of the agent architecture for the team is int...

Huosheng Hu, Kostas Kostiadis, Matthew Hunter, Kos...

claim paper

Read More »

119

click to vote

ESANN
2008

115views Neural Networks» more ESANN 2008»

15 years 4 months ago

Similarities and differences between policy gradient methods and evolution strategies

Download www.dice.ucl.ac.be

Natural policy gradient methods and the covariance matrix adaptation evolution strategy, two variable metric methods proposed for solving reinforcement learning tasks, are contrast...

Verena Heidrich-Meisner, Christian Igel

claim paper

Read More »

« Prev « First page 166 / 253 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers