Search Sciweavers | Sciweavers

10 search results - page 2 / 2

» Coevolutionary Temporal Difference Learning for small-board ...

185

click to vote

NIPS
1996

134views Information Technology» more NIPS 1996»

Why did TD-Gammon Work?

15 years 8 months ago

Download www.cse.unsw.edu.au

Although TD-Gammon is one of the major successes in machine learning, it has not led to similar impressive breakthroughs in temporal difference learning for other applications or ...

Jordan B. Pollack, Alan D. Blair

claim paper

Read More »

186

Voted

IJCAI
2007

173views Artificial Intelligence» more IJCAI 2007»

Reinforcement Learning of Local Shape in the Game of Go

15 years 8 months ago

Download webdocs.cs.ualberta.ca

We explore an application to the game of Go of a reinforcement learning approach based on a linear evaluation function and large numbers of binary features. This strategy has prov...

David Silver, Richard S. Sutton, Martin Mülle...

claim paper

Read More »

206

Voted

ACG
2003
Springer

157views Computer Graphics» more ACG 2003»

Evaluation in Go by a Neural Network using Soft Segmentation

16 years 2 days ago

Download webdocs.cs.ualberta.ca

In this article a neural network architecture is presented that is able to build a soft segmentation of a two-dimensional input. This network architecture is applied to position ev...

Markus Enzenberger

claim paper

Read More »

177

click to vote

JMLR
2002

100views more JMLR 2002»

On the Convergence of Optimistic Policy Iteration

15 years 6 months ago

Download www.mit.edu

We consider a finite-state Markov decision problem and establish the convergence of a special case of optimistic policy iteration that involves Monte Carlo estimation of Q-values,...

John N. Tsitsiklis

claim paper

Read More »

254

click to vote

ESANN
2008

278views Neural Networks» more ESANN 2008»

Learning to play Tetris applying reinforcement learning methods

15 years 8 months ago

Download www.dice.ucl.ac.be

In this paper the application of reinforcement learning to Tetris is investigated, particulary the idea of temporal difference learning is applied to estimate the state value funct...

Alexander Groß, Jan Friedland, Friedhelm Sch...

claim paper

Read More »

« Prev « First page 2 / 2 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers