Why did TD-Gammon Work?

14 years 2 months ago

Download www.cse.unsw.edu.au

Although TD-Gammon is one of the major successes in machine learning, it has not led to similar impressive breakthroughs in temporal difference learning for other applications or even other games. We were able to replicate some of the success of TD-Gammon, developing a competitive evaluation function on a 4000 parameter feed-forward neural network, without using back-propagation, reinforcement or temporal difference learning methods. Instead we apply simple hill-climbing in a relative fitness environment. These results and further analysis suggest that the surprising success of Tesauro's program had more to do with the co-evolutionary structure of the learning task and the dynamics of the backgammon game itself.

Jordan B. Pollack, Alan D. Blair

Real-time Traffic

Difference Learning Methods | NIPS 1996 | NIPS 2007 | Similar Impressive Breakthroughs | Temporal Difference Learning |

claim paper

Post Info
More Details (n/a)

Added	02 Nov 2010
Updated	02 Nov 2010
Type	Conference
Year	1996
Where	NIPS
Authors	Jordan B. Pollack, Alan D. Blair

Comments (0)

Sciweavers

Why did TD-Gammon Work?

Difference Learning Methods | NIPS 1996 | NIPS 2007 | Similar Impressive Breakthroughs | Temporal Difference Learning |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers