Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

185

NIPS
1996

134views Information Technology» more NIPS 1996»

Why did TD-Gammon Work?

15 years 8 months ago

Why did TD-Gammon Work?

Download www.cse.unsw.edu.au

Although TD-Gammon is one of the major successes in machine learning, it has not led to similar impressive breakthroughs in temporal difference learning for other applications or even other games. We were able to replicate some of the success of TD-Gammon, developing a competitive evaluation function on a 4000 parameter feed-forward neural network, without using back-propagation, reinforcement or temporal difference learning methods. Instead we apply simple hill-climbing in a relative fitness environment. These results and further analysis suggest that the surprising success of Tesauro's program had more to do with the co-evolutionary structure of the learning task and the dynamics of the backgammon game itself.

Jordan B. Pollack, Alan D. Blair

Real-time Traffic

Difference Learning Methods | NIPS 1996 | NIPS 2007 | Similar Impressive Breakthroughs | Temporal Difference Learning |

claim paper

Related Content

» Why phishing works

» Why Cant They Create Architecture Models Like Developer X An Experience Report

» I just dont know why its gone maintaining informal information use in inpatient care

» How well do multiobjective evolutionary algorithms scale to large problems

» Mobile map interactions during a rendezvous exploring the implications of automation

» Beyond being in the lab using multiagent modeling to isolate competing hypotheses

» UIUC in HARD 2004Passage Retrieval Using HMMs

» Innovation Processes Revisited by Internet

» Coupling the Users The Benefits of Paired User Testing for iDTV

Post Info
More Details (n/a)

Added	02 Nov 2010
Updated	02 Nov 2010
Type	Conference
Year	1996
Where	NIPS
Authors	Jordan B. Pollack, Alan D. Blair

Comments (0)