Programming backgammon using self-teaching neural nets

TD-Gammon is a neural network that is able to teach itself to play backgammon solely by playing against itself and learning from the results. Starting from random initial play, TD-Gammon's self-teaching methodology results in a surprisingly strong program: without lookahead, its positional judgement rivals that of human experts, and when combined with shallow lookahead, it reaches a level of play that surpasses even the best human players. The success of TD-Gammon has also been replicated by several other programmers; at least two other neural net programs also appear to be capable of superhuman play. Previous papers on TD-Gammon have focused on developing a scientific understanding of its reinforcement learning methodology. This paper views machine learning as a tool in a programmer's toolkit, and considers how it can be combined with other programming techniques to achieve and surpass world-class backgammon play. Particular emphasis is placed on programming shallow-depth se...
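The self-play idea in the abstract can be illustrated with a deliberately tiny sketch. This is not Tesauro's implementation: it swaps backgammon for a toy game of Nim (take 1 to 3 stones, whoever takes the last stone wins), swaps the neural network for a lookup table, and uses a one-step TD(0)-style update rather than TD-Gammon's TD(lambda). The function names, the epsilon-random exploration, and all parameter values are assumptions made for illustration only.

```python
import random


def train_nim_self_play(max_stones=12, episodes=5000, alpha=0.2,
                        epsilon=0.1, seed=0):
    """Learn values[n]: estimated win probability for the player to move
    when n stones remain. One table plays both sides (self-play)."""
    rng = random.Random(seed)
    # values[0] = 0: the player to move with no stones left has already lost.
    # All other positions start at an uninformed 0.5.
    values = [0.0] + [0.5] * max_stones
    for _ in range(episodes):
        stones = rng.randint(1, max_stones)  # random start so every state is visited
        while stones > 0:
            moves = [k for k in (1, 2, 3) if k <= stones]
            # One-step lookahead: leave the opponent in their worst position.
            best = min(moves, key=lambda k: values[stones - k])
            # Bootstrapped target: my winning chance is one minus the
            # opponent's estimated chance in the resulting position.
            target = 1.0 - values[stones - best]
            values[stones] += alpha * (target - values[stones])
            # Mostly play the greedy move, occasionally explore (assumption).
            move = rng.choice(moves) if rng.random() < epsilon else best
            stones -= move
    return values


def greedy_move(values, stones):
    """Pick the move whose resulting position is worst for the opponent."""
    moves = [k for k in (1, 2, 3) if k <= stones]
    return min(moves, key=lambda k: values[stones - k])


if __name__ == "__main__":
    learned = train_nim_self_play()
    for n in range(1, len(learned)):
        print(n, round(learned[n], 3), "take", greedy_move(learned, n))
```

Starting from uninformed values, the table should come to rate positions with a multiple of four stones as losing for the player to move, which is the known optimal strategy for this variant of Nim. The same loop structure (evaluate successor positions, move, nudge the current estimate toward the next one) is what a neural network evaluator would replace the table in.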

Gerald Tesauro

AI 2002 | Artificial Intelligence | Neural Net Programs | Random Initial Play | World-class Backgammon Play |

Post Info

Added	16 Dec 2010
Updated	16 Dec 2010
Type	Journal
Year	2002
Where	AI
Authors	Gerald Tesauro
