Adding Expert Knowledge and Exploration in Monte-Carlo Tree Search

15 years 8 months ago

Download www.personeel.unimaas.nl

Abstract. We present a new exploration term, more eﬃcient than classical UCT-like exploration terms. It combines eﬃciently expert rules, patterns extracted from datasets, All-Moves-As-First values, and classical online values. As this improved bandit formula does not solve several important situations (semeais, nakade) in computer Go, we present three other important improvements which are central in the recent progress of our program MoGo. – We show an expert-based improvement of Monte-Carlo simulations for nakade situations; we also emphasize some limitations of this modiﬁcation. – We show a technique which preserves diversity in the Monte-Carlo simulation, which greatly improves the results in 19x19. – Whereas the UCB-based exploration term is not eﬃcient in MoGo, we show a new exploration term which is highly eﬃcient in MoGo. MoGo recently won a game with handicap 7 against a 9Dan Pro player, Zhou JunXun, winner of the LG Cup 2007, and a game with handicap 6 against...

Guillaume Chaslot, Christophe Fiter, Jean-Baptiste

Real-time Traffic

ACG 2009 | Classical Uct-like Exploration | Computer Graphics | Exploration Term | Monte-carlo Simulation |

claim paper

» Generalized MonteCarlo Tree Search Extensions for General Game Playing

» Backtracking Through Biconnected Components of a Constraint Graph

» Architectural requirements of parallel computational biology applications with explicit in...

Post Info
More Details (n/a)

Added	25 May 2010
Updated	25 May 2010
Type	Conference
Year	2009
Where	ACG
Authors	Guillaume Chaslot, Christophe Fiter, Jean-Baptiste Hoock, Arpad Rimmel, Olivier Teytaud

Comments (0)

Sciweavers

Adding Expert Knowledge and Exploration in Monte-Carlo Tree Search

ACG 2009 | Classical Uct-like Exploration | Computer Graphics | Exploration Term | Monte-carlo Simulation |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers