Improving Exploration in UCT Using Local Manifolds

Monte Carlo planning has proven successful in many sequential decision-making settings, but it explores poorly when rewards are sparse. In this paper, we improve exploration in UCT by generalizing across similar states using a given distance metric. When the state space has no natural distance metric, we show how to learn a local manifold from the transition graph of states in the near future and use it to obtain a distance metric. On domains inspired by video games, empirical evidence shows that our algorithm is more sample efficient than UCT, particularly when rewards are sparse.
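To make the idea of generalizing across similar states concrete, here is a minimal Python sketch. It is not the authors' algorithm: it assumes a Gaussian kernel over a user-supplied `distance` function and simply shares visit counts and returns between nearby states before applying a UCB1-style selection rule. The names `gaussian_kernel`, `generalized_stats`, `select_action`, the `bandwidth` parameter, and the `log(1 + n)` variant of the exploration bonus are all assumptions made for illustration.

```python
import math


def gaussian_kernel(d, bandwidth=1.0):
    """Similarity weight in (0, 1] for two states at distance d (assumed kernel)."""
    return math.exp(-((d / bandwidth) ** 2))


def generalized_stats(state, action, stats, distance, bandwidth=1.0):
    """Kernel-weighted visit count and mean return for (state, action).

    stats maps (state, action) -> (visit_count, total_return).
    """
    n_eff = 0.0
    total = 0.0
    for (s, a), (n, ret) in stats.items():
        if a != action:
            continue
        w = gaussian_kernel(distance(state, s), bandwidth)
        n_eff += w * n
        total += w * ret
    mean = total / n_eff if n_eff > 0 else 0.0
    return n_eff, mean


def select_action(state, actions, stats, distance, c=math.sqrt(2), bandwidth=1.0):
    """UCB1-style choice that uses the generalized counts instead of exact ones."""
    per_action = {
        a: generalized_stats(state, a, stats, distance, bandwidth) for a in actions
    }
    n_total = sum(n for n, _ in per_action.values())
    best_action, best_score = None, -math.inf
    for a in actions:
        n_eff, mean = per_action[a]
        if n_eff <= 0:
            # No similar state has tried this action yet: explore it first.
            return a
        # log(1 + n) keeps the bonus defined when the weighted total is below 1.
        score = mean + c * math.sqrt(math.log(1.0 + n_total) / n_eff)
        if score > best_score:
            best_action, best_score = a, score
    return best_action
```

With states represented as integers and `distance = lambda x, y: abs(x - y)`, statistics gathered at state 0 also inform the choice at state 1, while a far-away state such as 100 receives effectively zero weight and is treated as unexplored.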

Sriram Srinivasan, Erik Talvitie, Michael H. Bowling

AAAI 2015 | Intelligent Agents

Post Info

Added	27 Mar 2016
Updated	27 Mar 2016
Type	Journal
Year	2015
Where	AAAI
Authors	Sriram Srinivasan, Erik Talvitie, Michael H. Bowling
