Hierarchical Monte-Carlo Planning

8 years 10 months ago

Download ipvs.informatik.uni-stuttgart.de

Monte-Carlo Tree Search, especially UCT and its POMDP version POMCP, have demonstrated excellent performance on many problems. However, to efﬁciently scale to large domains one should also exploit hierarchical structure if present. In such hierarchical domains, ﬁnding rewarded states typically requires to search deeply; covering enough such informative states very far from the root becomes computationally expensive in ﬂat non-hierarchical search approaches. We propose novel, scalable MCTS methods which integrate a task hierarchy into the MCTS framework, speciﬁcally leading to hierarchical versions of both, UCT and POMCP. The new method does not need to estimate probabilistic models of each subtask, it instead computes subtask policies purely sample-based. We evaluate the hierarchical MCTS methods on various settings such as a hierarchical MDP, a Bayesian model-based hierarchical RL problem, and a large hierarchical POMDP.

Ngo Anh Vien, Marc Toussaint

Real-time Traffic

AAAI 2015 | Intelligent Agents |

claim paper

Post Info
More Details (n/a)

Added	27 Mar 2016
Updated	27 Mar 2016
Type	Journal
Year	2015
Where	AAAI
Authors	Ngo Anh Vien, Marc Toussaint

Comments (0)

Sciweavers

Hierarchical Monte-Carlo Planning

AAAI 2015 | Intelligent Agents |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers