Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

107

Voted

UAI
2008

favoriteEmaildiscussreport

194views Artificial Intelligence» more UAI 2008»

Hierarchical POMDP Controller Optimization by Likelihood Maximization

15 years 3 months ago

Hierarchical POMDP Controller Optimization by Likelihood Maximization

Download uai2008.cs.helsinki.fi

Planning can often be simplified by decomposing the task into smaller tasks arranged hierarchically. Charlin et al. [4] recently showed that the hierarchy discovery problem can be framed as a non-convex optimization problem. However, the inherent computational difficulty of solving such an optimization problem makes it hard to scale to realworld problems. In another line of research, Toussaint et al. [18] developed a method to solve planning problems by maximumlikelihood estimation. In this paper, we show how the hierarchy discovery problem in partially observable domains can be tackled using a similar maximum likelihood approach. Our technique first transforms the problem into a dynamic Bayesian network through which a hierarchical structure can naturally be discovered while optimizing the policy. Experimental results demonstrate that this approach scales better than previous techniques based on non-convex optimization.

Marc Toussaint, Laurent Charlin, Pascal Poupart

Real-time Traffic

Artificial Intelligence | Hierarchy Discovery Problem | Non-convex Optimization Problem | Optimization Problem | UAI 2008 |

claim paper

Related Content

» Synthesis of Hierarchical FiniteState Controllers for POMDPs

» A Framework of Stochastic Power Management Using Hidden Markov Model

» Bayesian reinforcement learning in continuous POMDPs with gaussian processes

» Bayesian reinforcement learning in continuous POMDPs with application to robot navigation

» Predictive representations for policy gradient in POMDPs

» Sensor Scheduling for Optimal Observability Using Estimation Entropy

» Stochastic Thresholding An approach to Estimator Optimization via Fisher Information Maxim...

» CrossValidation Optimization for Large Scale Structured Classification Kernel Methods

» Approximation Algorithms for PartialInformation Based Stochastic Control with Markovian Re...

Post Info
More Details (n/a)

Added	30 Oct 2010
Updated	30 Oct 2010
Type	Conference
Year	2008
Where	UAI
Authors	Marc Toussaint, Laurent Charlin, Pascal Poupart

Comments (0)