NIPS
2003

Approximate Planning in POMDPs with Macro-Actions


Recent research has demonstrated that useful POMDP solutions do not require consideration of the entire belief space. We extend this idea with the notion of temporal abstraction. We present and explore a new reinforcement learning algorithm over grid-points in belief space, which uses macro-actions and Monte Carlo updates of the Q-values. We apply the algorithm to a large-scale robot navigation task and demonstrate that, with abstraction, we can consider an even smaller part of the belief space, learn POMDP policies faster, and gather information more efficiently.

Georgios Theocharous, Leslie Pack Kaelbling
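
The abstract combines three ingredients: a grid over belief space, macro-actions, and Monte Carlo updates of Q-values. The following is a minimal sketch of how those pieces could fit together, not the authors' implementation. Everything named here is an assumption made for illustration: the array layouts `T[a, s, s']`, `O[a, s', o]`, and `R[s, a]`, the helper names, the fixed-resolution rounding used to pick a grid cell, and the macro-action defined as "repeat one primitive action for a fixed number of steps". The paper's actual grid construction and macro-actions may differ.

```python
import numpy as np

def belief_update(b, a, o, T, O):
    """Bayes filter: b'(s') is proportional to O[a, s', o] * sum_s T[a, s, s'] * b(s)."""
    predicted = b @ T[a]                # predicted next-state distribution
    unnorm = O[a, :, o] * predicted     # weight by observation likelihood
    total = unnorm.sum()
    if total == 0.0:                    # observation impossible under the model
        return b
    return unnorm / total

def grid_key(b, resolution=4):
    """Snap a belief to a coarse grid cell (assumed scheme: simple rounding)."""
    return tuple(int(x) for x in np.round(np.asarray(b) * resolution))

def run_macro(b, s, action, steps, T, O, R, gamma, rng):
    """Hypothetical macro-action: repeat one primitive action `steps` times.

    Returns the final belief, the discounted reward collected along the way,
    and the discount factor accumulated over the macro-action's duration.
    """
    ret, discount = 0.0, 1.0
    for _ in range(steps):
        ret += discount * R[s, action]
        s = rng.choice(T.shape[1], p=T[action, s])   # sample next hidden state
        o = rng.choice(O.shape[2], p=O[action, s])   # sample an observation
        b = belief_update(b, action, o, T, O)
        discount *= gamma
    return b, ret, discount

def mc_q_update(Q, b, m, macros, T, O, R, gamma, alpha, rng):
    """One Monte Carlo Q-value update at the grid cell containing belief b."""
    s = rng.choice(len(b), p=b)                      # sample a state from the belief
    action, steps = macros[m]
    b_next, ret, discount = run_macro(b, s, action, steps, T, O, R, gamma, rng)
    q = Q.setdefault(grid_key(b), np.zeros(len(macros)))
    q_next = Q.setdefault(grid_key(b_next), np.zeros(len(macros)))
    target = ret + discount * q_next.max()           # sampled return plus bootstrap
    q[m] += alpha * (target - q[m])
    return b_next

if __name__ == "__main__":
    # Toy two-state, one-action model used only to exercise the functions.
    T = np.array([[[0.9, 0.1], [0.1, 0.9]]])
    O = np.array([[[0.8, 0.2], [0.2, 0.8]]])
    R = np.array([[0.0], [1.0]])
    macros = [(0, 1), (0, 3)]                        # (primitive action, duration)
    rng = np.random.default_rng(0)
    Q, b = {}, np.array([0.5, 0.5])
    for _ in range(200):
        b = mc_q_update(Q, b, int(rng.integers(len(macros))), macros,
                        T, O, R, gamma=0.95, alpha=0.1, rng=rng)
    print(len(Q), "grid cells visited")
```

Because Q-values are stored only for grid cells that are actually reached, the table covers just the visited part of the belief space, which is the effect the abstract attributes to planning with abstraction.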


Post Info

Added	31 Oct 2010
Updated	31 Oct 2010
Type	Conference
Year	2003
Where	NIPS
Authors	Georgios Theocharous, Leslie Pack Kaelbling
