Concurrent Probabilistic Temporal Planning with Policy-Gradients



We present an anytime concurrent probabilistic temporal planner that handles continuous and discrete uncertainties and metric functions. Our approach is a direct policy search that attempts to optimise a parameterised policy using gradient ascent. Low memory use, function approximation, and factorisation of the policy allow us to scale to challenging domains. This Factored Policy Gradient (FPG) planner also attempts to optimise both the number of steps to the goal and the probability of success. We compare the FPG planner with other planners on concurrent probabilistic temporal planning (CPTP) domains, and on simpler but better-studied probabilistic non-temporal domains.
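
The central idea in the abstract, gradient ascent on a parameterised and factored policy, can be illustrated with a minimal sketch. This is not the FPG planner or the authors' algorithm: it is a generic REINFORCE-style update on a hypothetical, stateless toy problem, in which each action has its own independent logistic "start / do not start" parameter. The names `toy_reward`, `train`, `sample_decisions`, the target pattern, and all step sizes are assumptions made for illustration only.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def sample_decisions(theta, rng):
    """Each factor independently decides whether to start its own action."""
    probs = [sigmoid(t) for t in theta]
    decisions = [1 if rng.random() < p else 0 for p in probs]
    return decisions, probs


def toy_reward(decisions):
    # Hypothetical objective (an assumption, not from the paper):
    # starting actions 0 and 2 helps, starting action 1 hurts.
    target = [1, 0, 1]
    matches = sum(1.0 for d, t in zip(decisions, target) if d == t)
    return matches / len(target)


def train(steps=5000, lr=0.1, seed=0):
    rng = random.Random(seed)
    theta = [0.0, 0.0, 0.0]   # one parameter per action: the "factored" policy
    baseline = 0.0            # running average reward, used to reduce variance
    for _ in range(steps):
        decisions, probs = sample_decisions(theta, rng)
        reward = toy_reward(decisions)
        advantage = reward - baseline
        baseline += 0.01 * (reward - baseline)
        for k in range(len(theta)):
            # For a Bernoulli policy with p = sigmoid(theta):
            # d/dtheta log P(decision) = decision - p
            theta[k] += lr * advantage * (decisions[k] - probs[k])
    return theta


if __name__ == "__main__":
    learned = train()
    print([round(sigmoid(t), 2) for t in learned])
```

Because each action keeps only its own small parameter vector, memory grows with the number of actions rather than with the size of the state space, which is the kind of scaling benefit the abstract attributes to factorisation. A real planner would condition each factor on state features instead of using a single scalar per action.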

Douglas Aberdeen, Olivier Buffet


AIPS 2007 | Artificial Intelligence | Direct Policy Search | FPG Planner | Probabilistic Temporal Planner

Added	02 Oct 2010
Updated	02 Oct 2010
Type	Conference
Year	2007
Where	AIPS
Authors	Douglas Aberdeen, Olivier Buffet
