Time is a crucial variable in planning and often requires special attention, since it introduces a specific structure along with additional complexity, especially in the case of decision-making under uncertainty. In this paper, after reviewing and comparing MDP frameworks designed to deal with temporal problems, we focus on Generalized Semi-Markov Decision Processes (GSMDPs) with observable time. We highlight the inherent structure and complexity of these problems and contrast them with classical reinforcement learning problems. Finally, we introduce a new simulation-based reinforcement learning method for solving GSMDPs, bringing together results from simulation-based policy iteration, regression techniques, and simulation theory. We illustrate our approach on a subway network control example.