Pure Stationary Optimal Strategies in Markov Decision Processes

14 years 9 months ago

Download www.labri.fr

Markov decision processes (MDPs) are controllable discrete event systems with stochastic transitions. Performances of an MDP are evaluated by a payoﬀ function. The controller of the MDP seeks to optimize those performances, using optimal strategies. There exists various ways of measuring performances, i.e. various classes of payoﬀ functions. For example, average performances can be evaluated by a mean-payoﬀ function, peak performances by a limsup payoﬀ function, and the parity payoﬀ function can be used to encode logical speciﬁcations. Surprisingly, all the MDPs equipped with mean, limsup or parity payoﬀ functions share a common non-trivial property: they admit pure stationary optimal strategies. In this paper, we introduce the class of preﬁx-independent and submixing payoﬀ functions, and we prove that any MDP equipped with such a payoﬀ function admits pure stationary optimal strategies. This result uniﬁes and simpliﬁes several existing proofs. Moreover, it is a...

Hugo Gimbert

Real-time Traffic

Optimal Strategies | Payoﬀ Function | STACS 2007 | Stationary Optimal Strategies | Theoretical Computer Science |

claim paper

Post Info
More Details (n/a)

Added	09 Jun 2010
Updated	09 Jun 2010
Type	Conference
Year	2007
Where	STACS
Authors	Hugo Gimbert

Comments (0)

Sciweavers

Pure Stationary Optimal Strategies in Markov Decision Processes

Optimal Strategies | Payoﬀ Function | STACS 2007 | Stationary Optimal Strategies | Theoretical Computer Science |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers