MDPs with Non-Deterministic Policies

14 years 2 months ago

Download www.cs.mcgill.ca

Markov Decision Processes (MDPs) have been extensively studied and used in the context of planning and decision-making, and many methods exist to find the optimal policy for problems modelled as MDPs. Although finding the optimal policy is sufficient in many domains, in certain applications such as decision support systems where the policy is executed by a human (rather than a machine), finding all possible near-optimal policies might be useful as it provides more flexibility to the person executing the policy. In this paper we introduce the new concept of non-deterministic MDP policies, and address the question of finding near-optimal non-deterministic policies. We propose two solutions to this problem, one based on a Mixed Integer Program and the other one based on a search algorithm. We include experimental results obtained from applying this framework to optimize treatment choices in the context of a medical decision support system.

Mahdi Milani Fard, Joelle Pineau

Real-time Traffic

Decision Support System | Information Technology | Near-optimal Non-deterministic Policies | NIPS 2008 | Optimal Policy |

claim paper

Post Info
More Details (n/a)

Added	30 Oct 2010
Updated	30 Oct 2010
Type	Conference
Year	2008
Where	NIPS
Authors	Mahdi Milani Fard, Joelle Pineau

Comments (0)

Sciweavers

MDPs with Non-Deterministic Policies

Decision Support System | Information Technology | Near-optimal Non-deterministic Policies | NIPS 2008 | Optimal Policy |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers