Finite horizon risk sensitive MDP and linear programming

9 years 11 months ago

Download www.ieor.iitb.ac.in

— In the context of standard Markov decision processes (MDPs), the connection between Dynamic Program (DP) and Linear Program (LP) is well understood and is well established under sufﬁciently general conditions. LP based approach facilitates solving the constrained MDPs. Multiplicative or Risk sensitive MDPs, introduced to control the ﬂuctuations/variations around the expected value, are relatively less studied objects. DP equations are considerably well understood even in the context of Risk MDPs, however the LP connection is not known. We consider a ﬁnite horizon risk MDP problem and establish the connections between the DP and LP approaches. We augment the state space with a suitable component, to obtain the optimal policies for constrained risk MDPs. We apply this results to a server selection problem in Ber/M/K/K queues, with a constraint on the utilization of the fast server. We discuss some interesting structural properties of the risk optimal policies.

Atul Kumar, Veeraruna Kavitha, N. Hemachandra

Real-time Traffic

CDC 2015 | Control Systems |

claim paper

Post Info
More Details (n/a)

Added	18 Apr 2016
Updated	18 Apr 2016
Type	Journal
Year	2015
Where	CDC
Authors	Atul Kumar, Veeraruna Kavitha, N. Hemachandra

Comments (0)

Sciweavers

Finite horizon risk sensitive MDP and linear programming

CDC 2015 | Control Systems |

Explore & Download

Productivity Tools

Sciweavers