Lower Bounds for Howard's Algorithm for Finding Minimum Mean-Cost Cycles

14 years 29 days ago

Download www.daimi.au.dk

Howard's policy iteration algorithm is one of the most widely used algorithms for finding optimal policies for controlling Markov Decision Processes (MDPs). When applied to weighted directed graphs, which may be viewed as Deterministic MDPs (DMDPs), Howard's algorithm can be used to find Minimum Mean-Cost cycles (MMCC). Experimental studies suggest that Howard's algorithm works extremely well in this context. The theoretical complexity of Howard's algorithm for finding MMCCs is a mystery. No polynomial time bound is known on its running time. Prior to this work, there were only linear lower bounds on the number of iterations performed by Howard's algorithm. We provide the first weighted graphs on which Howard's algorithm performs (n2 ) iterations, where n is the number of vertices in the graph.

Thomas Dueholm Hansen, Uri Zwick

Real-time Traffic

Algorithm | Algorithms | Howard's | ISAAC 2010 | Iterations |

claim paper

Post Info
More Details (n/a)

Added	13 Feb 2011
Updated	13 Feb 2011
Type	Journal
Year	2010
Where	ISAAC
Authors	Thomas Dueholm Hansen, Uri Zwick

Comments (0)

Sciweavers

Lower Bounds for Howard's Algorithm for Finding Minimum Mean-Cost Cycles

Algorithm | Algorithms | Howard's | ISAAC 2010 | Iterations |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers