Sciweavers

TALG
2010

Discounted deterministic Markov decision processes and discounted all-pairs shortest paths

13 years 2 months ago
Discounted deterministic Markov decision processes and discounted all-pairs shortest paths
We present two new algorithms for finding optimal strategies for discounted, infinite-horizon, Deterministic Markov Decision Processes (DMDP). The first one is an adaptation of an algorithm of Young, Tarjan and Orlin for finding minimum mean weight cycles. It runs in O(mn + n2 log n) time, where n is the number of vertices (or states) and m is the number of edges (or actions). The second one is an adaptation of a classical algorithm of Karp for finding minimum mean weight cycles. It runs in O(mn) time. The first algorithm has a slightly slower worst-case complexity, but is faster than the first algorithm in many situations. Both algorithms improve on a recent O(mn2 )-time algorithm of Andersson and Vorobyov. We also present a randomized ˜O(m1/2 n2 )-time algorithm for finding Discounted All-Pairs Shortest Paths (DAPSP), improving several previous algorithms.
Omid Madani, Mikkel Thorup, Uri Zwick
Added 30 Jan 2011
Updated 30 Jan 2011
Type Journal
Year 2010
Where TALG
Authors Omid Madani, Mikkel Thorup, Uri Zwick
Comments (0)