How to solve large scale deterministic games with mean payoff by policy iteration



Min-max functions are dynamic programming operators of zero-sum deterministic games with finite state and action spaces. The problem of computing the linear growth rate of the orbits (cycle-time) of a min-max function, which is equivalent to computing the value of a deterministic game with mean payoff, arises in the performance analysis of discrete event systems. We present here an improved version of the policy iteration algorithm given by Gaubert and Gunawardena in 1998 to compute the cycle-time of a min-max function. The improvement consists of a fast evaluation of the spectral projector which is adapted to the case of large sparse graphs. We present detailed numerical experiments, both on randomly generated instances and on concrete examples, indicating that the algorithm is experimentally fast.

Categories and Subject Descriptors: G.2.2 [Discrete Mathematics]: Graph Theory; G.4 [Mathematical software]: Algorithm design and analysis

General Terms: Algorithms, Performance

Keywords...
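To make the notion of cycle-time concrete, here is a minimal sketch that approximates it by repeatedly applying a min-max function f to the zero vector and dividing by the number of iterations, i.e. by estimating f^k(0)/k. This is plain value iteration, not the policy iteration algorithm of the paper, and it converges only at rate O(1/k). The graph encoding, the function names, and the two-node example are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple

# Hypothetical encoding: node i maps to (owner, edges), where owner is "min"
# or "max" and each edge (j, w) means "move to node j with payoff w".
Graph = Dict[int, Tuple[str, List[Tuple[int, float]]]]


def apply_min_max(graph: Graph, x: List[float]) -> List[float]:
    """Apply f once: f_i(x) is the min (or max) over edges (j, w) of w + x[j]."""
    y = []
    for i in range(len(x)):
        owner, edges = graph[i]
        candidates = [w + x[j] for j, w in edges]
        y.append(min(candidates) if owner == "min" else max(candidates))
    return y


def approximate_cycle_time(graph: Graph, iterations: int = 1000) -> List[float]:
    """Estimate the cycle-time vector as f^k(0) / k with k = iterations."""
    x = [0.0] * len(graph)
    for _ in range(iterations):
        x = apply_min_max(graph, x)
    return [xi / iterations for xi in x]


# Illustrative two-node game (not taken from the paper).
# Node 0 belongs to the maximizer, node 1 to the minimizer.
example: Graph = {
    0: ("max", [(0, 1.0), (1, 0.0)]),
    1: ("min", [(0, 5.0), (1, 2.0)]),
}

chi = approximate_cycle_time(example, iterations=1000)
```

In this example the maximizer moves to node 1 and the minimizer then stays on its self-loop of weight 2, so both entries of `chi` approach 2 as the number of iterations grows. The slow convergence of such direct iteration is one reason exact methods such as policy iteration are of interest for large instances.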

Vishesh Dhingra, Stephane Gaubert


Deterministic Game | Hardware | Min-max Functions | Policy Iteration | VALUETOOLS 2006 |


Post Info

Added	14 Jun 2010
Updated	14 Jun 2010
Type	Conference
Year	2006
Where	VALUETOOLS
Authors	Vishesh Dhingra, Stephane Gaubert
