Fast direct policy evaluation using multiscale analysis of Markov diffusion processes

14 years 5 months ago

Download www.cs.umass.edu

Policy evaluation is a critical step in the approximate solution of large Markov decision processes (MDPs), typically requiring O(|S|3 ) to directly solve the Bellman system of |S| linear equations (where |S| is the state space size). In this paper we apply a recently introduced multiscale framework for analysis on graphs to design a faster algorithm for policy evaluation. For a fixed policy , this framework efficiently constructs a multiscale decomposition of the random walk P associated with the policy . This enables efficiently computing medium and long term state distributions, approximation of value functions, and the direct computation of the potential operator (I - P )-1 needed to solve Bellman's equation. We show that even a preliminary non-optimized version of the solver competes with highly optimized iterative techniques, and can be computed in time O(|S| log2 |S|).

Mauro Maggioni, Sridhar Mahadevan

Real-time Traffic

ICML 2006 | Machine Learning | Multiscale Decomposition | Multiscale Framework | Policy Evaluation |

claim paper

Added	17 Nov 2009
Updated	17 Nov 2009
Type	Conference
Year	2006
Where	ICML
Authors	Mauro Maggioni, Sridhar Mahadevan

Sciweavers

Fast direct policy evaluation using multiscale analysis of Markov diffusion processes

ICML 2006 | Machine Learning | Multiscale Decomposition | Multiscale Framework | Policy Evaluation |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers