Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

11

STACS
2007
Springer

favoriteEmaildiscussreport

123views Theoretical Computer Science» more STACS 2007»

Pure Stationary Optimal Strategies in Markov Decision Processes

13 years 10 months ago

Pure Stationary Optimal Strategies in Markov Decision Processes

Download www.labri.fr

Markov decision processes (MDPs) are controllable discrete event systems with stochastic transitions. Performances of an MDP are evaluated by a payoﬀ function. The controller of the MDP seeks to optimize those performances, using optimal strategies. There exists various ways of measuring performances, i.e. various classes of payoﬀ functions. For example, average performances can be evaluated by a mean-payoﬀ function, peak performances by a limsup payoﬀ function, and the parity payoﬀ function can be used to encode logical speciﬁcations. Surprisingly, all the MDPs equipped with mean, limsup or parity payoﬀ functions share a common non-trivial property: they admit pure stationary optimal strategies. In this paper, we introduce the class of preﬁx-independent and submixing payoﬀ functions, and we prove that any MDP equipped with such a payoﬀ function admits pure stationary optimal strategies. This result uniﬁes and simpliﬁes several existing proofs. Moreover, it is a...

Hugo Gimbert

Real-time Traffic

Optimal Strategies | Payoﬀ Function | STACS 2007 | Stationary Optimal Strategies | Theoretical Computer Science |

claim paper

Related Content

» Limits of MultiDiscounted Markov Decision Processes

» Qualitative Analysis of PartiallyObservable Markov Decision Processes

» OneCounter Markov Decision Processes

» Multiobjective Model Checking of Markov Decision Processes

» Bursty Traffic in EnergyConstrained Opportunistic Spectrum Access

» Percentile optimization in uncertain Markov decision processes with application to efficie...

» OnLine Search for Solving Markov Decision Processes via Heuristic Sampling

» Mean field for Markov Decision Processes from Discrete to Continuous Optimization

» Finite Optimal Control for TimeBounded Reachability in CTMDPs and ContinuousTime Markov Ga...

Post Info
More Details (n/a)

Added	09 Jun 2010
Updated	09 Jun 2010
Type	Conference
Year	2007
Where	STACS
Authors	Hugo Gimbert

Comments (0)