Sciweavers

LICS
2007
IEEE

Limits of Multi-Discounted Markov Decision Processes

Download www.labri.fr

Markov decision processes (MDPs) are controllable discrete event systems with stochastic transitions. The payoff received by the controller can be evaluated in different ways, depending on the payoff function the MDP is equipped with. For example, a mean-payoff function evaluates average performance, whereas a discounted payoff function gives more weight to earlier performance by means of a discount factor. Another well-known example is the parity payoff function, which is used to encode logical specifications [14]. Surprisingly, parity and mean-payoff MDPs share two non-trivial properties: they both have pure stationary optimal strategies [4, 15] and they both are approximable by discounted MDPs with multiple discount factors (multi-discounted MDPs) [5, 15]. In this paper we unify and generalize these results. We introduce a new class of payoff functions called the priority weighted payoff functions, which are a generalization of both parity and mean-payoff functions. We p...
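The contrast between mean-payoff and discounted evaluation can be illustrated with a minimal Python sketch on a finite prefix of a play. Everything here is an assumption made for illustration and is not taken from the paper: the function names, the normalization of the discounted sum by a (1 - discount) factor, and the per-step discount list used for the multi-discounted variant.

```python
from typing import Sequence


def mean_payoff(rewards: Sequence[float]) -> float:
    """Average reward over a finite prefix of a play."""
    return sum(rewards) / len(rewards)


def discounted_payoff(rewards: Sequence[float], discount: float) -> float:
    """Normalized discounted sum: (1 - d) * sum_i d**i * r_i (assumed form)."""
    weight = 1.0  # product of the discount factors seen so far
    total = 0.0
    for r in rewards:
        total += weight * (1.0 - discount) * r
        weight *= discount
    return total


def multi_discounted_payoff(rewards: Sequence[float],
                            discounts: Sequence[float]) -> float:
    """Same sum, but each step carries its own discount factor (assumed form)."""
    weight = 1.0
    total = 0.0
    for r, d in zip(rewards, discounts):
        total += weight * (1.0 - d) * r
        weight *= d
    return total


# A play that alternates between reward 0 and reward 1.
play = [0.0, 1.0] * 50_000

average = mean_payoff(play)
near_one = discounted_payoff(play, 0.999)  # discount factor close to 1
strong = discounted_payoff(play, 0.5)      # earlier steps dominate
```

On this alternating play the discounted value moves toward the average as the discount factor approaches 1, while a small discount factor lets the first few rewards dominate. This is only a numerical illustration of the approximation idea mentioned in the abstract, not the construction used by the authors.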

Hugo Gimbert, Wieslaw Zielonka

Computer Science | Discount Factors | LICS 2007 | Parity Payoff Function | Payoff Functions

Related Content

» Optimal Limited Contingency Planning

» An Iterative Algorithm for Solving Constrained Decentralized Markov Decision Processes

» Synthesis for PCTL in Parametric Markov Decision Processes

» Linear Program Approximations for Factored Continuous-State Markov Decision Processes

» Bounded-Parameter Partially Observable Markov Decision Processes

» A Fast Analytical Algorithm for Solving Markov Decision Processes with Real-Valued Resource...

» Feature Selection Using Regularization in Approximate Linear Programs for Markov Decision ...

» Optimal Resource Allocation and Policy Formulation in Loosely-Coupled Markov Decision Proce...

» Controlling deliberation in a Markov decision process-based agent

Post Info

Added	04 Jun 2010
Updated	04 Jun 2010
Type	Conference
Year	2007
Where	LICS
Authors	Hugo Gimbert, Wieslaw Zielonka