A Simulation-based Approach for Solving Generalized Semi-Markov Decision Processes

15 years 7 months ago

Download emmanuel.rachelson.free.fr

Time is a crucial variable in planning and often requires special attention since it introduces a specific structure along with additional complexity, especially in the case of decision under uncertainty. In this paper, after reviewing and comparing MDP frameworks designed to deal with temporal problems, we focus on Generalized Semi-Markov Decision Processes (GSMDP) with observable time. We highlight the inherent structure and complexity of these problems and present the differences with classical reinforcement learning problems. Finally, we introduce a new simulation-based reinforcement learning method for solving GSMDP, bringing together results from simulation-based policy iteration, regression techniques and simulation theory. We illustrate our approach on a subway network control example.

Emmanuel Rachelson, Gauthier Quesnel, Fréd&

Real-time Traffic

Artificial Intelligence | ECAI 2008 | Reinforcement Learning | Reinforcement Learning Problems | Simulation-based Policy Iteration |

claim paper

» Stochastic Deliberation Scheduling using GSMDPs

» CrossLayer Rate and Power Adaptation Strategies for IRHARQ Systems over Fading Channels wi...

» CommunicationBased Decomposition Mechanisms for Decentralized MDPs

» EventDriven Power Management of Portable Systems

» Solving multiagent assignment Markov decision processes

» A hierarchical decision procedure for productivity innovation in largescale petrochemical ...

» Extending the lifetime of a network of batterypowered mobile devices by remote processing ...

» A polynomial algorithm for decentralized Markov decision processes with temporal constrain...

Post Info
More Details (n/a)

Added	19 Oct 2010
Updated	19 Oct 2010
Type	Conference
Year	2008
Where	ECAI
Authors	Emmanuel Rachelson, Gauthier Quesnel, Frédérick Garcia, Patrick Fabiani

Comments (0)

Sciweavers

A Simulation-based Approach for Solving Generalized Semi-Markov Decision Processes

Artificial Intelligence | ECAI 2008 | Reinforcement Learning | Reinforcement Learning Problems | Simulation-based Policy Iteration |

Explore & Download

Productivity Tools

Sciweavers