Sciweavers

829 search results - page 33 / 166
» A time aggregation approach to Markov decision processes
Sort
View
PKDD
2009
Springer
129views Data Mining» more  PKDD 2009»
15 years 8 months ago
Considering Unseen States as Impossible in Factored Reinforcement Learning
Abstract. The Factored Markov Decision Process (FMDP) framework is a standard representation for sequential decision problems under uncertainty where the state is represented as a ...
Olga Kozlova, Olivier Sigaud, Pierre-Henri Wuillem...
CORR
2012
Springer
235views Education» more  CORR 2012»
13 years 9 months ago
An Incremental Sampling-based Algorithm for Stochastic Optimal Control
Abstract— In this paper, we consider a class of continuoustime, continuous-space stochastic optimal control problems. Building upon recent advances in Markov chain approximation ...
Vu Anh Huynh, Sertac Karaman, Emilio Frazzoli
116
Voted
SIGMETRICS
2006
ACM
121views Hardware» more  SIGMETRICS 2006»
15 years 8 months ago
Transient analysis of tree-Like processes and its application to random access systems
A new methodology to assess transient performance measures of tree-like processes is proposed by introducing the concept of tree-like processes with marked time epochs. As opposed...
Jeroen Van Velthoven, Benny Van Houdt, Chris Blond...
TSP
2008
106views more  TSP 2008»
15 years 1 months ago
An EM Algorithm for Ion-Channel Current Estimation
Parameter estimation of a continuous-time Markov chain observed through a discrete-time memoryless channel is studied. An expectation-maximization (EM) algorithm for maximum likeli...
William J. J. Roberts, Yariv Ephraim
94
Voted
CORR
2010
Springer
106views Education» more  CORR 2010»
15 years 2 months ago
MDPs with Unawareness
Markov decision processes (MDPs) are widely used for modeling decision-making problems in robotics, automated control, and economics. Traditional MDPs assume that the decision mak...
Joseph Y. Halpern, Nan Rong, Ashutosh Saxena