Sciweavers

1167 search results - page 61 / 234
» Relational Markov Games
Sort
View
106
Voted
UAI
1998
15 years 1 months ago
Hierarchical Solution of Markov Decision Processes using Macro-actions
tigate the use of temporally abstract actions, or macro-actions, in the solution of Markov decision processes. Unlike current models that combine both primitive actions and macro-...
Milos Hauskrecht, Nicolas Meuleau, Leslie Pack Kae...
IOR
2006
163views more  IOR 2006»
14 years 12 months ago
Adaptive Importance Sampling Technique for Markov Chains Using Stochastic Approximation
For a discrete-time finite-state Markov chain, we develop an adaptive importance sampling scheme to estimate the expected total cost before hitting a set of terminal states. This s...
T. P. I. Ahamed, Vivek S. Borkar, S. Juneja
JAIR
2000
152views more  JAIR 2000»
14 years 11 months ago
Value-Function Approximations for Partially Observable Markov Decision Processes
Partially observable Markov decision processes (POMDPs) provide an elegant mathematical framework for modeling complex decision and planning problems in stochastic domains in whic...
Milos Hauskrecht
SIAMCO
2002
86views more  SIAMCO 2002»
14 years 11 months ago
On the Observability and Detectability of Continuous-Time Markov Jump Linear Systems
The paper introduces a new detectability concept for continuous-time Markov jump linear systems with finite Markov space that generalizes previous concepts found in the literature....
Eduardo F. Costa, João Bosco Ribeiro do Val
PRL
2007
131views more  PRL 2007»
14 years 11 months ago
A new look at discriminative training for hidden Markov models
ct 7 Discriminative training for hidden Markov models (HMMs) has been a central theme in speech recognition research for many years. 8 One most popular technique is minimum classiï...
Xiaodong He, Li Deng