Sciweavers

682 search results - page 77 / 137
» One-Counter Markov Decision Processes
Sort
View
77
Voted
NAACL
2007
15 years 2 months ago
Comparing User Simulation Models For Dialog Strategy Learning
This paper explores what kind of user simulation model is suitable for developing a training corpus for using Markov Decision Processes (MDPs) to automatically learn dialog strate...
Hua Ai, Joel R. Tetreault, Diane J. Litman
NIPS
2007
15 years 2 months ago
Online Linear Regression and Its Application to Model-Based Reinforcement Learning
We provide a provably efficient algorithm for learning Markov Decision Processes (MDPs) with continuous state and action spaces in the online setting. Specifically, we take a mo...
Alexander L. Strehl, Michael L. Littman
133
Voted
UAI
2007
15 years 2 months ago
Automatic Generation of Four-part Harmony
This paper introduces decision-theoretic planning techniques into automatic music generation. Markov decision processes (MDPs) are a mathematical model of planning under uncertain...
Liangrong Yi, Judy Goldsmith
94
Voted
ML
2002
ACM
146views Machine Learning» more  ML 2002»
15 years 15 days ago
Variable Resolution Discretization in Optimal Control
Abstract. The problemof state abstractionis of centralimportancein optimalcontrol,reinforcement learning and Markov decision processes. This paper studies the case of variable reso...
Rémi Munos, Andrew W. Moore
107
Voted
ECBS
2009
IEEE
113views Hardware» more  ECBS 2009»
15 years 7 months ago
Modeling and Analysis of Probabilistic Timed Systems
Probabilistic models are useful for analyzing systems which operate under the presence of uncertainty. In this paper, we present a technique for verifying safety and liveness prop...
Abhishek Dubey, Derek Riley, Sherif Abdelwahed, Te...