Sciweavers

2005 search results - page 259 / 401
» Decisive Markov Chains
Sort
View
ACL
2008
15 years 3 months ago
Mixture Model POMDPs for Efficient Handling of Uncertainty in Dialogue Management
In spoken dialogue systems, Partially Observable Markov Decision Processes (POMDPs) provide a formal framework for making dialogue management decisions under uncertainty, but effi...
James Henderson, Oliver Lemon
UAI
2000
15 years 3 months ago
Value-Directed Belief State Approximation for POMDPs
We consider the problem belief-state monitoring for the purposes of implementing a policy for a partially-observable Markov decision process (POMDP), specifically how one might ap...
Pascal Poupart, Craig Boutilier
JMLR
2010
125views more  JMLR 2010»
14 years 8 months ago
Variational methods for Reinforcement Learning
We consider reinforcement learning as solving a Markov decision process with unknown transition distribution. Based on interaction with the environment, an estimate of the transit...
Thomas Furmston, David Barber
ICML
2004
IEEE
16 years 2 months ago
Bellman goes relational
Motivated by the interest in relational reinforcement learning, we introduce a novel relational Bellman update operator called ReBel. It employs a constraint logic programming lan...
Kristian Kersting, Martijn Van Otterlo, Luc De Rae...
ECBS
2009
IEEE
113views Hardware» more  ECBS 2009»
15 years 8 months ago
Modeling and Analysis of Probabilistic Timed Systems
Probabilistic models are useful for analyzing systems which operate under the presence of uncertainty. In this paper, we present a technique for verifying safety and liveness prop...
Abhishek Dubey, Derek Riley, Sherif Abdelwahed, Te...