Sciweavers

2990 search results - page 148 / 598
» Hidden Markov processes
Sort
View
CDC
2009
IEEE
133views Control Systems» more  CDC 2009»
15 years 10 months ago
Arbitrarily modulated Markov decision processes
— We consider decision-making problems in Markov decision processes where both the rewards and the transition probabilities vary in an arbitrary (e.g., nonstationary) fashion. We...
Jia Yuan Yu, Shie Mannor
153
Voted
AIPS
2009
15 years 7 months ago
Minimal Sufficient Explanations for Factored Markov Decision Processes
Explaining policies of Markov Decision Processes (MDPs) is complicated due to their probabilistic and sequential nature. We present a technique to explain policies for factored MD...
Omar Zia Khan, Pascal Poupart, James P. Black
COLING
2010
15 years 1 months ago
Controlling Listening-oriented Dialogue using Partially Observable Markov Decision Processes
This paper investigates how to automatically create a dialogue control component of a listening agent to reduce the current high cost of manually creating such components. We coll...
Toyomi Meguro, Ryuichiro Higashinaka, Yasuhiro Min...
171
Voted
IJCAI
2007
15 years 7 months ago
Using Linear Programming for Bayesian Exploration in Markov Decision Processes
A key problem in reinforcement learning is finding a good balance between the need to explore the environment and the need to gain rewards by exploiting existing knowledge. Much ...
Pablo Samuel Castro, Doina Precup
AUTOMATICA
2002
83views more  AUTOMATICA 2002»
15 years 6 months ago
A time aggregation approach to Markov decision processes
We propose a time aggregation approach for the solution of in
Xi-Ren Cao, Zhiyuan Ren, Shalabh Bhatnagar, Michae...