Sciweavers

2005 search results - page 72 / 401
» Decisive Markov Chains
Sort
View
MDAI
2005
Springer
15 years 7 months ago
Perceptive Evaluation for the Optimal Discounted Reward in Markov Decision Processes
We formulate a fuzzy perceptive model for Markov decision processes with discounted payoff in which the perception for transition probabilities is described by fuzzy sets. Our aim...
Masami Kurano, Masami Yasuda, Jun-ichi Nakagami, Y...
AAAI
2006
15 years 2 months ago
Learning Representation and Control in Continuous Markov Decision Processes
This paper presents a novel framework for simultaneously learning representation and control in continuous Markov decision processes. Our approach builds on the framework of proto...
Sridhar Mahadevan, Mauro Maggioni, Kimberly Fergus...
CORR
2011
Springer
183views Education» more  CORR 2011»
14 years 8 months ago
Mean-Variance Optimization in Markov Decision Processes
We consider finite horizon Markov decision processes under performance measures that involve both the mean and the variance of the cumulative reward. We show that either randomiz...
Shie Mannor, John N. Tsitsiklis
AIPS
2011
14 years 5 months ago
Sample-Based Planning for Continuous Action Markov Decision Processes
In this paper, we present a new algorithm that integrates recent advances in solving continuous bandit problems with sample-based rollout methods for planning in Markov Decision P...
Christopher R. Mansley, Ari Weinstein, Michael L. ...
ATAL
2008
Springer
15 years 3 months ago
Controlling deliberation in a Markov decision process-based agent
Meta-level control manages the allocation of limited resources to deliberative actions. This paper discusses efforts in adding meta-level control capabilities to a Markov Decision...
George Alexander, Anita Raja, David J. Musliner