Sciweavers

656 search results - page 11 / 132
» Complexity of finite-horizon Markov decision process problem...
Sort
View
133
Voted
CDC
2010
IEEE
141views Control Systems» more  CDC 2010»
14 years 8 months ago
A dynamic programming algorithm for decentralized Markov decision processes with a broadcast structure
We give an optimal dynamic programming algorithm to solve a class of finite-horizon decentralized Markov decision processes (MDPs). We consider problems with a broadcast informati...
Jeff Wu, Sanjay Lall
IJCAI
2007
15 years 3 months ago
Using Linear Programming for Bayesian Exploration in Markov Decision Processes
A key problem in reinforcement learning is finding a good balance between the need to explore the environment and the need to gain rewards by exploiting existing knowledge. Much ...
Pablo Samuel Castro, Doina Precup
AAAI
1997
15 years 3 months ago
Model Minimization in Markov Decision Processes
Many stochastic planning problems can be represented using Markov Decision Processes (MDPs). A difficulty with using these MDP representations is that the common algorithms for so...
Thomas Dean, Robert Givan
105
Voted
ICMAS
2000
15 years 3 months ago
Communication in Multi-Agent Markov Decision Processes
In this paper, we formulate agent's decision process under the framework of Markov decision processes, and in particular, the multi-agent extension to Markov decision process...
Ping Xuan, Victor R. Lesser, Shlomo Zilberstein
IJCAI
2007
15 years 3 months ago
Average-Reward Decentralized Markov Decision Processes
Formal analysis of decentralized decision making has become a thriving research area in recent years, producing a number of multi-agent extensions of Markov decision processes. Wh...
Marek Petrik, Shlomo Zilberstein