Sciweavers

23 search results - page 1 / 5
INFORMATICASI
2008
13 years 4 months ago
The Cross-Entropy Method for Policy Search in Decentralized POMDPs
Frans A. Oliehoek, Julian F. P. Kooij, Nikos A. Vl...
ICML
2003
IEEE
14 years 5 months ago
The Cross Entropy Method for Fast Policy Search
We present a learning framework for Markovian decision processes that is based on optimization in the policy space. Instead of using relatively slow gradient-based optimization al...
Shie Mannor, Reuven Y. Rubinstein, Yohai Gat
TSMC
2011
12 years 11 months ago
Cross-Entropy Optimization of Control Policies With Adaptive Basis Functions
This paper introduces an algorithm for direct search of control policies in continuous-state discrete-action Markov decision processes. The algorithm looks for the best closed-l...
Lucian Busoniu, Damien Ernst, Bart De Schutter, Ro...
ATAL
2009
Springer
13 years 11 months ago
Lossless clustering of histories in decentralized POMDPs
Decentralized partially observable Markov decision processes (Dec-POMDPs) constitute a generic and expressive framework for multiagent planning under uncertainty. However, plannin...
Frans A. Oliehoek, Shimon Whiteson, Matthijs T. J....
AAMAS
2010
Springer
13 years 5 months ago
Optimizing fixed-size stochastic controllers for POMDPs and decentralized POMDPs
POMDPs and their decentralized multiagent counterparts, DEC-POMDPs, offer a rich framework for sequential decision making under uncertainty. Their computational complexity, howeve...
Christopher Amato, Daniel S. Bernstein, Shlomo Zil...