Sciweavers

23 search results - page 1 / 5
» The Cross-Entropy Method for Policy Search in Decentralized ...
Sort
View
INFORMATICASI
2008
101views more  INFORMATICASI 2008»
14 years 9 months ago
The Cross-Entropy Method for Policy Search in Decentralized POMDPs
Frans A. Oliehoek, Julian F. P. Kooij, Nikos A. Vl...
ICML
2003
IEEE
15 years 10 months ago
The Cross Entropy Method for Fast Policy Search
We present a learning framework for Markovian decision processes that is based on optimization in the policy space. Instead of using relatively slow gradient-based optimization al...
Shie Mannor, Reuven Y. Rubinstein, Yohai Gat
TSMC
2011
258views more  TSMC 2011»
14 years 4 months ago
Cross-Entropy Optimization of Control Policies With Adaptive Basis Functions
—This paper introduces an algorithm for direct search of control policies in continuous-state discrete-action Markov decision processes. The algorithm looks for the best closed-l...
Lucian Busoniu, Damien Ernst, Bart De Schutter, Ro...
ATAL
2009
Springer
15 years 4 months ago
Lossless clustering of histories in decentralized POMDPs
Decentralized partially observable Markov decision processes (Dec-POMDPs) constitute a generic and expressive framework for multiagent planning under uncertainty. However, plannin...
Frans A. Oliehoek, Shimon Whiteson, Matthijs T. J....
AAMAS
2010
Springer
14 years 9 months ago
Optimizing fixed-size stochastic controllers for POMDPs and decentralized POMDPs
POMDPs and their decentralized multiagent counterparts, DEC-POMDPs, offer a rich framework for sequential decision making under uncertainty. Their computational complexity, howeve...
Christopher Amato, Daniel S. Bernstein, Shlomo Zil...