Sciweavers

24 search results - page 1 / 5
» The Cross Entropy Method for Fast Policy Search
Sort
View
84
Voted
ICML
2003
IEEE
15 years 11 months ago
The Cross Entropy Method for Fast Policy Search
We present a learning framework for Markovian decision processes that is based on optimization in the policy space. Instead of using relatively slow gradient-based optimization al...
Shie Mannor, Reuven Y. Rubinstein, Yohai Gat
INFORMATICASI
2008
101views more  INFORMATICASI 2008»
14 years 11 months ago
The Cross-Entropy Method for Policy Search in Decentralized POMDPs
Frans A. Oliehoek, Julian F. P. Kooij, Nikos A. Vl...
93
Voted
TSMC
2011
258views more  TSMC 2011»
14 years 5 months ago
Cross-Entropy Optimization of Control Policies With Adaptive Basis Functions
—This paper introduces an algorithm for direct search of control policies in continuous-state discrete-action Markov decision processes. The algorithm looks for the best closed-l...
Lucian Busoniu, Damien Ernst, Bart De Schutter, Ro...
94
Voted
AAAI
2010
15 years 10 days ago
Relative Entropy Policy Search
Policy search is a successful approach to reinforcement learning. However, policy improvements often result in the loss of information. Hence, it has been marred by premature conv...
Jan Peters, Katharina Mülling, Yasemin Altun
75
Voted
WSC
2004
15 years 7 days ago
Global Likelihood Optimization Via the Cross-Entropy Method, with an Application to Mixture Models
Global likelihood maximization is an important aspect of many statistical analyses. Often the likelihood function is highly multi-extremal. This presents a significant challenge t...
Zdravko I. Botev, Dirk P. Kroese