Search Sciweavers | Sciweavers

24 search results - page 1 / 5

» The Cross Entropy Method for Fast Policy Search

193

click to vote

ICML
2003
IEEE

165views Machine Learning» more ICML 2003»

The Cross Entropy Method for Fast Policy Search

16 years 8 months ago

Download www.hpl.hp.com

We present a learning framework for Markovian decision processes that is based on optimization in the policy space. Instead of using relatively slow gradient-based optimization al...

Shie Mannor, Reuven Y. Rubinstein, Yohai Gat

claim paper

Read More »

127

click to vote

INFORMATICASI
2008

101views more INFORMATICASI 2008»

The Cross-Entropy Method for Policy Search in Decentralized POMDPs

15 years 7 months ago

Download www.informatica.si

Frans A. Oliehoek, Julian F. P. Kooij, Nikos A. Vl...

claim paper

Read More »

196

click to vote

TSMC
2011

258views more TSMC 2011»

Cross-Entropy Optimization of Control Policies With Adaptive Basis Functions

15 years 2 months ago

Download www.montefiore.ulg.ac.be

—This paper introduces an algorithm for direct search of control policies in continuous-state discrete-action Markov decision processes. The algorithm looks for the best closed-l...

Lucian Busoniu, Damien Ernst, Bart De Schutter, Ro...

claim paper

Read More »

206

click to vote

AAAI
2010

191views Intelligent Agents» more AAAI 2010»

Relative Entropy Policy Search

15 years 9 months ago

Download www.kyb.tuebingen.mpg.de

Policy search is a successful approach to reinforcement learning. However, policy improvements often result in the loss of information. Hence, it has been marred by premature conv...

Jan Peters, Katharina Mülling, Yasemin Altun

claim paper

Read More »

206

click to vote

WSC
2004

105views Modeling And Simulation» more WSC 2004»

Global Likelihood Optimization Via the Cross-Entropy Method, with an Application to Mixture Models

15 years 9 months ago

Download espace.library.uq.edu.au

Global likelihood maximization is an important aspect of many statistical analyses. Often the likelihood function is highly multi-extremal. This presents a significant challenge t...

Zdravko I. Botev, Dirk P. Kroese

claim paper

Read More »

« Prev « First page 1 / 5 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers