Sciweavers

ICML
2003
IEEE

The Cross Entropy Method for Fast Policy Search

16 years 8 months ago

We present a learning framework for Markovian decision processes that is based on optimization in the policy space. Instead of using relatively slow gradient-based optimization al...

Shie Mannor, Reuven Y. Rubinstein, Yohai Gat
