Search Sciweavers | Sciweavers

23 search results - page 1 / 5

» The Cross-Entropy Method for Policy Search in Decentralized ...

click to vote

INFORMATICASI
2008

101views more INFORMATICASI 2008»

The Cross-Entropy Method for Policy Search in Decentralized POMDPs

13 years 4 months ago

Download www.informatica.si

Frans A. Oliehoek, Julian F. P. Kooij, Nikos A. Vl...

claim paper

Read More »

click to vote

ICML
2003
IEEE

165views Machine Learning» more ICML 2003»

The Cross Entropy Method for Fast Policy Search

14 years 5 months ago

Download www.hpl.hp.com

We present a learning framework for Markovian decision processes that is based on optimization in the policy space. Instead of using relatively slow gradient-based optimization al...

Shie Mannor, Reuven Y. Rubinstein, Yohai Gat

claim paper

Read More »

click to vote

TSMC
2011

258views more TSMC 2011»

Cross-Entropy Optimization of Control Policies With Adaptive Basis Functions

12 years 11 months ago

Download www.montefiore.ulg.ac.be

—This paper introduces an algorithm for direct search of control policies in continuous-state discrete-action Markov decision processes. The algorithm looks for the best closed-l...

Lucian Busoniu, Damien Ernst, Bart De Schutter, Ro...

claim paper

Read More »

click to vote

ATAL
2009
Springer

103views Intelligent Agents» more ATAL 2009»

Lossless clustering of histories in decentralized POMDPs

13 years 11 months ago

Download www.science.uva.nl

Decentralized partially observable Markov decision processes (Dec-POMDPs) constitute a generic and expressive framework for multiagent planning under uncertainty. However, plannin...

Frans A. Oliehoek, Shimon Whiteson, Matthijs T. J....

claim paper

Read More »

click to vote

AAMAS
2010
Springer

129views Intelligent Agents» more AAMAS 2010»

Optimizing fixed-size stochastic controllers for POMDPs and decentralized POMDPs

13 years 5 months ago

Download anytime.cs.umass.edu

POMDPs and their decentralized multiagent counterparts, DEC-POMDPs, offer a rich framework for sequential decision making under uncertainty. Their computational complexity, howeve...

Christopher Amato, Daniel S. Bernstein, Shlomo Zil...

claim paper

Read More »

« Prev « First page 1 / 5 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers