Search Sciweavers | Sciweavers

23 search results - page 2 / 5

» The Cross-Entropy Method for Policy Search in Decentralized ...

105

Voted

UAI
2000

133views Artificial Intelligence» more UAI 2000»

PEGASUS: A policy search method for large MDPs and POMDPs

15 years 2 months ago

Download ai.stanford.edu

We propose a new approach to the problem of searching a space of policies for a Markov decision process (MDP) or a partially observable Markov decision process (POMDP), given a mo...

Andrew Y. Ng, Michael I. Jordan

claim paper

Read More »

117

Voted

IJCAI
2003

142views Artificial Intelligence» more IJCAI 2003»

Taming Decentralized POMDPs: Towards Efficient Policy Computation for Multiagent Settings

15 years 2 months ago

Download dli.iiit.ac.in

The problem of deriving joint policies for a group of agents that maximize some joint reward function can be modeled as a decentralized partially observable Markov decision proces...

Ranjit Nair, Milind Tambe, Makoto Yokoo, David V. ...

claim paper

Read More »

Voted

AIPS
2008

148views Artificial Intelligence» more AIPS 2008»

Exact Dynamic Programming for Decentralized POMDPs with Lossless Policy Compression

15 years 3 months ago

Download www.damas.ift.ulaval.ca

High dimensionality of belief space in DEC-POMDPs is one of the major causes that makes the optimal joint policy computation intractable. The belief state for a given agent is a p...

Abdeslam Boularias, Brahim Chaib-draa

claim paper

Read More »

118

Voted

ATAL
2007
Springer

142views Intelligent Agents» more ATAL 2007»

Q-value functions for decentralized POMDPs

15 years 7 months ago

Download www.science.uva.nl

Planning in single-agent models like MDPs and POMDPs can be carried out by resorting to Q-value functions: a (near-) optimal Q-value function is computed in a recursive manner by ...

Frans A. Oliehoek, Nikos A. Vlassis

claim paper

Read More »

114

Voted

ATAL
2010
Springer

164views Intelligent Agents» more ATAL 2010»

Point-based policy generation for decentralized POMDPs

15 years 2 months ago

Download anytime.cs.umass.edu

Memory-bounded techniques have shown great promise in solving complex multi-agent planning problems modeled as DEC-POMDPs. Much of the performance gains can be attributed to pruni...

Feng Wu, Shlomo Zilberstein, Xiaoping Chen

claim paper

Read More »

« Prev « First page 2 / 5 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers