Memory-Bounded Dynamic Programming for DEC-POMDPs

13 years 6 months ago

Download anytime.cs.umass.edu

Decentralized decision making under uncertainty has been shown to be intractable when each agent has different partial information about the domain. Thus, improving the applicability and scalability of planning algorithms is an important challenge. We present the ﬁrst memory-bounded dynamic programming algorithm for ﬁnite-horizon decentralized POMDPs. A set of heuristics is used to identify relevant points of the inﬁnitely large belief space. Using these belief points, the algorithm successively selects the best joint policies for each horizon. The algorithm is extremely efﬁcient, having linear time and space complexity with respect to the horizon length. Experimental results show that it can handle horizons that are multiple orders of magnitude larger than what was previously possible, while achieving the same or better solution quality. These results signiﬁcantly increase the applicability of decentralized decision-making techniques.

Sven Seuken, Shlomo Zilberstein

Real-time Traffic