Point-based policy generation for decentralized POMDPs

15 years 8 months ago

Download anytime.cs.umass.edu

Memory-bounded techniques have shown great promise in solving complex multi-agent planning problems modeled as DEC-POMDPs. Much of the performance gains can be attributed to pruning techniques that alleviate the complexity of the exhaustive backup step of the original MBDP algorithm. Despite these improvements, state-of-the-art algorithms can still handle a relative small pool of candidate policies, which limits the quality of the solution in some benchmark problems. We present a new algorithm, PointBased Policy Generation, which avoids altogether searching the entire joint policy space. The key observation is that the best joint policy for each reachable belief state can be constructed directly, instead of producing first a large set of candidates. We also provide an efficient approximate implementation of this operation. The experimental results show that our solution technique improves the performance significantly in terms of both runtime and solution quality. Categories and Subje...

Feng Wu, Shlomo Zilberstein, Xiaoping Chen

Real-time Traffic

ATAL 2010 | Intelligent Agents | Joint Policy | Multi-agent Planning | Multi-agent Planning Problems |

claim paper

» Security in multiagent systems by policy randomization

» Pointbased incremental pruning heuristic for solving finitehorizon DECPOMDPs

» Reasoning about joint beliefs for executiontime communication decisions

» Navigation Planning in Probabilistic Roadmaps with Uncertainty

Post Info
More Details (n/a)

Added	08 Nov 2010
Updated	08 Nov 2010
Type	Conference
Year	2010
Where	ATAL
Authors	Feng Wu, Shlomo Zilberstein, Xiaoping Chen

Comments (0)

Sciweavers

Point-based policy generation for decentralized POMDPs

ATAL 2010 | Intelligent Agents | Joint Policy | Multi-agent Planning | Multi-agent Planning Problems |

Explore & Download

Productivity Tools

Sciweavers