Scaling Up: Solving POMDPs through Value Based Clustering

Partially Observable Markov Decision Processes (POMDPs) provide an appropriately rich model for agents operating under partial knowledge of the environment. Since finding an optimal POMDP policy is intractable, approximation techniques have been a main focus of research, among them point-based algorithms, which scale up relatively well, to thousands of states. An important decision in a point-based algorithm is the order of backup operations over belief states. Prioritization techniques for ordering the sequence of backup operations reduce the number of needed backups considerably, but involve significant overhead. This paper suggests a new way to order backups, based on a soft clustering of the belief space. Our novel soft clustering method relies on the solution of the underlying MDP. Empirical evaluation verifies that our method rapidly computes a good order of backups, showing orders of magnitude improvement in runtime over a number of benchmarks.
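
The abstract does not spell out the clustering procedure, so the following is only a rough sketch of one plausible reading, not the authors' algorithm. It assumes that states are grouped by their value in the underlying (fully observable) MDP, and that a belief state belongs to each group in proportion to the probability mass it places on that group's states. The function names, the equal-width value bins, and the toy three-state MDP are all hypothetical choices made for illustration.

```python
def value_iteration(T, R, gamma=0.95, tol=1e-8):
    """Solve the underlying MDP.

    T[a][s][s2] is the transition probability, R[s][a] the reward.
    Returns the list of optimal state values V[s].
    """
    n_states, n_actions = len(R), len(T)
    V = [0.0] * n_states
    while True:
        new_V = [
            max(
                R[s][a] + gamma * sum(T[a][s][s2] * V[s2] for s2 in range(n_states))
                for a in range(n_actions)
            )
            for s in range(n_states)
        ]
        if max(abs(x - y) for x, y in zip(new_V, V)) < tol:
            return new_V
        V = new_V


def cluster_states_by_value(V, k):
    """Assign each state to one of k equal-width value bins (an assumption)."""
    lo, hi = min(V), max(V)
    width = (hi - lo) / k or 1.0  # avoid division by zero when all values are equal
    return [min(int((v - lo) / width), k - 1) for v in V]


def soft_membership(belief, state_cluster, k):
    """Degree to which a belief belongs to each cluster: summed state probability."""
    membership = [0.0] * k
    for s, p in enumerate(belief):
        membership[state_cluster[s]] += p
    return membership


# Toy MDP (hypothetical): a 3-state chain, state 2 is rewarding and absorbing.
# Action 0 stays in place, action 1 moves one state to the right.
T = [
    [[1, 0, 0], [0, 1, 0], [0, 0, 1]],  # stay
    [[0, 1, 0], [0, 0, 1], [0, 0, 1]],  # move right
]
R = [[0, 0], [0, 0], [1, 1]]

V = value_iteration(T, R, gamma=0.5)
clusters = cluster_states_by_value(V, k=2)
membership = soft_membership([0.2, 0.3, 0.5], clusters, k=2)
```

Under these assumptions, a backup ordering could then visit clusters from high value to low and, within each cluster, back up the beliefs with the largest membership first; whether the paper orders backups this way is not stated in the abstract.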

Yan Virin, Guy Shani, Solomon Eyal Shimony, Ronen I. Brafman

AAAI 2007 | Backup Operations | Intelligent Agents | Observable Markov Decision | Point-based Algorithm

Post Info

Added	02 Oct 2010
Updated	02 Oct 2010
Type	Conference
Year	2007
Where	AAAI
Authors	Yan Virin, Guy Shani, Solomon Eyal Shimony, Ronen I. Brafman
