Exploiting belief bounds: practical POMDPs for personal assistant agents

15 years 11 months ago

Download teamcore.usc.edu

Agents or agent teams deployed to assist humans often face the challenges of monitoring the state of key processes in their environment (including the state of their human users themselves) and making periodic decisions based on such monitoring. POMDPs appear well suited to enable agents to address these challenges, given the uncertain environment and cost of actions, but optimal policy generation for POMDPs is computationally expensive. This paper introduces three key techniques to speedup POMDP policy generation that exploit the notion of progress or dynamics in personal assistant domains. Policy computation is restricted to the belief space polytope that remains reachable given the progress structure of a domain. We introduce new algorithms; particularly one based on applying Lagrangian methods to compute a bounded belief space support in polynomial time. Our techniques are complementary to many existing exact and approximate POMDP policy generation algorithms. Indeed, we illustrat...

Pradeep Varakantham, Rajiv T. Maheswaran, Milind T

Real-time Traffic

ATAL 2005 | Policy Generation | Policy Generation Algorithms | POMDP Policy Generation |

claim paper

Post Info
More Details (n/a)

Added	26 Jun 2010
Updated	26 Jun 2010
Type	Conference
Year	2005
Where	ATAL
Authors	Pradeep Varakantham, Rajiv T. Maheswaran, Milind Tambe

Comments (0)

Sciweavers

Exploiting belief bounds: practical POMDPs for personal assistant agents

ATAL 2005 | Policy Generation | Policy Generation Algorithms | POMDP Policy Generation |

Explore & Download

Productivity Tools

Sciweavers