Value-Directed Belief State Approximation for POMDPs

Pascal Poupart, Craig Boutilier
We consider the problem of belief-state monitoring for the purposes of implementing a policy for a partially observable Markov decision process (POMDP), specifically how one might approximate the belief state. Other schemes for belief-state approximation (e.g., those based on minimizing a measure such as the KL-divergence between the true and estimated belief states) are not necessarily appropriate for POMDPs. Instead we propose a framework for analyzing value-directed approximation schemes, where approximation quality is determined by the expected error in utility rather than by the error in the belief state itself. We propose heuristic methods, exhibiting anytime characteristics, for finding good projection schemes for belief state estimation given a POMDP value function. We also describe several algorithms for constructing bounds on the error in decision quality (expected utility) associated with acting in accordance with a given belief state approximation.
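
To make the value-directed criterion concrete, here is a minimal sketch, assuming the standard representation of a POMDP value function as a set of alpha-vectors, each tagged with its greedy action. The names (ALPHAS, value_and_action, decision_quality_loss) and the two-state numbers are hypothetical illustrations, not the paper's algorithm: the sketch only computes the expected-utility loss incurred by acting greedily on an approximate belief instead of the true one.

import numpy as np

# A piecewise-linear POMDP value function: a set of alpha-vectors, each
# tagged with the action it recommends. All numbers here are made up.
ALPHAS = [
    ("a0", np.array([1.0, 0.0])),
    ("a1", np.array([0.0, 1.0])),
]

def value_and_action(alphas, belief):
    """V(b) = max over alpha-vectors of alpha . b, plus the maximizing action."""
    action, vec = max(alphas, key=lambda av: float(av[1] @ belief))
    return float(vec @ belief), action

def decision_quality_loss(alphas, b_true, b_approx):
    """Expected-utility loss from acting greedily w.r.t. b_approx instead of
    b_true: the value-directed error measure, as opposed to a divergence
    (e.g., KL) between the two beliefs themselves."""
    v_true, _ = value_and_action(alphas, b_true)
    _, a_hat = value_and_action(alphas, b_approx)  # action actually taken
    v_acted = max(float(vec @ b_true) for a, vec in alphas if a == a_hat)
    return v_true - v_acted

b = np.array([0.6, 0.4])          # true belief over two states
b_tilde = np.array([0.45, 0.55])  # an approximate (e.g., projected) belief
print(decision_quality_loss(ALPHAS, b, b_tilde))  # 0.2: the approximation flips the action

Note that the two beliefs here are fairly close under any divergence-style measure, yet the approximation changes the greedy action and costs 0.2 in expected value; a value-directed projection scheme is chosen to keep exactly this quantity (or a bound on it) small.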
Type: Conference
Year: 2000
Where: UAI