Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

15

ECSQARU
2001
Springer

favoriteEmaildiscussreport

118views Automated Reasoning» more ECSQARU 2001»

Space-Progressive Value Iteration: An Anytime Algorithm for a Class of POMDPs

13 years 9 months ago

Space-Progressive Value Iteration: An Anytime Algorithm for a Class of POMDPs

Download www.cs.ust.hk

Abstract. Finding optimal policies for general partially observable Markov decision processes (POMDPs) is computationally difﬁcult primarily due to the need to perform dynamic-programming (DP) updates over the entire belief space. In this paper, we ﬁrst study a somewhat restrictive class of special POMDPs called almost-discernible POMDPs and propose an anytime algorithm called spaceprogressive value iteration(SPVI). SPVI does not perform DP updates over the entire belief space. Rather it restricts DP updates to a belief subspace that grows over time. It is argued that given sufﬁcient time SPVI can ﬁnd near-optimal policies for almost-discernible POMDPs. We then show how SPVI can be applied to more a general class of POMDPs. Empirical results are presented to show the effectiveness of SPVI.

Nevin Lianwen Zhang, Weihong Zhang

Real-time Traffic

Automated Reasoning | DP Updates | ECSQARU 2001 | Entire Belief Space | Observable Markov Decision |

claim paper

Related Content

» Pointbased value iteration An anytime algorithm for POMDPs

» Heuristic Search Value Iteration for POMDPs

» Improving Anytime PointBased Value Iteration Using Principled Point Selections

» Anytime PointBased Approximations for Large POMDPs

» An approximate algorithm for solving oracular POMDPs

» Theoretical Analysis of Heuristic Search Methods for Online POMDPs

» An Incremental Samplingbased Algorithm for Stochastic Optimal Control

» Cutandsolve An iterative search strategy for combinatorial optimization problems

Post Info
More Details (n/a)

Added	28 Jul 2010
Updated	28 Jul 2010
Type	Conference
Year	2001
Where	ECSQARU
Authors	Nevin Lianwen Zhang, Weihong Zhang

Comments (0)