Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

33

AAAI
1997

favoriteEmaildiscussreport

133views Intelligent Agents» more AAAI 1997»

Incremental Methods for Computing Bounds in Partially Observable Markov Decision Processes

13 years 10 months ago

Incremental Methods for Computing Bounds in Partially Observable Markov Decision Processes

Download www.cs.pitt.edu

Partially observable Markov decision processes (POMDPs) allow one to model complex dynamic decision or control problems that include both action outcome uncertainty and imperfect observability. The control problem is formulated as a dynamic optimization problem with a value function combining costs or rewards from multiple steps. In this paper we propose, analyse and test various incremental methods for computing bounds on the value function for control problems with inﬁnite discounted horizon criteria. The methods described and tested include novel incremental versions of grid-based linear interpolation method and simple lower bound method with Sondik’s updates. Both of these can work with arbitrary points of the belief space and can be enhanced by various heuristic point selection strategies. Also introduced is a new method for computing an initial upper bound – the fast informed bound method. This method is able to improve signiﬁcantly on the standard and commonly usedupper...

Milos Hauskrecht

Real-time Traffic

AAAI 1997 | Bound Method | Control Problems | Intelligent Agents | Value Function |

claim paper

Related Content

» Incremental Pruning A Simple Fast Exact Method for Partially Observable Markov Decision Pr...

» Automatic Recovery Using Bounded Partially Observable Markov Decision Processes

» BoundedParameter Partially Observable Markov Decision Processes

» Qualitative Analysis of PartiallyObservable Markov Decision Processes

» ValueFunction Approximations for Partially Observable Markov Decision Processes

» Using Rewards for Belief State Updates in Partially Observable Markov Decision Processes

» Automated handwashing assistance for persons with dementia using video and a partially obs...

» Computing Optimal Policies for Partially Observable Decision Processes Using Compact Repre...

» Active Learning in Partially Observable Markov Decision Processes

Post Info
More Details (n/a)

Added	01 Nov 2010
Updated	01 Nov 2010
Type	Conference
Year	1997
Where	AAAI
Authors	Milos Hauskrecht

Comments (0)