On Near Optimality of the Set of Finite-State Controllers for Average Cost POMDP

15 years 5 months ago

Download www.cs.helsinki.fi

We consider the average cost problem for partially observable Markov decision processes (POMDP) with finite state, observation, and control spaces. We prove that there exists an -optimal finite-state controller functionally independent of initial distributions for any > 0, under the assumption that the optimal liminf average cost function of the POMDP is constant. As part of our proof, we establish that if the optimal liminf average cost function is constant, then the optimal limsup average cost function is also constant, and the two are equal. We also discuss the connection between the existence of nearly optimal finite-history controllers and two other important issues for average cost POMDP: the existence of an average cost that is independent of the initial state distribution, and the existence of a bounded solution to the constant average cost optimality equation. May 2006 Revised: March, May 2007 Huizhen Yu was with the Laboratory for Information and Decision Systems (LIDS), ...

Huizhen Yu, Dimitri P. Bertsekas

Real-time Traffic

Average Cost | Average Cost Function | Liminf Average Cost | MOR 2008 |

claim paper

Added	13 Dec 2010
Updated	13 Dec 2010
Type	Journal
Year	2008
Where	MOR
Authors	Huizhen Yu, Dimitri P. Bertsekas

Sciweavers

On Near Optimality of the Set of Finite-State Controllers for Average Cost POMDP

Average Cost | Average Cost Function | Liminf Average Cost | MOR 2008 |

Explore & Download

Productivity Tools

Sciweavers