Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

122

CDC
2008
IEEE

140views Control Systems» more CDC 2008»

Information state for Markov decision processes with network delays

15 years 11 months ago

Information state for Markov decision processes with network delays

Download wsl.stanford.edu

We consider a networked control system, where each subsystem evolves as a Markov decision process (MDP). Each subsystem is coupled to its neighbors via communication links over which the signals are delayed, but are otherwise transmitted noise-free. A centralized controller receives delayed state information from each subsystem. Such a networked Markov decision process with delays can be represented as a partially observed Markov decision process (POMDP). We show that this POMDP has an information state that depends only on a ﬁnite history of measurements and control. Thus, the POMDP can be converted into an information state MDP, whose state does not grow with time. The optimal controller for networked Markov decision processes can thus be computed using dynamic programming over a ﬁnite state space. This result generalizes the previous results on Markov decision processes with delayed state information.

Sachin Adlakha, Sanjay Lall, Andrea J. Goldsmith

Real-time Traffic

CDC 2008 | Control Systems | Markov Decision | Markov Decision Process | Networked Markov Decision |

claim paper

Related Content

» Cooperative Relay Scheduling under Partial State Information in Energy Harvesting Sensor N...

» Using Rewards for Belief State Updates in Partially Observable Markov Decision Processes

» A Markov Decision Process based flow assignment framework for heterogeneous network access

» A dynamic programming algorithm for decentralized Markov decision processes with a broadca...

» Active Learning of Dynamic Bayesian Networks in Markov Decision Processes

» Structured Reachability Analysis for Markov Decision Processes

» Computing Optimal Policies for Partially Observable Decision Processes Using Compact Repre...

» Purely Epistemic Markov Decision Processes

» Communication in MultiAgent Markov Decision Processes

Post Info
More Details (n/a)

Added	29 May 2010
Updated	29 May 2010
Type	Conference
Year	2008
Where	CDC
Authors	Sachin Adlakha, Sanjay Lall, Andrea J. Goldsmith

Comments (0)