Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

111

AAAI
1996

favoriteEmaildiscussreport

197views Intelligent Agents» more AAAI 1996»

Computing Optimal Policies for Partially Observable Decision Processes Using Compact Representations

15 years 2 months ago

Computing Optimal Policies for Partially Observable Decision Processes Using Compact Representations

Download people.cs.ubc.ca

: Partially-observable Markov decision processes provide a very general model for decision-theoretic planning problems, allowing the trade-offs between various courses of actions to be determined under conditions of uncertainty, and incorporating partial observations made by an agent. Dynamic programming algorithms based on the information or belief state of an agent can be used to construct optimal policies without explicit consideration of past history, but at high computational cost. In this paper, we discuss how structured representations of the system dynamics can be incorporated in classic POMDP solution algorithms. We use Bayesian networks with structured conditional probability matrices to represent POMDPs, and use this representation to structure the belief space for POMDP algorithms. This allows irrelevant distinctions to be ignored. Apart from speeding up optimal policy construction, we suggest that such representations can be exploited to great extent in the development of ...

Craig Boutilier, David Poole

Real-time Traffic

AAAI 1996 | Dynamic Programming Algorithms | Intelligent Agents | Partially-observable Markov Decision | POMDP Solution Algorithms |

claim paper

Related Content

» Planning with POMDPs Using a Compact LogicBased Representation

» Automated handwashing assistance for persons with dementia using video and a partially obs...

» Controlling Listeningoriented Dialogue using Partially Observable Markov Decision Processe...

» Active Learning in Partially Observable Markov Decision Processes

» Sensor Scheduling for Optimal Observability Using Estimation Entropy

» A Planning Algorithm for Predictive State Representations

» Dynamic Control of Data Ferries under Partial Observations

» Predictive representations for policy gradient in POMDPs

» Planning under Uncertainty for Robotic Tasks with Mixed Observability

Post Info
More Details (n/a)

Added	02 Nov 2010
Updated	02 Nov 2010
Type	Conference
Year	1996
Where	AAAI
Authors	Craig Boutilier, David Poole

Comments (0)