ECML 2005 (Springer)

Using Rewards for Belief State Updates in Partially Observable Markov Decision Processes

Partially Observable Markov Decision Processes (POMDPs) provide a standard framework for sequential decision making in stochastic environments. In this setting, an agent takes actions and receives observations and rewards from the environment. Many POMDP solution methods are based on computing a belief state, a probability distribution over the states the agent could be in, and the agent's action choice is then based on this belief state. The belief state is computed from a model of the environment and the history of actions and observations seen by the agent; however, reward information is not taken into account in the update. In this paper, we argue that rewards can carry useful information that helps disambiguate the hidden state. We present a method for updating the belief state which takes rewards into account, and we report experiments with exact and approximate planning methods on several standard POMDP domains using this belief update method.
Masoumeh T. Izadi, Doina Precup
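
To make the idea concrete, below is a minimal sketch (in Python/NumPy, not taken from the paper) of a belief update that treats the observed reward as an extra observation. It assumes tabular transition and observation models T and O and deterministic rewards R; the function name and parameters are illustrative assumptions, not the authors' exact formulation.

    import numpy as np

    def belief_update_with_reward(b, a, o, r, T, O, R, tol=1e-9):
        """
        One-step POMDP belief update that also conditions on the
        observed reward (a sketch of the idea, assuming deterministic
        rewards).

        b : (S,)      current belief over states
        a : int       action taken
        o : int       observation received
        r : float     reward received
        T : (A, S, S) T[a, s, s'] = P(s' | s, a)
        O : (A, S, O) O[a, s', o] = P(o | s', a)
        R : (A, S)    R[a, s] = reward for taking a in s (deterministic)
        """
        # Reward likelihood: zero out states whose deterministic reward
        # is inconsistent with the reward actually received.
        reward_lik = (np.abs(R[a] - r) < tol).astype(float)
        b_r = b * reward_lik
        if b_r.sum() == 0.0:
            # Reward inconsistent with the whole belief (e.g. model
            # mismatch): fall back to the standard update.
            b_r = b
        # Standard belief update: predict through T, weight by O.
        b_next = O[a, :, o] * (T[a].T @ b_r)
        return b_next / b_next.sum()

With a stochastic reward model, the indicator on R would be replaced by a reward likelihood P(r | s, a); when the reward carries no information about the state, the update reduces to the standard POMDP belief update.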
Added: 27 Jun 2010
Updated: 27 Jun 2010
Type: Conference
Year: 2005
Where: ECML
Authors: Masoumeh T. Izadi, Doina Precup