Active Learning in Partially Observable Markov Decision Processes

This paper examines the problem of finding an optimal policy for a Partially Observable Markov Decision Process (POMDP) when the model is not known or is only poorly specified. We propose two formulations of the problem. The first incorporates a model of the uncertainty directly into the POMDP planning problem; this has interesting theoretical properties but becomes impractical when many of the parameters are uncertain. Our second approach, called MEDUSA, is an instance of active learning: we incrementally improve the POMDP model through selected queries while still optimizing reward. Results show good performance of the algorithm even on large problems: the most useful model parameters are learned quickly, and the agent accumulates high reward throughout the process.
Robin Jaulmes, Joelle Pineau, Doina Precup
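
The abstract gives only a high-level picture of MEDUSA, so the following is a minimal illustrative sketch of the query-based idea: maintain Dirichlet counts over uncertain transition parameters, sample candidate models from the posterior, and query an oracle about the parameter on which the samples disagree most. Everything here is an assumption for illustration: the two-state toy model, the variance-based query heuristic, and the oracle interface are hypothetical, and the sketch omits the planning and reward-optimization loop that MEDUSA interleaves with learning.

```python
import numpy as np

rng = np.random.default_rng(0)

N_STATES, N_ACTIONS = 2, 2

# Hypothetical "true" transition model T[a][s] = P(s' | s, a); in this
# sketch an oracle can reveal one sampled transition per query.
TRUE_T = np.array([
    [[0.9, 0.1], [0.2, 0.8]],
    [[0.5, 0.5], [0.7, 0.3]],
])

# Dirichlet counts encoding the agent's uncertainty over each row T[a][s].
counts = np.ones((N_ACTIONS, N_STATES, N_STATES))

def sample_models(k):
    """Draw k transition models from the current Dirichlet posterior."""
    models = np.empty((k, N_ACTIONS, N_STATES, N_STATES))
    for a in range(N_ACTIONS):
        for s in range(N_STATES):
            models[:, a, s, :] = rng.dirichlet(counts[a, s], size=k)
    return models

def most_uncertain(models):
    """Pick the (a, s) pair whose sampled rows disagree most (variance heuristic)."""
    disagreement = models.var(axis=0).sum(axis=-1)  # per-(a, s) spread
    return np.unravel_index(disagreement.argmax(), disagreement.shape)

for step in range(200):
    models = sample_models(k=20)
    a, s = most_uncertain(models)
    # Query: the oracle reveals one next-state sample for (s, a), and the
    # answer is folded back into the Dirichlet counts.
    s_next = rng.choice(N_STATES, p=TRUE_T[a, s])
    counts[a, s, s_next] += 1.0

print("learned T[0][0]:", counts[0, 0] / counts[0, 0].sum())
print("true    T[0][0]:", TRUE_T[0, 0])
```

Sampling a handful of models and measuring their disagreement is a cheap stand-in for the value-of-information criteria a full implementation would use to decide which queries are worth their cost.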
Type: Conference
Year: 2005
Venue: ECML (Springer)