
SARA
2007
Springer


Active Learning of Dynamic Bayesian Networks in Markov Decision Processes



Several recent techniques for solving Markov decision processes use dynamic Bayesian networks to compactly represent tasks. The dynamic Bayesian network representation may not be given, in which case it is necessary to learn it if one wants to apply these techniques. We develop an algorithm for learning dynamic Bayesian network representations of Markov decision processes using data collected through exploration in the environment. To accelerate data collection we develop a novel scheme for active learning of the networks. We assume that it is not possible to sample the process in arbitrary states, only along trajectories, which prevents us from applying existing active learning techniques. Our active learning scheme selects actions that maximize the total entropy of distributions used to evaluate potential refinements of the networks.
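The action-selection idea in the last sentence can be illustrated with a minimal sketch. This is not the authors' algorithm: it only shows, under stated assumptions, what "pick the action that maximizes total entropy" could look like. The names `entropy`, `total_entropy`, and `select_action` are hypothetical, and the sketch assumes each candidate action comes with a list of count vectors standing in for the distributions used to evaluate refinements.

```python
import math
from typing import Dict, List, Sequence


def entropy(counts: Sequence[float]) -> float:
    """Shannon entropy (bits) of the empirical distribution given by counts."""
    total = sum(counts)
    if total <= 0:
        return 0.0  # no data: treat the entropy as zero
    h = 0.0
    for c in counts:
        if c > 0:
            p = c / total
            h -= p * math.log2(p)
    return h


def total_entropy(distributions: List[Sequence[float]]) -> float:
    """Sum of the entropies of several count vectors."""
    return sum(entropy(d) for d in distributions)


def select_action(candidates: Dict[str, List[Sequence[float]]]) -> str:
    """Return the action whose (assumed) resulting count vectors have
    the largest total entropy.

    `candidates` maps a hypothetical action name to the count vectors we
    assume would result from taking that action.
    """
    return max(candidates, key=lambda a: total_entropy(candidates[a]))


# Illustrative, made-up counts: "right" evens out both distributions.
example = {
    "left": [[10, 0], [5, 5]],
    "right": [[5, 5], [5, 5]],
}
best = select_action(example)
```

With these made-up counts, "left" has total entropy 0 + 1 = 1 bit and "right" has 1 + 1 = 2 bits, so the sketch picks "right". How the distributions are actually defined and updated along trajectories is specified in the paper, not here.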

Anders Jonsson, Andrew G. Barto


Active Learning | Artificial Intelligence | Bayesian Network Representations | Dynamic Bayesian Network | SARA 2007



Post Info

Added	09 Jun 2010
Updated	09 Jun 2010
Type	Conference
Year	2007
Where	SARA
Authors	Anders Jonsson, Andrew G. Barto
