Sciweavers

71 search results - page 12 / 15
» A Causal Bayesian Network View of Reinforcement Learning
Sort
View
ICCV
2009
IEEE
16 years 2 months ago
Modelling Activity Global Temporal Dependencies using Time Delayed Probabilistic Graphical Model
We present a novel approach for detecting global behaviour anomalies in multiple disjoint cameras by learning time delayed dependencies between activities cross camera views. Sp...
Chen Change Loy, Tao Xiang and Shaogang Gong
ATAL
2009
Springer
15 years 4 months ago
Integrating organizational control into multi-agent learning
Multi-Agent Reinforcement Learning (MARL) algorithms suffer from slow convergence and even divergence, especially in largescale systems. In this work, we develop an organization-b...
Chongjie Zhang, Sherief Abdallah, Victor R. Lesser
UAI
2003
14 years 11 months ago
Probabilistic Models For Joint Clustering And Time-Warping Of Multidimensional Curves
In this paper we present a family of models and learning algorithms that can simultaneously align and cluster sets of multidimensional curves measured on a discrete time grid. Our...
Darya Chudova, Scott Gaffney, Padhraic Smyth
ICML
2010
IEEE
14 years 10 months ago
Continuous-Time Belief Propagation
Many temporal processes can be naturally modeled as a stochastic system that evolves continuously over time. The representation language of continuous-time Bayesian networks allow...
Tal El-Hay, Ido Cohn, Nir Friedman, Raz Kupferman
GECCO
2006
Springer
195views Optimization» more  GECCO 2006»
15 years 1 months ago
Studying XCS/BOA learning in Boolean functions: structure encoding and random Boolean functions
Recently, studies with the XCS classifier system on Boolean functions have shown that in certain types of functions simple crossover operators can lead to disruption and, conseque...
Martin V. Butz, Martin Pelikan