Sciweavers

2005 search results - page 342 / 401
» Decisive Markov Chains
Sort
View
ICPR
2004
IEEE
16 years 28 days ago
Complex Human Activity Recognition for Monitoring Wide Outdoor Environments
The problem of automatic recognition of human activities is among the most important and challenging open areas of research in Computer Vision. This paper presents a new approach ...
Arcangelo Distante, I. Gnoni, Marco Leo, Paolo Spa...
ICML
2009
IEEE
16 years 19 days ago
Predictive representations for policy gradient in POMDPs
We consider the problem of estimating the policy gradient in Partially Observable Markov Decision Processes (POMDPs) with a special class of policies that are based on Predictive ...
Abdeslam Boularias, Brahim Chaib-draa
ICML
2007
IEEE
16 years 19 days ago
Learning state-action basis functions for hierarchical MDPs
This paper introduces a new approach to actionvalue function approximation by learning basis functions from a spectral decomposition of the state-action manifold. This paper exten...
Sarah Osentoski, Sridhar Mahadevan
ICML
2007
IEEE
16 years 19 days ago
A recursive method for discriminative mixture learning
We consider the problem of learning density mixture models for classification. Traditional learning of mixtures for density estimation focuses on models that correctly represent t...
Minyoung Kim, Vladimir Pavlovic
ICML
2007
IEEE
16 years 19 days ago
Conditional random fields for multi-agent reinforcement learning
Conditional random fields (CRFs) are graphical models for modeling the probability of labels given the observations. They have traditionally been trained with using a set of obser...
Xinhua Zhang, Douglas Aberdeen, S. V. N. Vishwanat...