Sciweavers

1084 search results - page 92 / 217
» Hidden Markov Models with Multiple Observation Processes
Sort
View
COLT
2000
Springer
15 years 8 months ago
Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning
We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process (  ¢¡¤£¦¥§  ), and focus on gradient ascent approache...
Peter L. Bartlett, Jonathan Baxter
NIPS
2008
15 years 5 months ago
Bayesian Model of Behaviour in Economic Games
Classical game theoretic approaches that make strong rationality assumptions have difficulty modeling human behaviour in economic games. We investigate the role of finite levels o...
Debajyoti Ray, Brooks King-Casas, P. Read Montague...
NIPS
2001
15 years 5 months ago
Predictive Representations of State
We show that states of a dynamical system can be usefully represented by multi-step, action-conditional predictions of future observations. State representations that are grounded...
Michael L. Littman, Richard S. Sutton, Satinder P....
FLAIRS
2007
15 years 6 months ago
Probabilistic Interactive Installations
We present a description of two small audio/visual immersive installations. The main framework is an interactive structure that enables multiple participants to generate jazz impr...
Constance G. Baltera, Sara B. Smith, Judy A. Frank...
ICIP
2003
IEEE
16 years 5 months ago
Feature selection for unsupervised discovery of statistical temporal structures in video
We present algorithms for automatic feature selection for unsupervised structure discovery from video sequences. Feature selection in this scenario is hard because of the absence ...
Lexing Xie, Shih-Fu Chang, Ajay Divakaran, Huifang...