Sciweavers

4446 search results - page 740 / 890
» Learning Observer Agents
Sort
View
ICML
2008
IEEE
16 years 5 months ago
Estimating local optimums in EM algorithm over Gaussian mixture model
EM algorithm is a very popular iteration-based method to estimate the parameters of Gaussian Mixture Model from a large observation set. However, in most cases, EM algorithm is no...
Zhenjie Zhang, Bing Tian Dai, Anthony K. H. Tung
ICML
2008
IEEE
16 years 5 months ago
Exploration scavenging
We examine the problem of evaluating a policy in the contextual bandit setting using only observations collected during the execution of another policy. We show that policy evalua...
John Langford, Alexander L. Strehl, Jennifer Wortm...
ICML
2006
IEEE
16 years 5 months ago
A choice model with infinitely many latent features
Elimination by aspects (EBA) is a probabilistic choice model describing how humans decide between several options. The options from which the choice is made are characterized by b...
Carl Edward Rasmussen, Dilan Görür, Fran...
ICML
2006
IEEE
16 years 5 months ago
Predictive linear-Gaussian models of controlled stochastic dynamical systems
We introduce the controlled predictive linearGaussian model (cPLG), a model that uses predictive state to model discrete-time dynamical systems with real-valued observations and v...
Matthew R. Rudary, Satinder P. Singh
ICML
2006
IEEE
16 years 5 months ago
Probabilistic inference for solving discrete and continuous state Markov Decision Processes
Inference in Markov Decision Processes has recently received interest as a means to infer goals of an observed action, policy recognition, and also as a tool to compute policies. ...
Marc Toussaint, Amos J. Storkey