Sciweavers

7 search results - page 1 / 2
» Bayes Meets Bellman: The Gaussian Process Approach to Tempor...
Sort
View
ICML
2003
IEEE
14 years 5 months ago
Bayes Meets Bellman: The Gaussian Process Approach to Temporal Difference Learning
We present a novel Bayesian approach to the problem of value function estimation in continuous state spaces. We define a probabilistic generative model for the value function by i...
Yaakov Engel, Shie Mannor, Ron Meir
ICPR
2008
IEEE
13 years 11 months ago
Tracking human body by using particle filter Gaussian process Markov-switching model
The goal of this article is to present an effective and robust tracking algorithm for nonlinear feet motion by deploying particle filter integrated with Gaussian process latent v...
Jing Wang, Hong Man, Yafeng Yin
ICML
2007
IEEE
14 years 5 months ago
Bayesian actor-critic algorithms
We1 present a new actor-critic learning model in which a Bayesian class of non-parametric critics, using Gaussian process temporal difference learning is used. Such critics model ...
Mohammad Ghavamzadeh, Yaakov Engel
ICML
2006
IEEE
13 years 11 months ago
Automatic basis function construction for approximate dynamic programming and reinforcement learning
We address the problem of automatically constructing basis functions for linear approximation of the value function of a Markov Decision Process (MDP). Our work builds on results ...
Philipp W. Keller, Shie Mannor, Doina Precup
ECAI
2006
Springer
13 years 8 months ago
Least Squares SVM for Least Squares TD Learning
Abstract. We formulate the problem of least squares temporal difference learning (LSTD) in the framework of least squares SVM (LS-SVM). To cope with the large amount (and possible ...
Tobias Jung, Daniel Polani