Sciweavers

15 search results - page 2 / 3
» Incremental Least-Squares Temporal Difference Learning
Sort
View
ICDM
2009
IEEE
120views Data Mining» more  ICDM 2009»
14 years 6 days ago
Least Square Incremental Linear Discriminant Analysis
Abstract—Linear discriminant analysis (LDA) is a wellknown dimension reduction approach, which projects highdimensional data into a low-dimensional space with the best separation...
Li-Ping Liu, Yuan Jiang, Zhi-Hua Zhou
NIPS
2001
13 years 7 months ago
Model-Free Least-Squares Policy Iteration
We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...
Michail G. Lagoudakis, Ronald Parr
PKDD
2009
Springer
169views Data Mining» more  PKDD 2009»
14 years 2 days ago
Hybrid Least-Squares Algorithms for Approximate Policy Evaluation
The goal of approximate policy evaluation is to “best” represent a target value function according to a specific criterion. Temporal difference methods and Bellman residual m...
Jeffrey Johns, Marek Petrik, Sridhar Mahadevan
NECO
2011
13 years 15 days ago
Least Squares Estimation Without Priors or Supervision
Selection of an optimal estimator typically relies on either supervised training samples (pairs of measurements and their associated true values), or a prior probability model for...
Martin Raphan, Eero P. Simoncelli
ICML
2009
IEEE
14 years 6 months ago
Regularization and feature selection in least-squares temporal difference learning
We consider the task of reinforcement learning with linear value function approximation. Temporal difference algorithms, and in particular the Least-Squares Temporal Difference (L...
J. Zico Kolter, Andrew Y. Ng