Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

106

Voted

ECAI
2006
Springer

favoriteEmaildiscussreport

245views Artificial Intelligence» more ECAI 2006»

Least Squares SVM for Least Squares TD Learning

15 years 4 months ago

Least Squares SVM for Least Squares TD Learning

Download homepages.feis.herts.ac.uk

Abstract. We formulate the problem of least squares temporal difference learning (LSTD) in the framework of least squares SVM (LS-SVM). To cope with the large amount (and possible sequential nature) of training data arising in reinforcement learning we employ a subspace based variant of LS-SVM that sequentially processes the data and is hence especially suited for online learning. This approach is adapted from the context of Gaussian process regression and turns the unwieldy original optimization problem (with computational complexity being cubic in the number of processed data) into a reduced problem (with computional complexity being linear in the number of processed data). We introduce a QR decomposition based approach to solve the resulting generalized normal equations incrementally that is numerically more stable than existing recursive least squares based update algorithms. We also allow a forgetting factor in the updates to track non-stationary target functions (i.e. for the use...

Tobias Jung, Daniel Polani

Real-time Traffic

Artificial Intelligence | ECAI 2006 | Possible Sequential Nature | Squares Temporal Difference | Subspace Based Variant |

claim paper

Related Content

» Technical Update LeastSquares Temporal Difference Learning

» LeastSquares Temporal Difference Learning

» Efficient Reinforcement Learning Using Recursive LeastSquares Methods

» Regularized Least Squares Cancer Classifiers from DNA microarray data

» A Stagewise Least Square Loss Function for Classification

» Adaptive and Iterative Least Squares Support Vector Regression based on Quadratic Renyi En...

» Feature Selection for Microarray Data Using Least Squares SVM and Particle Swarm Optimizat...

» On partial least squares in head pose estimation How to simultaneously deal with misalignm...

» A LeastSquares Framework for Component Analysis

Post Info
More Details (n/a)

Added	22 Aug 2010
Updated	22 Aug 2010
Type	Conference
Year	2006
Where	ECAI
Authors	Tobias Jung, Daniel Polani

Comments (0)