Sciweavers

ECML
2004
Springer

Experiments in Value Function Approximation with Sparse Support Vector Regression

13 years 10 months ago
Experiments in Value Function Approximation with Sparse Support Vector Regression
Abstract. We present first experiments using Support Vector Regression as function approximator for an on-line, sarsa-like reinforcement learner. To overcome the batch nature of SVR two ideas are employed. The first is sparse greedy approximation: the data is projected onto the subspace spanned by only a small subset of the original data (in feature space). This subset can be built up in an on-line fashion. Second, we use the sparsified data to solve a reduced quadratic problem, where the number of variables is independent of the total number of training samples seen. The feasability of this approach is demonstrated on two common toy-problems.
Tobias Jung, Thomas Uthmann
Added 01 Jul 2010
Updated 01 Jul 2010
Type Conference
Year 2004
Where ECML
Authors Tobias Jung, Thomas Uthmann
Comments (0)