Sciweavers

ECAI 2006, Springer
Least Squares SVM for Least Squares TD Learning
Abstract. We formulate the problem of least squares temporal difference learning (LSTD) in the framework of least squares SVM (LS-SVM). To cope with the large amount (and possible ...
Tobias Jung, Daniel Polani