Sciweavers

CDC
2008
IEEE

Approximate dynamic programming using support vector regression

13 years 11 months ago
Approximate dynamic programming using support vector regression
— This paper presents a new approximate policy iteration algorithm based on support vector regression (SVR). It provides an overview of commonly used cost approximation architectures in approximate dynamic programming problems, explains some difficulties encountered by these architectures, and argues that SVR-based architectures can avoid some of these difficulties. A key contribution of this paper is to present an extension of the SVR problem to carry out approximate policy iteration by minimizing the Bellman error at selected states. The algorithm does not require trajectory simulations to be performed and is able to utilize a rich set of basis functions in a computationally efficient way. A proof of the algorithm’s correctness in the limit of sampling the entire state space is presented. Finally, computational results for two test problems are shown.
Brett Bethke, Jonathan P. How, Asuman E. Ozdaglar
Added 29 May 2010
Updated 29 May 2010
Type Conference
Year 2008
Where CDC
Authors Brett Bethke, Jonathan P. How, Asuman E. Ozdaglar
Comments (0)