Sciweavers

CDC
2010
IEEE
Pathologies of temporal difference methods in approximate dynamic programming
Approximate policy iteration methods based on temporal differences are popular in practice, and have been tested extensively, dating to the early nineties, but the associated conve...
Dimitri P. Bertsekas