Sciweavers

1118 search results - page 8 / 224
» Relational temporal difference learning
AAAI
2006
15 years 2 months ago
Incremental Least-Squares Temporal Difference Learning
Alborz Geramifard, Michael H. Bowling, Richard S. ...
NECO
2010
14 years 11 months ago
Hyperbolically Discounted Temporal Difference Learning
William H. Alexander, Joshua W. Brown
ICML
2008
IEEE
16 years 2 months ago
A worst-case comparison between temporal difference and residual gradient with linear function approximation
Residual gradient (RG) was proposed as an alternative to TD(0) for policy evaluation when function approximation is used, but there exists little formal analysis comparing them ex...
Lihong Li