Sciweavers

1340 search results - page 12 / 268
» Kalman Temporal Differences
Sort
View
ICML
1999
IEEE
16 years 16 days ago
Least-Squares Temporal Difference Learning
Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...
Justin A. Boyan
ISDA
2009
IEEE
15 years 6 months ago
Postponed Updates for Temporal-Difference Reinforcement Learning
This paper presents postponed updates, a new strategy for TD methods that can improve sample efficiency without incurring the computational and space requirements of model-based ...
Harm van Seijen, Shimon Whiteson
ECAI
2008
Springer
15 years 1 months ago
Using Decision Trees as the Answer Networks in Temporal Difference-Networks
Laura-Andreea Antanas, Kurt Driessens, Jan Ramon, ...