Sciweavers

ICML
1999
IEEE
14 years 6 months ago
Least-Squares Temporal Difference Learning
Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...
Justin A. Boyan