Sciweavers

5171 search results - page 1035 / 1035
» Deterministic Parallel Processing
Sort
View
96
Voted
ICML
1996
IEEE
15 years 10 months ago
Learning Evaluation Functions for Large Acyclic Domains
Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...
Justin A. Boyan, Andrew W. Moore
« Prev « First page 1035 / 1035 Last » Next »