Search Sciweavers | Sciweavers

5171 search results - page 1035 / 1035

» Deterministic Parallel Processing

231

click to vote

ICML
1996
IEEE

162views Machine Learning» more ICML 1996»

Learning Evaluation Functions for Large Acyclic Domains

16 years 8 months ago

Download www.ri.cmu.edu

Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...

Justin A. Boyan, Andrew W. Moore

claim paper

Read More »

« Prev « First page 1035 / 1035 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers