Search Sciweavers | Sciweavers

NIPS
1994

Generalization in Reinforcement Learning: Safely Approximating the Value Function

15 years 1 month ago

To appear in: G. Tesauro, D. S. Touretzky and T. K. Leen, eds., Advances in Neural Information Processing Systems 7, MIT Press, Cambridge MA, 1995. A straightforward approach to t...

Justin A. Boyan, Andrew W. Moore

ICML
2004
IEEE

Convergence of synchronous reinforcement learning with linear function approximation

16 years 21 days ago

Download www.machinelearning.org

Synchronous reinforcement learning (RL) algorithms with linear function approximation are representable as inhomogeneous matrix iterations of a special form (Schoknecht & Merk...

Artur Merke, Ralf Schoknecht

SIGIR
2011
ACM

Pseudo test collections for learning web search ranking functions

14 years 2 months ago

Download www.cs.umd.edu

Test collections are the primary drivers of progress in information retrieval. They provide a yardstick for assessing the effectiveness of ranking functions in an automatic, rapi...

Nima Asadi, Donald Metzler, Tamer Elsayed, Jimmy L...

BMCBI
2007

Exploring inconsistencies in genome-wide protein function annotations: a machine learning approach

15 years 14 hours ago

Download www.biomedcentral.com

Background: Incorrectly annotated sequence data are becoming more commonplace as databases increasingly rely on automated techniques for annotation. Hence, there is an urgent need...

Carson M. Andorf, Drena Dobbs, Vasant Honavar

ICML
1996
IEEE

Learning Evaluation Functions for Large Acyclic Domains

16 years 21 days ago

Download www.ri.cmu.edu

Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...

Justin A. Boyan, Andrew W. Moore

