Search Sciweavers | Sciweavers

397 search results - page 57 / 80

» Reinforcement Learning with Hierarchies of Machines

113

click to vote

ALT
1994
Springer

152views Machine Learning» more ALT 1994»

Program Synthesis in the Presence of Infinite Number of Inaccuracies

15 years 3 months ago

Download www.comp.nus.edu.sg

Most studies modeling inaccurate data in Gold style learning consider cases in which the number of inaccuracies is finite. The present paper argues that this approach is not reaso...

Sanjay Jain

claim paper

Read More »

103

click to vote

ICML
2009
IEEE

186views Machine Learning» more ICML 2009»

Regularization and feature selection in least-squares temporal difference learning

16 years 17 days ago

Download ai.stanford.edu

We consider the task of reinforcement learning with linear value function approximation. Temporal difference algorithms, and in particular the Least-Squares Temporal Difference (L...

J. Zico Kolter, Andrew Y. Ng

claim paper

Read More »

click to vote

ICMLA
2010

161views Machine Learning» more ICMLA 2010»

Robust Learning for Adaptive Programs by Leveraging Program Structure

14 years 9 months ago

Download web.engr.oregonstate.edu

Abstract--We study how to effectively integrate reinforcement learning (RL) and programming languages via adaptation-based programming, where programs can include non-deterministic...

Jervis Pinto, Alan Fern, Tim Bauer, Martin Erwig

claim paper

Read More »

105

click to vote

ML
2000
ACM

150views Machine Learning» more ML 2000»

Adaptive Retrieval Agents: Internalizing Local Context and Scaling up to the Web

14 years 11 months ago

Download informatics.indiana.edu

This paper discusses a novel distributed adaptive algorithm and representation used to construct populations of adaptive Web agents. These InfoSpiders browse networked information ...

Filippo Menczer, Richard K. Belew

claim paper

Read More »

click to vote

COLT
1989
Springer

126views Machine Learning» more COLT 1989»

Learning in the Presence of Inaccurate Information

15 years 3 months ago

Download www.comp.nus.edu.sg

The present paper considers the eﬀects of introducing inaccuracies in a learner’s environment in Gold’s learning model of identiﬁcation in the limit. Three kinds of inaccu...

Mark A. Fulk, Sanjay Jain

claim paper

Read More »

« Prev « First page 57 / 80 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers