Search Sciweavers | Sciweavers

16 search results - page 3 / 4

» Theoretical Results on Reinforcement Learning with Temporall...

ML
2000
ACM

150 views, Machine Learning

Adaptive Retrieval Agents: Internalizing Local Context and Scaling up to the Web

13 years 6 months ago

Download informatics.indiana.edu

This paper discusses a novel distributed adaptive algorithm and representation used to construct populations of adaptive Web agents. These InfoSpiders browse networked information ...

Filippo Menczer, Richard K. Belew

ECAI
2006
Springer

245 views, Artificial Intelligence

Least Squares SVM for Least Squares TD Learning

13 years 10 months ago

Download homepages.feis.herts.ac.uk

Abstract. We formulate the problem of least squares temporal difference learning (LSTD) in the framework of least squares SVM (LS-SVM). To cope with the large amount (and possible ...

Tobias Jung, Daniel Polani

CORR
2010
Springer

152 views, Education

Neuroevolutionary optimization

13 years 6 months ago

Download jmlr.csail.mit.edu

Temporal difference methods are theoretically grounded and empirically effective methods for addressing reinforcement learning problems. In most real-world reinforcement learning ...

Eva Volná

SIGCSE
2000
ACM

132 views, Education

Empirical investigation throughout the CS curriculum

13 years 10 months ago

Download www.dave-reed.com

Empirical skills are playing an increasingly important role in the computing profession and our society. But while traditional computer science curricula are effective in teaching...

David W. Reed, Craig S. Miller, Grant Braught

PAMI
2007

186 views

Value-Directed Human Behavior Analysis from Video Using Partially Observable Markov Decision Processes

13 years 5 months ago

Download people.ee.duke.edu

This paper presents a method for learning decision theoretic models of human behaviors from video data. Our system learns relationships between the movements of a person, the co...

Jesse Hoey, James J. Little
