Search Sciweavers | Sciweavers

223 search results - page 2 / 45

» Least-Squares Temporal Difference Learning

click to vote

NIPS
2001

206views Information Technology» more NIPS 2001»

Model-Free Least-Squares Policy Iteration

13 years 5 months ago

Download www.cs.duke.edu

We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...

Michail G. Lagoudakis, Ronald Parr

claim paper

Read More »

click to vote

ICML
2009
IEEE

186views Machine Learning» more ICML 2009»

Regularization and feature selection in least-squares temporal difference learning

14 years 5 months ago

Download ai.stanford.edu

We consider the task of reinforcement learning with linear value function approximation. Temporal difference algorithms, and in particular the Least-Squares Temporal Difference (L...

J. Zico Kolter, Andrew Y. Ng

claim paper

Read More »

click to vote

PKDD
2009
Springer

169views Data Mining» more PKDD 2009»

Hybrid Least-Squares Algorithms for Approximate Policy Evaluation

13 years 11 months ago

Download www.cs.umass.edu

The goal of approximate policy evaluation is to “best” represent a target value function according to a speciﬁc criterion. Temporal difference methods and Bellman residual m...

Jeffrey Johns, Marek Petrik, Sridhar Mahadevan

claim paper

Read More »

click to vote

SIBGRAPI
2009
IEEE

232views Computer Graphics» more SIBGRAPI 2009»

Learning Discriminative Appearance-Based Models Using Partial Least Squares

13 years 11 months ago

Download www.umiacs.umd.edu

Appearance information is essential for applications such as tracking and people recognition. One of the main problems of using appearance-based discriminative models is the ambig...

William Robson Schwartz, Larry S. Davis

claim paper

Read More »

click to vote

NIPS
1998

131views Information Technology» more NIPS 1998»

Lazy Learning Meets the Recursive Least Squares Algorithm

13 years 5 months ago

Download www.swarm-bots.org

Lazy learning is a memory-based technique that, once a query is received, extracts a prediction interpolating locally the neighboring examples of the query which are considered re...

Mauro Birattari, Gianluca Bontempi, Hugues Bersini

claim paper

Read More »

« Prev « First page 2 / 45 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers