Search Sciweavers | Sciweavers

161 search results - page 6 / 33

» Least Squares SVM for Least Squares TD Learning

113

click to vote

ICML
2010
IEEE

219views Machine Learning» more ICML 2010»

Convergence of Least Squares Temporal Difference Methods Under General Conditions

15 years 3 months ago

Download www.cs.helsinki.fi

We consider approximate policy evaluation for finite state and action Markov decision processes (MDP) in the off-policy learning context and with the simulation-based least square...

Huizhen Yu

claim paper

Read More »

136

click to vote

NIPS
2001

206views Information Technology» more NIPS 2001»

Model-Free Least-Squares Policy Iteration

15 years 3 months ago

Download www.cs.duke.edu

We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...

Michail G. Lagoudakis, Ronald Parr

claim paper

Read More »

108

Voted

GECCO
2008
Springer

172views Optimization» more GECCO 2008»

Recursive least squares and quadratic prediction in continuous multistep problems

15 years 3 months ago

Download www.cs.bham.ac.uk

XCS with computed prediction, namely XCSF, has been recently extended in several ways. In particular, a novel prediction update algorithm based on recursive least squares and the ...

Daniele Loiacono, Pier Luca Lanzi

claim paper

Read More »

click to vote

CORR
2008
Springer

69views Education» more CORR 2008»

Solving Time of Least Square Systems in Sigma-Pi Unit Networks

15 years 2 months ago

Download hal.archives-ouvertes.fr

The solving of least square systems is a useful operation in neurocomputational modeling of learning, pattern matching, and pattern recognition. In these last two cases, the soluti...

Pierre Courrieu

claim paper

Read More »

101

click to vote

ICML
2006
IEEE

116views Machine Learning» more ICML 2006»

Efficient co-regularised least squares regression

16 years 2 months ago

Download www.cs.uni-potsdam.de

In many applications, unlabelled examples are inexpensive and easy to obtain. Semisupervised approaches try to utilise such examples to reduce the predictive error. In this paper,...

Stefan Wrobel, Thomas Gärtner, Tobias Scheffe...

claim paper

Read More »

« Prev « First page 6 / 33 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers