Search Sciweavers | Sciweavers

A simple learning rule is derived, the VAPS algorithm, which can be instantiated to generate a wide range of new reinforcementlearning algorithms. These algorithms solve a number ...

Leemon C. Baird III, Andrew W. Moore

claim paper

Read More »

177

click to vote

ICML
2001
IEEE

146views Machine Learning» more ICML 2001»

A Generalized Kalman Filter for Fixed Point Approximation and Efficient Temporal Difference Learning

16 years 7 months ago

Download www.stanford.edu

David Choi, Benjamin Van Roy

claim paper

Read More »

177

click to vote

AUSAI
2005
Springer

123views Artificial Intelligence» more AUSAI 2005»

Global Versus Local Constructive Function Approximation for On-Line Reinforcement Learning

16 years 17 days ago

Download eprints.utas.edu.au

: In order to scale to problems with large or continuous state-spaces, reinforcement learning algorithms need to be combined with function approximation techniques. The majority of...

Peter Vamplew, Robert Ollington

claim paper

Read More »

« Prev « First page 34 / 360 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers