Search Sciweavers | Sciweavers

113 search results - page 3 / 23

» Predictive State Temporal Difference Learning

click to vote

EUROPAR
2006
Springer

107views Distributed And Parallel Com...» more EUROPAR 2006»

Comparison of Different Methods for Next Location Prediction

13 years 9 months ago

Download www.informatik.uni-augsburg.de

Next location prediction anticipates a person's movement based on the history of previous sojourns. It is useful for proactive actions taken to assist the person in an ubiquit...

Jan Petzold, Faruk Bagci, Wolfgang Trumler, Theo U...

claim paper

Read More »

click to vote

ICML
2010
IEEE

219views Machine Learning» more ICML 2010»

Convergence of Least Squares Temporal Difference Methods Under General Conditions

13 years 6 months ago

Download www.cs.helsinki.fi

We consider approximate policy evaluation for finite state and action Markov decision processes (MDP) in the off-policy learning context and with the simulation-based least square...

Huizhen Yu

claim paper

Read More »

click to vote

COLT
2000
Springer

121views Machine Learning» more COLT 2000»

Bias-Variance Error Bounds for Temporal Difference Updates

13 years 10 months ago

Download www.cis.upenn.edu

We give the ﬁrst rigorous upper bounds on the error of temporal difference (td) algorithms for policy evaluation as a function of the amount of experience. These upper bounds pr...

Michael J. Kearns, Satinder P. Singh

claim paper

Read More »

click to vote

ACSC
2008
IEEE

108views Theoretical Computer Science» more ACSC 2008»

An investigation of the state formation and transition limitations for prediction problems in recurrent neural networks

13 years 7 months ago

Download crpit.com

Recurrent neural networks are able to store information about previous as well as current inputs. This "memory" allows them to solve temporal problems such as language r...

Angel Kennedy, Cara MacNish

claim paper

Read More »

click to vote

NIPS
2001

206views Information Technology» more NIPS 2001»

Model-Free Least-Squares Policy Iteration

13 years 7 months ago

Download www.cs.duke.edu

We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...

Michail G. Lagoudakis, Ronald Parr

claim paper

Read More »

« Prev « First page 3 / 23 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers