Search Sciweavers | Sciweavers

102 search results - page 3 / 21

» Efficient Asymptotic Approximation in Temporal Difference Le...

COLT
2000
Springer

121views Machine Learning» more COLT 2000»

Bias-Variance Error Bounds for Temporal Difference Updates

13 years 10 months ago

Download www.cis.upenn.edu

We give the first rigorous upper bounds on the error of temporal difference (TD) algorithms for policy evaluation as a function of the amount of experience. These upper bounds pr...

Michael J. Kearns, Satinder P. Singh

CDC
2010
IEEE

136views Control Systems» more CDC 2010»

Pathologies of temporal difference methods in approximate dynamic programming

13 years 23 days ago

Download web.mit.edu

Approximate policy iteration methods based on temporal differences are popular in practice, and have been tested extensively, dating to the early nineties, but the associated conve...

Dimitri P. Bertsekas

AAAI
2006

161views Intelligent Agents» more AAAI 2006»

Sample-Efficient Evolutionary Function Approximation for Reinforcement Learning

13 years 7 months ago

Download staff.science.uva.nl

Reinforcement learning problems are commonly tackled with temporal difference methods, which attempt to estimate the agent's optimal value function. In most real-world proble...

Shimon Whiteson, Peter Stone

CORR
2010
Springer

204views Education» more CORR 2010»

Predictive State Temporal Difference Learning

13 years 4 months ago

Download www.cs.cmu.edu

We propose a new approach to value function approximation which combines linear temporal difference reinforcement learning with subspace identification. In practical applications...

Byron Boots, Geoffrey J. Gordon

EWRL
2008

191views Machine Learning» more EWRL 2008»

Bayesian Reward Filtering

13 years 7 months ago

Download www.metz.supelec.fr

A wide variety of function approximation schemes have been applied to reinforcement learning. However, Bayesian filtering approaches, which have been shown efficient in other field...

Matthieu Geist, Olivier Pietquin, Gabriel Fricout
