Sciweavers

1118 search results - page 8 / 224

» Relational temporal difference learning

88

NIPS
2007

86views Information Technology» more NIPS 2007»

Temporal Difference Updating without a Learning Rate

15 years 7 months ago

Temporal Difference Updating without a Learning Rate

Download www.vetta.org

Marcus Hutter, Shane Legg

claim paper

Read More »

121

AAAI
2006

144views Intelligent Agents» more AAAI 2006»

Incremental Least-Squares Temporal Difference Learning

15 years 7 months ago

Incremental Least-Squares Temporal Difference Learning

Download webdocs.cs.ualberta.ca

Alborz Geramifard, Michael H. Bowling, Richard S. ...

claim paper

Read More »

127

AAMAS
2010
Springer

190views Intelligent Agents» more AAMAS 2010»

Critical factors in the empirical performance of temporal difference and evolutionary methods for reinforcement learning

15 years 6 months ago

Critical factors in the empirical performance of temporal difference and evolutionary methods for reinforcement learning

Download www.cs.utexas.edu

Shimon Whiteson, Matthew E. Taylor, Peter Stone

claim paper

Read More »

97

NECO
2010

52views more NECO 2010»

Hyperbolically Discounted Temporal Difference Learning

15 years 4 months ago

Hyperbolically Discounted Temporal Difference Learning

Download ccsrv1.psych.indiana.edu

William H. Alexander, Joshua W. Brown

claim paper

Read More »

151

ICML
2008
IEEE

165views Machine Learning» more ICML 2008»

A worst-case comparison between temporal difference and residual gradient with linear function approximation

16 years 6 months ago

A worst-case comparison between temporal difference and residual gradient with linear function approximation

Download www.research.rutgers.edu

Residual gradient (RG) was proposed as an alternative to TD(0) for policy evaluation when function approximation is used, but there exists little formal analysis comparing them ex...

Lihong Li

claim paper

Read More »

« Prev « First page 8 / 224 Last » Next »