Investigating practical, linear temporal difference learning

Off-policy reinforcement learning has many applications, including learning from demonstration, learning multiple goal-seeking policies in parallel, and representing predictive knowledge. Recently there has been a proliferation of new policy-evaluation algorithms that fill a longstanding algorithmic void in reinforcement learning: combining robustness to off-policy sampling, function approximation, linear complexity, and temporal difference (TD) updates. This paper contains two main contributions. First, we derive two new hybrid TD policy-evaluation algorithms, which fill a gap in this collection of algorithms. Second, we perform an empirical comparison to elicit which of these new linear TD methods should be preferred in different situations, and make concrete suggestions about practical use.

Keywords: reinforcement learning; temporal difference learning; off-policy learning
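To make the setting concrete, here is a minimal sketch of the kind of update the abstract refers to: linear-complexity TD(0) policy evaluation, with per-decision importance-sampling ratios to handle off-policy data. This is an illustrative baseline, not the hybrid algorithms derived in the paper; the function name and array layout are assumptions for the example.

```python
import numpy as np

def offpolicy_linear_td0(features, rewards, next_features, rhos,
                         alpha=0.01, gamma=0.99):
    """Importance-sampled linear TD(0) policy evaluation (illustrative sketch).

    features, next_features : (T, d) arrays of state feature vectors
    rewards : (T,) array of rewards
    rhos    : (T,) importance-sampling ratios pi(a|s) / mu(a|s)
              (all ones recovers on-policy TD(0))
    Returns the learned weight vector w of shape (d,),
    so the value estimate is v(s) ≈ w · phi(s).
    """
    d = features.shape[1]
    w = np.zeros(d)
    for phi, r, phi_next, rho in zip(features, rewards, next_features, rhos):
        # TD error: bootstrapped target minus current estimate
        delta = r + gamma * np.dot(w, phi_next) - np.dot(w, phi)
        # O(d) update, scaled by the importance-sampling ratio
        w += alpha * rho * delta * phi
    return w
```

Each update costs O(d) time and memory, which is the "linear complexity" property the abstract highlights; the methods compared in the paper (e.g. gradient-TD variants) retain this cost while adding stability guarantees that plain importance-sampled TD(0) lacks.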
Adam M. White, Martha White
Added 31 Mar 2016
Updated 31 Mar 2016
Type Journal
Year 2016
Where CoRR
Authors Adam M. White, Martha White