Replacing eligibility trace for action-value learning with function approximation


Download www.dice.ucl.ac.be

The eligibility trace is one of the most widely used mechanisms for speeding up reinforcement learning. Earlier experiments suggest that replacing eligibility traces perform better than accumulating eligibility traces. However, replacing traces are currently not applicable to function approximation methods in which states are not represented uniquely by binary values. This paper proposes two modifications to replacing traces that overcome this limitation. Experimental results from the Mountain-Car task indicate that the new replacing traces outperform both the accumulating and the ‘ordinary’ replacing traces.
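To make the distinction in the abstract concrete, the following is a minimal sketch of the two textbook trace updates for a linear feature vector. It is not the paper's method: the two proposed modifications are not described in the abstract and are not reproduced here. The function names, the three-feature setup, and the values of `gamma` and `lam` are assumptions chosen only for illustration.

```python
def accumulating_trace(trace, features, gamma, lam):
    """Decay every trace, then add the current feature values."""
    return [gamma * lam * e + x for e, x in zip(trace, features)]


def replacing_trace(trace, features, gamma, lam):
    """Decay every trace, but reset the trace of each active binary feature to 1."""
    return [1.0 if x == 1 else gamma * lam * e for e, x in zip(trace, features)]


# Hypothetical setup: three binary features, the first one active twice in a row.
gamma, lam = 1.0, 0.9
features = [1, 0, 0]

acc = [0.0, 0.0, 0.0]
rep = [0.0, 0.0, 0.0]
for _ in range(2):
    acc = accumulating_trace(acc, features, gamma, lam)
    rep = replacing_trace(rep, features, gamma, lam)

# The accumulating trace of the revisited feature grows past 1,
# while the replacing trace is capped at 1.
print(acc[0], rep[0])

# Hypothetical non-binary features (for example, graded activations):
# no feature equals 1, so the reset never fires and the trace stays at zero.
graded = replacing_trace([0.0, 0.0, 0.0], [0.3, 0.7, 0.0], gamma, lam)
print(graded)
```

In this sketch the `x == 1` test only makes sense when features are binary. With graded activations the replacing rule as written never resets anything, which is one way to see the limitation the abstract refers to.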

Kary Främling


Eligibility Traces | ESANN 2007 | Function Approximation Methods | Neural Networks | Reinforcement Learning


Post Info

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2007
Where	ESANN
Authors	Kary Främling
