Sciweavers

1233 search results - page 159 / 247
» Reinforcement Learning in MirrorBot
Sort
View
ICAART
2010
INSTICC
15 years 7 months ago
A Cautious Approach to Generalization in Reinforcement Learning
Raphael Fonteneau, Susan A. Murphy, Louis Wehenkel...
ISDA
2009
IEEE
15 years 4 months ago
Postponed Updates for Temporal-Difference Reinforcement Learning
This paper presents postponed updates, a new strategy for TD methods that can improve sample efficiency without incurring the computational and space requirements of model-based ...
Harm van Seijen, Shimon Whiteson