Sciweavers

1235 search results - page 114 / 247
» Reinforcement learning in a nutshell
Sort
View
ICAART
2010
INSTICC
15 years 8 months ago
A Reinforcement Learning Approach for Multiagent Navigation
Francisco Martinez-Gil, Fernando Barber, Miguel Lo...
ICAART
2010
INSTICC
15 years 8 months ago
A Cautious Approach to Generalization in Reinforcement Learning
Raphael Fonteneau, Susan A. Murphy, Louis Wehenkel...
ISDA
2009
IEEE
15 years 5 months ago
Postponed Updates for Temporal-Difference Reinforcement Learning
This paper presents postponed updates, a new strategy for TD methods that can improve sample efficiency without incurring the computational and space requirements of model-based ...
Harm van Seijen, Shimon Whiteson