Sciweavers

DIS
2007
Springer
13 years 5 months ago
Reducing Trials by Thinning-Out in Skill Discovery
In this paper, we propose a new concept, thinning-out, for reducing the number of trials in skill discovery. Thinning-out means to skip over such trials that are unlikely to improv...
Hayato Kobayashi, Kohei Hatano, Akira Ishino, Ayum...
ICML
2006
IEEE
14 years 4 months ago
An intrinsic reward mechanism for efficient exploration
How should a reinforcement learning agent act if its sole purpose is to efficiently learn an optimal policy for later use? In other words, how should it explore, to be able to exp...
Özgür Simsek, Andrew G. Barto