Sciweavers

6 search results - page 2 / 2
» Tuning Bandit Algorithms in Stochastic Environments
Sort
View
COLT
2010
Springer
13 years 3 months ago
Open Loop Optimistic Planning
We consider the problem of planning in a stochastic and discounted environment with a limited numerical budget. More precisely, we investigate strategies exploring the set of poss...
Sébastien Bubeck, Rémi Munos