Sciweavers

16 search results - page 4 / 4
» Deviations of Stochastic Bandit Regret
Sort
View
COLT
2010
Springer
13 years 2 months ago
Open Loop Optimistic Planning
We consider the problem of planning in a stochastic and discounted environment with a limited numerical budget. More precisely, we investigate strategies exploring the set of poss...
Sébastien Bubeck, Rémi Munos