Sciweavers

2 search results - page 1 / 1
» The value of information in multi-armed bandits with exponen...
Sort
View
COLT
2010
Springer
13 years 2 months ago
Best Arm Identification in Multi-Armed Bandits
We consider the problem of finding the best arm in a stochastic multi-armed bandit game. The regret of a forecaster is here defined by the gap between the mean reward of the optim...
Jean-Yves Audibert, Sébastien Bubeck, R&eac...