Search Sciweavers | Sciweavers

14

ALT
2011
Springer

259views Machine Learning» more ALT 2011»

12 years 4 months ago

This paper studies the deviations of the regret in a stochastic multi-armed bandit problem. When the total number of plays n is known beforehand by the agent, Audibert et al. (2009...

Antoine Salomon, Jean-Yves Audibert

claim paper

Read More »

8

click to vote

ALT
2009
Springer

128views Machine Learning» more ALT 2009»

Pure Exploration in Multi-armed Bandits Problems

14 years 1 months ago

Download sequel.futurs.inria.fr

Abstract. We consider the framework of stochastic multi-armed bandit problems and study the possibilities and limitations of strategies that explore sequentially the arms. The stra...

Sébastien Bubeck, Rémi Munos, Gilles...

claim paper

Read More »

11

click to vote

ALT
2007
Springer

134views Machine Learning» more ALT 2007»

Tuning Bandit Algorithms in Stochastic Environments

14 years 1 months ago

Download www.sztaki.hu

Algorithms based on upper-conﬁdence bounds for balancing exploration and exploitation are gaining popularity since they are easy to implement, eﬃcient and eﬀective. In this p...

Jean-Yves Audibert, Rémi Munos, Csaba Szepe...

claim paper

Read More »

13

click to vote

CORR
2012
Springer

192views Education» more CORR 2012»

The best of both worlds: stochastic and adversarial bandits

12 years 1 days ago

Download www.princeton.edu

We present a bandit algorithm, SAO (Stochastic and Adversarial Optimal), whose regret is, essentially, optimal both for adversarial rewards and for stochastic rewards. Speciﬁcal...

Sébastien Bubeck, Aleksandrs Slivkins

claim paper

Read More »

15

click to vote

JMLR
2010

103views more JMLR 2010»

Regret Bounds and Minimax Policies under Partial Monitoring

12 years 11 months ago

Download jmlr.csail.mit.edu

This work deals with four classical prediction settings, namely full information, bandit, label efficient and bandit label efficient as well as four different notions of regret: p...

Jean-Yves Audibert, Sébastien Bubeck

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers