Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

29

ALT
2008
Springer

favoriteEmaildiscussreport

171views Machine Learning» more ALT 2008»

Active Learning in Multi-armed Bandits

14 years 6 months ago

Active Learning in Multi-armed Bandits

Download www.sztaki.hu

In this paper we consider the problem of actively learning the mean values of distributions associated with a ﬁnite number of options (arms). The algorithms can select which option to generate the next sample from in order to produce estimates with equally good precision for all the distributions. When an algorithm uses sample means to estimate the unknown values then the optimal solution, assuming full knowledge of the distributions, is to sample each option proportional to its variance. In this paper we propose an incremental algorithm that asymptotically achieves the same loss as an optimal rule. We prove that the excess loss suﬀered by this algorithm, apart from logarithmic factors, scales as n−3/2 , which we conjecture to be the optimal rate. The performance of the algorithm is illustrated in a simple problem.

András Antos, Varun Grover, Csaba Szepesv&a

Real-time Traffic

Algorithm Uses Sample | ALT 2008 | Incremental Algorithm | Machine Learning | Optimal Rule |

claim paper

Related Content

» The NonBayesian Restless MultiArmed Bandit a Case of NearLogarithmic Regret

» MultiArmed Bandits in Metric Spaces

» Stochastic scheduling of active support vector learning algorithms

» Learning in A Changing World NonBayesian Restless MultiArmed Bandit

» Exploiting Similarity Information in Reinforcement Learning Similarity Models for MultiAr...

» Online Algorithms for the MultiArmed Bandit Problem with Markovian Rewards

» Best Arm Identification in MultiArmed Bandits

» MultiArmed Bandit Mechanisms for MultiSlot Sponsored Search Auctions

» On the Combinatorial MultiArmed Bandit Problem with Markovian Rewards

» An Optimal Dynamic Mechanism for MultiArmed Bandit Processes

Post Info
More Details (n/a)

Added	14 Mar 2010
Updated	14 Mar 2010
Type	Conference
Year	2008
Where	ALT
Authors	András Antos, Varun Grover, Csaba Szepesvári

Comments (0)