Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

9

NIPS
2008

favoriteEmaildiscussreport

116views Information Technology» more NIPS 2008»

Algorithms for Infinitely Many-Armed Bandits

13 years 6 months ago

Algorithms for Infinitely Many-Armed Bandits

Download www.stat.lsa.umich.edu

We consider multi-armed bandit problems where the number of arms is larger than the possible number of experiments. We make a stochastic assumption on the mean-reward of a new selected arm which characterizes its probability of being a near-optimal arm. Our assumption is weaker than in previous works. We describe algorithms based on upper-confidence-bounds applied to a restricted set of randomly selected arms and provide upper-bounds on the resulting expected regret. We also derive a lower-bound which matches (up to a logarithmic factor) the upper-bound in some cases.

Yizao Wang, Jean-Yves Audibert, Rémi Munos

Real-time Traffic

Information Technology | Multi-armed Bandit Problems | Near-optimal Arm | NIPS 2008 | Stochastic Assumption |

claim paper

Related Content

» Nearly optimal explorationexploitation decision thresholds

» Playing games with approximation algorithms

» An Optimal Dynamic Mechanism for MultiArmed Bandit Processes

» Nearly Tight Bounds for the ContinuumArmed Bandit Problem

» Adapting to a Changing Environment the Brownian Restless Bandits

» Open Loop Optimistic Planning

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2008
Where	NIPS
Authors	Yizao Wang, Jean-Yves Audibert, Rémi Munos

Comments (0)