X-Armed Bandits

13 years 1 months ago

Download eprints.pascal-network.org

We consider a generalization of stochastic bandit problems where the set of arms, X, is allowed to be a generic topological space. We constraint the mean-payoff function with a dissimilarity function over X in a way that is more general than Lipschitz. We construct an arm selection policy whose regret improves upon previous result for a large class of problems. In particular, our results imply that if X is the unit hypercube in a Euclidean space and the mean-payoff function has a finite number of global maxima around which the behavior of the function is locally H

Sébastien Bubeck, Rémi Munos, Gilles

Real-time Traffic

CORR 2010 | Education | Generic Topological Space | Mean-payoff Functions | Stochastic Bandit Problems |

claim paper

Post Info
More Details (n/a)

Added	21 Mar 2011
Updated	21 Mar 2011
Type	Journal
Year	2010
Where	CORR
Authors	Sébastien Bubeck, Rémi Munos, Gilles Stoltz, Csaba Szepesvári

Comments (0)

Sciweavers

X-Armed Bandits

CORR 2010 | Education | Generic Topological Space | Mean-payoff Functions | Stochastic Bandit Problems |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers