Search Sciweavers | Sciweavers

27 search results - page 2 / 6

» Improved Rates for the Stochastic Continuum-Armed Bandit Pro...

CORR
2010
Springer

X-Armed Bandits

13 years 2 months ago

Download eprints.pascal-network.org

We consider a generalization of stochastic bandit problems where the set of arms, X, is allowed to be a generic topological space. We constrain the mean-payoff function with a di...

Sébastien Bubeck, Rémi Munos, Gilles...

ALT
2007
Springer

Tuning Bandit Algorithms in Stochastic Environments

14 years 2 months ago

Download www.sztaki.hu

Algorithms based on upper-confidence bounds for balancing exploration and exploitation are gaining popularity since they are easy to implement, efficient and effective. In this p...

Jean-Yves Audibert, Rémi Munos, Csaba Szepe...

COLT
2010
Springer

Best Arm Identification in Multi-Armed Bandits

13 years 4 months ago

Download www.di.ens.fr

We consider the problem of finding the best arm in a stochastic multi-armed bandit game. The regret of a forecaster is here defined by the gap between the mean reward of the optim...

Jean-Yves Audibert, Sébastien Bubeck, Ré...

COLT
2008
Springer

Adapting to a Changing Environment: the Brownian Restless Bandits

13 years 7 months ago

Download research.microsoft.com

In the multi-armed bandit (MAB) problem there are k distributions associated with the rewards of playing each of k strategies (slot machine arms). The reward distributions are ini...

Aleksandrs Slivkins, Eli Upfal

CORR
2010
Springer

Gaussian Process Bandits for Tree Search

13 years 6 months ago

Download www.mendeley.com

We motivate and analyse a new Tree Search algorithm, based on recent advances in the use of Gaussian Processes for bandit problems. We assume that the function to maximise on the ...

Louis Dorard, John Shawe-Taylor
