Search Sciweavers | Sciweavers

» Regret Bounds for Gaussian Process Bandit Problems

CORR
2004
Springer

Online convex optimization in the bandit setting: gradient descent without a gradient

15 years 5 months ago

Download www.cs.cmu.edu

We study a general online convex optimization problem. We have a convex set S and an unknown sequence of cost functions c_1, c_2, ..., and in each period, we choose a feasible po...

Abraham Flaxman, Adam Tauman Kalai, H. Brendan McM...

CORR
2006
Springer

How to Beat the Adaptive Multi-Armed Bandit

15 years 6 months ago

Download people.cs.uchicago.edu

The multi-armed bandit is a concise model for the problem of iterated decision-making under uncertainty. In each round, a gambler must pull one of K arms of a slot machine, withou...

Varsha Dani, Thomas P. Hayes

CORR
2007
Springer

Bandit Algorithms for Tree Search

15 years 5 months ago

Download hal.inria.fr

Bandit based methods for tree search have recently gained popularity when applied to huge trees, e.g. in the game of Go [6]. Their efficient exploration of the tree enables to ret...

Pierre-Arnaud Coquelin, Rémi Munos

SIGMOD
2012
ACM

Interactive regret minimization

13 years 8 months ago

Download personal.denison.edu

We study the notion of regret ratio proposed in [19] to deal with multi-criteria decision making in database systems. The regret minimization query proposed in [19] was shown to h...

Danupon Nanongkai, Ashwin Lall, Atish Das Sarma, K...

TSP
2010

Distributed learning in multi-armed bandit with multiple players

15 years 18 days ago

Download www.ece.ucdavis.edu

We formulate and study a decentralized multi-armed bandit (MAB) problem. There are distributed players competing for independent arms. Each arm, when played, offers i.i.d. reward a...

Keqin Liu, Qing Zhao
