COLT
2010
Springer

Optimal Algorithms for Online Convex Optimization with Multi-Point Bandit Feedback

13 years 2 months ago

Bandit convex optimization is a special case of online convex optimization with partial information. In this setting, a player attempts to minimize a sequence of adversarially gen...

Alekh Agarwal, Ofer Dekel, Lin Xiao

COLT
2008
Springer

High-Probability Regret Bounds for Bandit Online Linear Optimization

13 years 6 months ago

Download colt2008.cs.helsinki.fi

We present a modification of the algorithm of Dani et al. [8] for the online linear optimization problem in the bandit setting, which with high probability has regret at most O ( ...

Peter L. Bartlett, Varsha Dani, Thomas P. Hayes, S...

CORR
2012
Springer

Towards minimax policies for online linear optimization with bandit feedback

12 years 9 days ago

Download www.princeton.edu

We address the online linear optimization problem with bandit feedback. Our contribution is twofold. First, we provide an algorithm (based on exponential weights) with a regret of...

Sébastien Bubeck, Nicolò Cesa-Bianch...

CORR
2004
Springer

Online convex optimization in the bandit setting: gradient descent without a gradient

13 years 4 months ago

Download www.cs.cmu.edu

We study a general online convex optimization problem. We have a convex set S and an unknown sequence of cost functions c1, c2, ..., and in each period, we choose a feasible po...

Abraham Flaxman, Adam Tauman Kalai, H. Brendan McM...

NIPS
2008

Online Optimization in X-Armed Bandits

13 years 6 months ago