Search Sciweavers | Sciweavers

7 search results - page 1 / 2


COLT 2004, Springer (Machine Learning, 78 views)

Online Geometric Optimization in the Bandit Setting Against an Adaptive Adversary

13 years 10 months ago

Download www.cs.cmu.edu

We give an algorithm for the bandit version of a very general online optimization problem considered by Kalai and Vempala [1], for the case of an adaptive adversary. In this proble...

H. Brendan McMahan, Avrim Blum


COLT 2008, Springer (Machine Learning, 124 views)

High-Probability Regret Bounds for Bandit Online Linear Optimization

13 years 6 months ago

Download colt2008.cs.helsinki.fi

We present a modification of the algorithm of Dani et al. [8] for the online linear optimization problem in the bandit setting, which with high probability has regret at most O ( ...

Peter L. Bartlett, Varsha Dani, Thomas P. Hayes, S...


CORR 2004, Springer (Education, 103 views)

Online convex optimization in the bandit setting: gradient descent without a gradient

13 years 4 months ago

Download www.cs.cmu.edu

We study a general online convex optimization problem. We have a convex set S and an unknown sequence of cost functions c_1, c_2, ..., and in each period, we choose a feasible po...

Abraham Flaxman, Adam Tauman Kalai, H. Brendan McM...


COLT 2008, Springer (Machine Learning, 140 views)

Regret Bounds for Sleeping Experts and Bandits

13 years 6 months ago

Download colt2008.cs.helsinki.fi

We study on-line decision problems where the set of actions that are available to the decision algorithm varies over time. With a few notable exceptions, such problems remained larg...

Robert D. Kleinberg, Alexandru Niculescu-Mizil, Yo...


STOC 2007, ACM (Algorithms, 146 views)

Playing games with approximation algorithms

14 years 5 months ago

Download www.cc.gatech.edu

In an online linear optimization problem, on each period t, an online algorithm chooses s_t ∈ S from a fixed (possibly infinite) set S of feasible decisions. Nature (who may be adve...

Sham M. Kakade, Adam Tauman Kalai, Katrina Ligett
