Sciweavers

184

COLT
2008
Springer

124views Machine Learning» more COLT 2008»

High-Probability Regret Bounds for Bandit Online Linear Optimization

15 years 9 months ago

Download colt2008.cs.helsinki.fi

We present a modification of the algorithm of Dani et al. [8] for the online linear optimization problem in the bandit setting, which with high probability has regret at most O ( ...

Peter L. Bartlett, Varsha Dani, Thomas P. Hayes, S...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers