Search Sciweavers | Sciweavers

9 search results - page 1 / 2

» Optimal Algorithms for Online Convex Optimization with Multi...

click to vote

COLT
2010
Springer

217views Machine Learning» more COLT 2010»

Optimal Algorithms for Online Convex Optimization with Multi-Point Bandit Feedback

13 years 2 months ago

Download www.eecs.berkeley.edu

Bandit convex optimization is a special case of online convex optimization with partial information. In this setting, a player attempts to minimize a sequence of adversarially gen...

Alekh Agarwal, Ofer Dekel, Lin Xiao

claim paper

Read More »

click to vote

NIPS
2004

136views Information Technology» more NIPS 2004»

Nearly Tight Bounds for the Continuum-Armed Bandit Problem

13 years 6 months ago

Download books.nips.cc

In the multi-armed bandit problem, an online algorithm must choose from a set of strategies in a sequence of n trials so as to minimize the total cost of the chosen strategies. Wh...

Robert D. Kleinberg

claim paper

Read More »

click to vote

CORR
2012
Springer

210views Education» more CORR 2012»

Towards minimax policies for online linear optimization with bandit feedback

12 years 9 days ago

Download www.princeton.edu

We address the online linear optimization problem with bandit feedback. Our contribution is twofold. First, we provide an algorithm (based on exponential weights) with a regret of...

Sébastien Bubeck, Nicolò Cesa-Bianch...

claim paper

Read More »

click to vote

CORR
2004
Springer

103views Education» more CORR 2004»

Online convex optimization in the bandit setting: gradient descent without a gradient

13 years 4 months ago

Download www.cs.cmu.edu

We study a general online convex optimization problem. We have a convex set S and an unknown sequence of cost functions c1, c2, . . . , and in each period, we choose a feasible po...

Abraham Flaxman, Adam Tauman Kalai, H. Brendan McM...

claim paper

Read More »

click to vote

ICML
2009
IEEE

170views Machine Learning» more ICML 2009»

Interactively optimizing information retrieval systems as a dueling bandits problem

14 years 5 months ago

Download www.yisongyue.com

We present an on-line learning framework tailored towards real-time learning from observed user behavior in search engines and other information retrieval systems. In particular, ...

Yisong Yue, Thorsten Joachims

claim paper

Read More »

« Prev « First page 1 / 2 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers