Search Sciweavers | Sciweavers

227 search results - page 2 / 46

» Linearly Parameterized Bandits

COLT
2008
Springer

Machine Learning

High-Probability Regret Bounds for Bandit Online Linear Optimization

13 years 7 months ago

Download colt2008.cs.helsinki.fi

We present a modification of the algorithm of Dani et al. [8] for the online linear optimization problem in the bandit setting, which with high probability has regret at most O ( ...

Peter L. Bartlett, Varsha Dani, Thomas P. Hayes, S...

CORR
2012
Springer

Education

Towards minimax policies for online linear optimization with bandit feedback

12 years 29 days ago

Download www.princeton.edu

We address the online linear optimization problem with bandit feedback. Our contribution is twofold. First, we provide an algorithm (based on exponential weights) with a regret of...

Sébastien Bubeck, Nicolò Cesa-Bianch...

CORR
2010
Springer

Education

Combinatorial Network Optimization with Unknown Variables: Multi-Armed Bandits with Linear Rewards

13 years 6 days ago

Download ceng.usc.edu

In the classic multi-armed bandits problem, the goal is to have a policy for dynamically operating arms that each yield stochastic rewards with unknown means. The key metric of int...

Yi Gai, Bhaskar Krishnamachari, Rahul Jain

COLT
2008
Springer

Machine Learning

Stochastic Linear Optimization under Bandit Feedback

13 years 7 months ago

Download people.cs.uchicago.edu

Varsha Dani, Thomas P. Hayes, Sham M. Kakade

ICML
2001
IEEE

Machine Learning

Expectation Maximization for Weakly Labeled Data

14 years 6 months ago

Download characters.media.mit.edu

We call data weakly labeled if it has no exact label but rather a numerical indication of correctness of the label "guessed" by the learning algorithm - a situation comm...

Yuri A. Ivanov, Bruce Blumberg, Alex Pentland
