Search Sciweavers | Sciweavers

We introduce an efficient algorithm for the problem of online linear optimization in the bandit setting which achieves the optimal O*(√T) regret. The setting is a natural general...

Jacob Abernethy, Elad Hazan, Alexander Rakhlin

ALT
2006
Springer

Hannan Consistency in On-Line Learning in Case of Unbounded Losses Under Partial Monitoring

15 years 5 months ago

Download www.szit.bme.hu

In this paper the sequential prediction problem with expert advice is considered when the loss is unbounded under partial monitoring scenarios. We deal with a wide class of the par...

Chamy Allenberg, Peter Auer, László ...

CORR
2008
Springer

Multi-Armed Bandits in Metric Spaces

15 years 1 months ago

Download www.cs.cornell.edu

In a multi-armed bandit problem, an online algorithm chooses from a set of strategies in a sequence of n trials so as to maximize the total payoff of the chosen strategies. While ...

Robert Kleinberg, Aleksandrs Slivkins, Eli Upfal

STOC
2007
ACM

Playing games with approximation algorithms

16 years 2 months ago

Download www.cc.gatech.edu

In an online linear optimization problem, on each period t, an online algorithm chooses s_t ∈ S from a fixed (possibly infinite) set S of feasible decisions. Nature (who may be adve...

Sham M. Kakade, Adam Tauman Kalai, Katrina Ligett
