Sciweavers

77 search results - page 1 / 16
» Efficient bandit algorithms for online multiclass prediction
Sort
View
73
Voted
ICML
2008
IEEE
16 years 1 months ago
Efficient bandit algorithms for online multiclass prediction
Sham M. Kakade, Shai Shalev-Shwartz, Ambuj Tewari
103
Voted
COLT
2008
Springer
15 years 2 months ago
Competing in the Dark: An Efficient Algorithm for Bandit Linear Optimization
We introduce an efficient algorithm for the problem of online linear optimization in the bandit setting which achieves the optimal O ( T) regret. The setting is a natural general...
Jacob Abernethy, Elad Hazan, Alexander Rakhlin
125
Voted
ALT
2006
Springer
15 years 4 months ago
Hannan Consistency in On-Line Learning in Case of Unbounded Losses Under Partial Monitoring
In this paper the sequential prediction problem with expert advice is considered when the loss is unbounded under partial monitoring scenarios. We deal with a wide class of the par...
Chamy Allenberg, Peter Auer, László ...
114
Voted
CORR
2008
Springer
136views Education» more  CORR 2008»
15 years 1 months ago
Multi-Armed Bandits in Metric Spaces
In a multi-armed bandit problem, an online algorithm chooses from a set of strategies in a sequence of n trials so as to maximize the total payoff of the chosen strategies. While ...
Robert Kleinberg, Aleksandrs Slivkins, Eli Upfal
STOC
2007
ACM
146views Algorithms» more  STOC 2007»
16 years 1 months ago
Playing games with approximation algorithms
In an online linear optimization problem, on each period t, an online algorithm chooses st S from a fixed (possibly infinite) set S of feasible decisions. Nature (who may be adve...
Sham M. Kakade, Adam Tauman Kalai, Katrina Ligett