Towards minimax policies for online linear optimization with bandit feedback

14 years 2 days ago

Download www.princeton.edu

We address the online linear optimization problem with bandit feedback. Our contribution is twofold. First, we provide an algorithm (based on exponential weights) with a regret of order √ dn log N for any ﬁnite action set with N actions, under the assumption that the instan

Sébastien Bubeck, Nicolò Cesa-Bianch

Real-time Traffic

CORR 2012 | Education | Exponential Weights | Linear Optimization | Optimization Problem |

claim paper

Post Info
More Details (n/a)

Added	20 Apr 2012
Updated	20 Apr 2012
Type	Journal
Year	2012
Where	CORR
Authors	Sébastien Bubeck, Nicolò Cesa-Bianchi, Sham M. Kakade

Comments (0)

Sciweavers

Towards minimax policies for online linear optimization with bandit feedback

CORR 2012 | Education | Exponential Weights | Linear Optimization | Optimization Problem |

Explore & Download

Productivity Tools

Sciweavers