Search Sciweavers | Sciweavers


COLT
2010
Springer


Optimal Algorithms for Online Convex Optimization with Multi-Point Bandit Feedback

14 years 12 months ago

Bandit convex optimization is a special case of online convex optimization with partial information. In this setting, a player attempts to minimize a sequence of adversarially gen...

Alekh Agarwal, Ofer Dekel, Lin Xiao


COLT
2010
Springer


An Asymptotically Optimal Bandit Algorithm for Bounded Support Models

14 years 12 months ago

Download www.colt2010.org

The multiarmed bandit problem is a typical example of the dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...

Junya Honda, Akimichi Takemura


COLT
2010
Springer


Convex Games in Banach Spaces

14 years 12 months ago

Download www.cs.utexas.edu

We study the regret of an online learner playing a multi-round game in a Banach space B against an adversary that plays a convex function at each round. We characterize the minima...

Karthik Sridharan, Ambuj Tewari


COLT
2010
Springer


Strongly Non-U-Shaped Learning Results by General Techniques

14 years 12 months ago

Download www.colt2010.org

In learning, a semantic or behavioral U-shape occurs when a learner first learns, then unlearns, and, finally, relearns, some target concept (on the way to success). Within the framew...

John Case, Timo Kötzing


COLT
2010
Springer


Open Loop Optimistic Planning

14 years 12 months ago

Download www.colt2010.org

We consider the problem of planning in a stochastic and discounted environment with a limited numerical budget. More precisely, we investigate strategies exploring the set of poss...

Sébastien Bubeck, Rémi Munos

