Search Sciweavers | Sciweavers

109 search results - page 2 / 22

» Algorithm Selection as a Bandit Problem with Unbounded Losse...


ALT
2009
Springer

176 views, Machine Learning

The Follow Perturbed Leader Algorithm Protected from Unbounded One-Step Losses

14 years 2 months ago

Download www.iitp.ru

In this paper the sequential prediction problem with expert advice is considered for the case when the losses of experts suffered at each step can be unbounded. We present some mo...

Vladimir V. V'yugin


JMLR
2011

137 views

Online Learning in Case of Unbounded Losses Using Follow the Perturbed Leader Algorithm

13 years 4 months ago

Download jmlr.csail.mit.edu

In this paper the sequential prediction problem with expert advice is considered for the case where losses of experts suffered at each step cannot be bounded in advance. We presen...

Vladimir V. V'yugin


CP
2006
Springer

121 views, Artificial Intelligence

A Simple Distribution-Free Approach to the Max k-Armed Bandit Problem

14 years 1 month ago

Download www.cs.cmu.edu

The max k-armed bandit problem is a recently-introduced online optimization problem with practical applications to heuristic search. Given a set of k slot machines, each yielding p...

Matthew J. Streeter, Stephen F. Smith


CORR
2010
Springer

127 views, Education

Online Algorithms for the Multi-Armed Bandit Problem with Markovian Rewards

13 years 9 months ago

Download wireless.cs.uh.edu

We consider the classical multi-armed bandit problem with Markovian rewards. When played, an arm changes its state in a Markovian fashion, while it remains frozen when not played. Th...

Cem Tekin, Mingyan Liu


COLT
2010
Springer

217 views, Machine Learning

Optimal Algorithms for Online Convex Optimization with Multi-Point Bandit Feedback

13 years 7 months ago

Download www.eecs.berkeley.edu

Bandit convex optimization is a special case of online convex optimization with partial information. In this setting, a player attempts to minimize a sequence of adversarially gen...

Alekh Agarwal, Ofer Dekel, Lin Xiao
