Search Sciweavers | Sciweavers

7 search results - page 1 / 2

» Pure Exploration in Multi-armed Bandits Problems

click to vote

NIPS
2008

130views Information Technology» more NIPS 2008»

Mortal Multi-Armed Bandits

13 years 6 months ago

Download www.cs.cmu.edu

We formulate and study a new variant of the k-armed bandit problem, motivated by e-commerce applications. In our model, arms have (stochastic) lifetime after which they expire. In...

Deepayan Chakrabarti, Ravi Kumar, Filip Radlinski,...

claim paper

Read More »

click to vote

CORR
2010
Springer

127views Education» more CORR 2010»

Online Algorithms for the Multi-Armed Bandit Problem with Markovian Rewards

13 years 4 months ago

Download wireless.cs.uh.edu

We consider the classical multi-armed bandit problem with Markovian rewards. When played an arm changes its state in a Markovian fashion while it remains frozen when not played. Th...

Cem Tekin, Mingyan Liu

claim paper

Read More »

click to vote

COLT
2010
Springer

191views Machine Learning» more COLT 2010»

Best Arm Identification in Multi-Armed Bandits

13 years 2 months ago

Download www.di.ens.fr

We consider the problem of finding the best arm in a stochastic multi-armed bandit game. The regret of a forecaster is here defined by the gap between the mean reward of the optim...

Jean-Yves Audibert, Sébastien Bubeck, R&eac...

claim paper

Read More »

click to vote

CORR
2010
Springer

189views Education» more CORR 2010»

An Optimal Dynamic Mechanism for Multi-Armed Bandit Processes

13 years 4 months ago

Download research.microsoft.com

We consider the problem of revenue-optimal dynamic mechanism design in settings where agents' types evolve over time as a function of their (both public and private) experien...

Sham M. Kakade, Ilan Lobel, Hamid Nazerzadeh

claim paper

Read More »

click to vote

SAC
2005
ACM

149views Applied Computing» more SAC 2005»

Stochastic scheduling of active support vector learning algorithms

13 years 10 months ago

Download www-users.cs.umn.edu

Active learning is a generic approach to accelerate training of classiﬁers in order to achieve a higher accuracy with a small number of training examples. In the past, simple ac...

Gaurav Pandey, Himanshu Gupta, Pabitra Mitra

claim paper

Read More »

« Prev « First page 1 / 2 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers