Sciweavers

3 search results - page 1 / 1
» Multi-armed Bandit Algorithms and Empirical Evaluation
Sort
View
NIPS
2008
13 years 6 months ago
Mortal Multi-Armed Bandits
We formulate and study a new variant of the k-armed bandit problem, motivated by e-commerce applications. In our model, arms have (stochastic) lifetime after which they expire. In...
Deepayan Chakrabarti, Ravi Kumar, Filip Radlinski,...
ECML
2005
Springer
13 years 10 months ago
Multi-armed Bandit Algorithms and Empirical Evaluation
The multi-armed bandit problem for a gambler is to decide which arm of a K-slot machine to pull to maximize his total reward in a series of trials. Many real-world learning and opt...
Joannès Vermorel, Mehryar Mohri
ICML
2003
IEEE
14 years 5 months ago
Online Choice of Active Learning Algorithms
This paper is concerned with the question of how to online combine an ensemble of active learners so as to expedite the learning progress during a pool-based active learning sessi...
Yoram Baram, Ran El-Yaniv, Kobi Luz