Mortal Multi-Armed Bandits

13 years 10 months ago

Download www.cs.cmu.edu

We formulate and study a new variant of the k-armed bandit problem, motivated by e-commerce applications. In our model, arms have (stochastic) lifetime after which they expire. In this setting an algorithm needs to continuously explore new arms, in contrast to the standard k-armed bandit model in which arms are available indefinitely and exploration is reduced once an optimal arm is identified with nearcertainty. The main motivation for our setting is online-advertising, where ads have limited lifetime due to, for example, the nature of their content and their campaign budgets. An algorithm needs to choose among a large collection of ads, more than can be fully explored within the typical ad lifetime. We present an optimal algorithm for the state-aware (deterministic reward function) case, and build on this technique to obtain an algorithm for the state-oblivious (stochastic reward function) case. Empirical studies on various reward distributions, including one derived from a real-wor...

Deepayan Chakrabarti, Ravi Kumar, Filip Radlinski,

Real-time Traffic

Algorithms | Information Technology | K-armed Bandit | K-armed Bandit Problem | NIPS 2008 |

claim paper

Related Content

» MultiArmed Bandits in Metric Spaces

» MultiArmed Bandit Mechanisms for MultiSlot Sponsored Search Auctions

» How to Beat the Adaptive MultiArmed Bandit

» Online Algorithms for the MultiArmed Bandit Problem with Markovian Rewards

» Learning in A Changing World NonBayesian Restless MultiArmed Bandit

» Combinatorial Network Optimization with Unknown Variables MultiArmed Bandits with Linear R...

» The NonBayesian Restless MultiArmed Bandit a Case of NearLogarithmic Regret

» Best Arm Identification in MultiArmed Bandits

» On the Combinatorial MultiArmed Bandit Problem with Markovian Rewards

Post Info
More Details (n/a)

Added	30 Oct 2010
Updated	30 Oct 2010
Type	Conference
Year	2008
Where	NIPS
Authors	Deepayan Chakrabarti, Ravi Kumar, Filip Radlinski, Eli Upfal

Comments (0)

Sciweavers

Mortal Multi-Armed Bandits

Algorithms | Information Technology | K-armed Bandit | K-armed Bandit Problem | NIPS 2008 |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers