Search Sciweavers | Sciweavers

17 search results - page 2 / 4

» Multi-armed bandit problems with dependent arms

CORR
2008
Springer

136 views | Education

Multi-Armed Bandits in Metric Spaces

15 years 6 months ago

Download www.cs.cornell.edu

In a multi-armed bandit problem, an online algorithm chooses from a set of strategies in a sequence of n trials so as to maximize the total payoff of the chosen strategies. While ...

Robert Kleinberg, Aleksandrs Slivkins, Eli Upfal

NIPS
2008

130 views | Information Technology

Mortal Multi-Armed Bandits

15 years 7 months ago

Download www.cs.cmu.edu

We formulate and study a new variant of the k-armed bandit problem, motivated by e-commerce applications. In our model, arms have (stochastic) lifetime after which they expire. In...

Deepayan Chakrabarti, Ravi Kumar, Filip Radlinski,...

CORR
2010
Springer

187 views | Education

Learning in A Changing World: Non-Bayesian Restless Multi-Armed Bandit

15 years 6 months ago

Download www.ece.ucdavis.edu

We consider the restless multi-armed bandit (RMAB) problem with unknown dynamics. In this problem, at each time, a player chooses K out of N (N > K) arms to play. The state of ...

Haoyang Liu, Keqin Liu, Qing Zhao

CORR
2010
Springer

127 views | Education

Online Algorithms for the Multi-Armed Bandit Problem with Markovian Rewards

15 years 6 months ago

Download wireless.cs.uh.edu

We consider the classical multi-armed bandit problem with Markovian rewards. When played, an arm changes its state in a Markovian fashion, while it remains frozen when not played. Th...

Cem Tekin, Mingyan Liu

CORR
2006
Springer

83 views | Education

How to Beat the Adaptive Multi-Armed Bandit

15 years 6 months ago

Download people.cs.uchicago.edu

The multi-armed bandit is a concise model for the problem of iterated decision-making under uncertainty. In each round, a gambler must pull one of K arms of a slot machine, withou...

Varsha Dani, Thomas P. Hayes
