Search Sciweavers | Sciweavers

CORR
2010
Springer

127 views, Education

Online Algorithms for the Multi-Armed Bandit Problem with Markovian Rewards

13 years 4 months ago

We consider the classical multi-armed bandit problem with Markovian rewards. When played, an arm changes its state in a Markovian fashion, while it remains frozen when not played. Th...

Cem Tekin, Mingyan Liu

COLT
2010
Springer

191 views, Machine Learning

Best Arm Identification in Multi-Armed Bandits

13 years 2 months ago

Download www.di.ens.fr

We consider the problem of finding the best arm in a stochastic multi-armed bandit game. The regret of a forecaster is here defined by the gap between the mean reward of the optim...

Jean-Yves Audibert, Sébastien Bubeck, Ré...

Publication

466 views

Multi-Armed Bandit Mechanisms for Multi-Slot Sponsored Search Auctions

14 years 3 months ago

Download arxiv.org

In pay-per-click sponsored search auctions, which are currently extensively used by search engines, the auction for a keyword involves a certain number of advertisers (say k) c...

Akash Das Sarma, Sujit Gujar, Y. Narahari

posted by sujit

CORR
2010
Springer

175 views, Education

On the Combinatorial Multi-Armed Bandit Problem with Markovian Rewards

12 years 11 months ago

Download ceng.usc.edu

We consider a combinatorial generalization of the classical multi-armed bandit problem that is defined as follows. There is a given bipartite graph of M users and N ≥ M resources. F...

Yi Gai, Bhaskar Krishnamachari, Mingyan Liu

CORR
2010
Springer

189 views, Education

An Optimal Dynamic Mechanism for Multi-Armed Bandit Processes

13 years 4 months ago

Download research.microsoft.com

We consider the problem of revenue-optimal dynamic mechanism design in settings where agents' types evolve over time as a function of their (both public and private) experien...

Sham M. Kakade, Ilan Lobel, Hamid Nazerzadeh
