Search Sciweavers | Sciweavers

17 search results - page 1 / 4

» Distributed learning in multi-armed bandit with multiple pla...

223

click to vote

CORR
2010
Springer

143views Education» more CORR 2010»

The Non-Bayesian Restless Multi-Armed Bandit: a Case of Near-Logarithmic Regret

15 years 4 months ago

Download www.ece.ucdavis.edu

In the classic Bayesian restless multi-armed bandit (RMAB) problem, there are N arms, with rewards on all arms evolving at each time as Markov chains with known parameters. A play...

Wenhan Dai, Yi Gai, Bhaskar Krishnamachari, Qing Z...

claim paper

Read More »

225

Voted

CORR
2010
Springer

187views Education» more CORR 2010»

Learning in A Changing World: Non-Bayesian Restless Multi-Armed Bandit

15 years 7 months ago

Download www.ece.ucdavis.edu

We consider the restless multi-armed bandit (RMAB) problem with unknown dynamics. In this problem, at each time, a player chooses K out of N (N > K) arms to play. The state of ...

Haoyang Liu, Keqin Liu, Qing Zhao

claim paper

Read More »

203

Voted

CORR
2010
Springer

127views Education» more CORR 2010»

Online Algorithms for the Multi-Armed Bandit Problem with Markovian Rewards

15 years 7 months ago

Download wireless.cs.uh.edu

We consider the classical multi-armed bandit problem with Markovian rewards. When played an arm changes its state in a Markovian fashion while it remains frozen when not played. Th...

Cem Tekin, Mingyan Liu

claim paper

Read More »

216

Voted

COLT
2010
Springer

191views Machine Learning» more COLT 2010»

Best Arm Identification in Multi-Armed Bandits

15 years 5 months ago

Download www.di.ens.fr

We consider the problem of finding the best arm in a stochastic multi-armed bandit game. The regret of a forecaster is here defined by the gap between the mean reward of the optim...

Jean-Yves Audibert, Sébastien Bubeck, R&eac...

claim paper

Read More »

272

click to vote

CORR
2010
Springer

189views Education» more CORR 2010»

An Optimal Dynamic Mechanism for Multi-Armed Bandit Processes

15 years 7 months ago

Download research.microsoft.com

We consider the problem of revenue-optimal dynamic mechanism design in settings where agents' types evolve over time as a function of their (both public and private) experien...

Sham M. Kakade, Ilan Lobel, Hamid Nazerzadeh

claim paper

Read More »

« Prev « First page 1 / 4 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers