Search Sciweavers | Sciweavers

15

CORR
2010
Springer

143views Education» more CORR 2010»

The Non-Bayesian Restless Multi-Armed Bandit: a Case of Near-Logarithmic Regret

13 years 1 months ago

In the classic Bayesian restless multi-armed bandit (RMAB) problem, there are N arms, with rewards on all arms evolving at each time as Markov chains with known parameters. A play...

Wenhan Dai, Yi Gai, Bhaskar Krishnamachari, Qing Z...

claim paper

Read More »

19

click to vote

CORR
2008
Springer

136views Education» more CORR 2008»

Multi-Armed Bandits in Metric Spaces

13 years 4 months ago

Download www.cs.cornell.edu

In a multi-armed bandit problem, an online algorithm chooses from a set of strategies in a sequence of n trials so as to maximize the total payoff of the chosen strategies. While ...

Robert Kleinberg, Aleksandrs Slivkins, Eli Upfal

claim paper

Read More »

13

click to vote

SAC
2005
ACM

149views Applied Computing» more SAC 2005»

Stochastic scheduling of active support vector learning algorithms

13 years 10 months ago

Download www-users.cs.umn.edu

Active learning is a generic approach to accelerate training of classiﬁers in order to achieve a higher accuracy with a small number of training examples. In the past, simple ac...

Gaurav Pandey, Himanshu Gupta, Pabitra Mitra

claim paper

Read More »

19

click to vote

CORR
2010
Springer

187views Education» more CORR 2010»

Learning in A Changing World: Non-Bayesian Restless Multi-Armed Bandit

13 years 4 months ago

Download www.ece.ucdavis.edu

We consider the restless multi-armed bandit (RMAB) problem with unknown dynamics. In this problem, at each time, a player chooses K out of N (N > K) arms to play. The state of ...

Haoyang Liu, Keqin Liu, Qing Zhao

claim paper

Read More »

12

click to vote

ICAART
2010
INSTICC

222views Intelligent Agents» more ICAART 2010»

14 years 1 months ago