Search Sciweavers | Sciweavers

44 search results - page 2 / 9

» A structured multiarmed bandit problem and the greedy policy

click to vote

CDC
2009
IEEE

123views Control Systems» more CDC 2009»

On the myopic policy for a class of restless bandit problems with applications in dynamic multichannel access

15 years 4 months ago

Download www.ece.ucdavis.edu

We consider a class of restless multi-armed bandit problems that arises in multi-channel opportunistic communications, where channels are modeled as independent and stochastically...

Keqin Liu, Qing Zhao

claim paper

Read More »

click to vote

TSP
2010

170views Artificial Intelligence» more TSP 2010»

Distributed learning in multi-armed bandit with multiple players

14 years 6 months ago

Download www.ece.ucdavis.edu

We formulate and study a decentralized multi-armed bandit (MAB) problem. There are distributed players competing for independent arms. Each arm, when played, offers i.i.d. reward a...

Keqin Liu, Qing Zhao

claim paper

Read More »

click to vote

ICASSP
2010
IEEE

224views Signal Processing» more ICASSP 2010»

Distributed learning in cognitive radio networks: Multi-armed bandit with distributed multiple players

14 years 11 months ago

Download www.ece.ucdavis.edu

—We consider a cognitive radio network with distributed multiple secondary users, where each user independently searches for spectrum opportunities in multiple channels without e...

Keqin Liu, Qing Zhao

claim paper

Read More »

118

click to vote

AGI
2011

231views Artificial Intelligence» more AGI 2011»

Reinforcement Learning and the Bayesian Control Rule

14 years 3 months ago

Download metatip.com

We present an actor-critic scheme for reinforcement learning in complex domains. The main contribution is to show that planning and I/O dynamics can be separated such that an intra...

Pedro Alejandro Ortega, Daniel Alexander Braun, Si...

claim paper

Read More »

107

click to vote

COLT
2003
Springer

121views Machine Learning» more COLT 2003»

Lower Bounds on the Sample Complexity of Exploration in the Multi-armed Bandit Problem

15 years 4 months ago

Download www.ece.mcgill.ca

We consider the Multi-armed bandit problem under the PAC (“probably approximately correct”) model. It was shown by Even-Dar et al. [5] that given n arms, it suﬃces to play th...

Shie Mannor, John N. Tsitsiklis

claim paper

Read More »

« Prev « First page 2 / 9 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers