Search Sciweavers | Sciweavers

44 search results - page 1 / 9

» A structured multiarmed bandit problem and the greedy policy

click to vote

CDC
2008
IEEE

104views Control Systems» more CDC 2008»

A structured multiarmed bandit problem and the greedy policy

13 years 11 months ago

Download web.mit.edu

—We consider a multiarmed bandit problem where the expected reward of each arm is a linear function of an unknown scalar with a prior distribution. The objective is to choose a s...

Adam J. Mersereau, Paat Rusmevichientong, John N. ...

claim paper

Read More »

click to vote

SDM
2007
SIAM

167views Data Mining» more SDM 2007»

Bandits for Taxonomies: A Model-based Approach

13 years 6 months ago

Download www.cs.cmu.edu

We consider a novel problem of learning an optimal matching, in an online fashion, between two feature spaces that are organized as taxonomies. We formulate this as a multi-armed ...

Sandeep Pandey, Deepak Agarwal, Deepayan Chakrabar...

claim paper

Read More »

click to vote

ICML
2007
IEEE

139views Machine Learning» more ICML 2007»

Multi-armed bandit problems with dependent arms

14 years 5 months ago

Download www.cs.cmu.edu

We provide a framework to exploit dependencies among arms in multi-armed bandit problems, when the dependencies are in the form of a generative model on clusters of arms. We find ...

Sandeep Pandey, Deepayan Chakrabarti, Deepak Agarw...

claim paper

Read More »

click to vote

ECML
2005
Springer

105views Machine Learning» more ECML 2005»

Multi-armed Bandit Algorithms and Empirical Evaluation

13 years 10 months ago

Download www.cs.nyu.edu

The multi-armed bandit problem for a gambler is to decide which arm of a K-slot machine to pull to maximize his total reward in a series of trials. Many real-world learning and opt...

Joannès Vermorel, Mehryar Mohri

claim paper

Read More »

click to vote

CORR
2010
Springer

143views Education» more CORR 2010»

The Non-Bayesian Restless Multi-Armed Bandit: a Case of Near-Logarithmic Regret

13 years 1 months ago

Download www.ece.ucdavis.edu

In the classic Bayesian restless multi-armed bandit (RMAB) problem, there are N arms, with rewards on all arms evolving at each time as Markov chains with known parameters. A play...

Wenhan Dai, Yi Gai, Bhaskar Krishnamachari, Qing Z...

claim paper

Read More »

« Prev « First page 1 / 9 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers