Search Sciweavers | Sciweavers

82 search results - page 1 / 17

CORR
2010
Springer

An Optimal Dynamic Mechanism for Multi-Armed Bandit Processes

13 years 4 months ago

Download research.microsoft.com

We consider the problem of revenue-optimal dynamic mechanism design in settings where agents' types evolve over time as a function of their (both public and private) experien...

Sham M. Kakade, Ilan Lobel, Hamid Nazerzadeh

CORR
2010
Springer

Combinatorial Network Optimization with Unknown Variables: Multi-Armed Bandits with Linear Rewards

12 years 11 months ago

Download ceng.usc.edu

In the classic multi-armed bandits problem, the goal is to have a policy for dynamically operating arms that each yield stochastic rewards with unknown means. The key metric of int...

Yi Gai, Bhaskar Krishnamachari, Rahul Jain

CORR
2010
Springer

Learning in A Changing World: Non-Bayesian Restless Multi-Armed Bandit

13 years 4 months ago

Download www.ece.ucdavis.edu

We consider the restless multi-armed bandit (RMAB) problem with unknown dynamics. In this problem, at each time, a player chooses K out of N (N > K) arms to play. The state of ...

Haoyang Liu, Keqin Liu, Qing Zhao

CORR
2010
Springer

The Non-Bayesian Restless Multi-Armed Bandit: a Case of Near-Logarithmic Regret

13 years 1 month ago

Download www.ece.ucdavis.edu

In the classic Bayesian restless multi-armed bandit (RMAB) problem, there are N arms, with rewards on all arms evolving at each time as Markov chains with known parameters. A play...

Wenhan Dai, Yi Gai, Bhaskar Krishnamachari, Qing Z...

GECCO
2010
Springer

Toward comparison-based adaptive operator selection

13 years 9 months ago

Download hal.archives-ouvertes.fr

Adaptive Operator Selection (AOS) turns the impacts of the applications of variation operators into Operator Selection through a Credit Assignment mechanism. However, most Credit ...

Álvaro Fialho, Marc Schoenauer, Michè...
