Sciweavers

22 search results - page 4 / 5
» Contextual Multi-Armed Bandits
Sort
View
CORR
2008
Springer
78views Education» more  CORR 2008»
13 years 5 months ago
Characterizing Truthful Multi-Armed Bandit Mechanisms
Moshe Babaioff, Yogeshwer Sharma, Aleksandrs Slivk...
AMAI
2011
Springer
12 years 5 months ago
Multi-armed bandits with episode context
A multi-armed bandit episode consists of n trials, each allowing selection of one of K arms, resulting in payoff from a distribution over [0, 1] associated with that arm. We assum...
Christopher D. Rosin
JMLR
2012
11 years 8 months ago
Contextual Bandit Learning with Predictable Rewards
Contextual bandit learning is a reinforcement learning problem where the learner repeatedly receives a set of features (context), takes an action and receives a reward based on th...
Alekh Agarwal, Miroslav Dudík, Satyen Kale,...
GECCO
2010
Springer
191views Optimization» more  GECCO 2010»
13 years 10 months ago
Toward comparison-based adaptive operator selection
Adaptive Operator Selection (AOS) turns the impacts of the applications of variation operators into Operator Selection through a Credit Assignment mechanism. However, most Credit ...
Álvaro Fialho, Marc Schoenauer, Michè...
SAC
2005
ACM
13 years 11 months ago
Stochastic scheduling of active support vector learning algorithms
Active learning is a generic approach to accelerate training of classifiers in order to achieve a higher accuracy with a small number of training examples. In the past, simple ac...
Gaurav Pandey, Himanshu Gupta, Pabitra Mitra