Sciweavers

22 search results - page 4 / 5
» Contextual Multi-Armed Bandits
Sort
View
CORR
2008
Springer
78views Education» more  CORR 2008»
14 years 9 months ago
Characterizing Truthful Multi-Armed Bandit Mechanisms
Moshe Babaioff, Yogeshwer Sharma, Aleksandrs Slivk...
AMAI
2011
Springer
13 years 9 months ago
Multi-armed bandits with episode context
A multi-armed bandit episode consists of n trials, each allowing selection of one of K arms, resulting in payoff from a distribution over [0, 1] associated with that arm. We assum...
Christopher D. Rosin
JMLR
2012
13 years 15 hour ago
Contextual Bandit Learning with Predictable Rewards
Contextual bandit learning is a reinforcement learning problem where the learner repeatedly receives a set of features (context), takes an action and receives a reward based on th...
Alekh Agarwal, Miroslav Dudík, Satyen Kale,...
GECCO
2010
Springer
191views Optimization» more  GECCO 2010»
15 years 2 months ago
Toward comparison-based adaptive operator selection
Adaptive Operator Selection (AOS) turns the impacts of the applications of variation operators into Operator Selection through a Credit Assignment mechanism. However, most Credit ...
Álvaro Fialho, Marc Schoenauer, Michè...
SAC
2005
ACM
15 years 3 months ago
Stochastic scheduling of active support vector learning algorithms
Active learning is a generic approach to accelerate training of classifiers in order to achieve a higher accuracy with a small number of training examples. In the past, simple ac...
Gaurav Pandey, Himanshu Gupta, Pabitra Mitra