Sciweavers

2 search results - page 1 / 1
» Multi-armed bandits with episode context
Sort
View
AMAI
2011
Springer
12 years 4 months ago
Multi-armed bandits with episode context
A multi-armed bandit episode consists of n trials, each allowing selection of one of K arms, resulting in payoff from a distribution over [0, 1] associated with that arm. We assum...
Christopher D. Rosin
GECCO
2010
Springer
191views Optimization» more  GECCO 2010»
13 years 9 months ago
Toward comparison-based adaptive operator selection
Adaptive Operator Selection (AOS) turns the impacts of the applications of variation operators into Operator Selection through a Credit Assignment mechanism. However, most Credit ...
Álvaro Fialho, Marc Schoenauer, Michè...