Search Sciweavers | Sciweavers

A multi-armed bandit episode consists of n trials, each allowing selection of one of K arms, resulting in payoff from a distribution over [0, 1] associated with that arm. We assum...

Christopher D. Rosin

claim paper

Read More »

146

Voted

JMLR
2012

200views Programming Languages» more JMLR 2012»

Contextual Bandit Learning with Predictable Rewards

13 years 3 months ago

Download www.cs.princeton.edu

Contextual bandit learning is a reinforcement learning problem where the learner repeatedly receives a set of features (context), takes an action and receives a reward based on th...

Alekh Agarwal, Miroslav Dudík, Satyen Kale,...

claim paper

Read More »

128

click to vote

GECCO
2010
Springer

191views Optimization» more GECCO 2010»

Toward comparison-based adaptive operator selection

15 years 5 months ago

Download hal.archives-ouvertes.fr

Adaptive Operator Selection (AOS) turns the impacts of the applications of variation operators into Operator Selection through a Credit Assignment mechanism. However, most Credit ...

Álvaro Fialho, Marc Schoenauer, Michè...

claim paper

Read More »

114

click to vote

SAC
2005
ACM

149views Applied Computing» more SAC 2005»

Stochastic scheduling of active support vector learning algorithms

15 years 6 months ago

Download www-users.cs.umn.edu

Active learning is a generic approach to accelerate training of classiﬁers in order to achieve a higher accuracy with a small number of training examples. In the past, simple ac...

Gaurav Pandey, Himanshu Gupta, Pabitra Mitra

claim paper

Read More »

« Prev « First page 4 / 5 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers