Sciweavers

779 search results - page 148 / 156
» Reinforcement Using Supervised Learning for Policy Generaliz...
ATAL
2006
Springer
15 years 1 month ago
Efficient agent-based cluster ensembles
Numerous domains ranging from distributed data acquisition to knowledge reuse need to solve the cluster ensemble problem of combining multiple clusterings into a single unified cl...
Adrian K. Agogino, Kagan Tumer
BMCBI
2006
14 years 9 months ago
Improved variance estimation of classification performance via reduction of bias caused by small sample size
Background: Supervised learning for classification of cancer employs a set of design examples to learn how to discriminate between tumors. In practice it is crucial to confirm tha...
Ulrika Wickenberg-Bolin, Hanna Göransson, M&a...
IADIS
2003
14 years 11 months ago
Adaptive Web Service for QOS Improvement
In this paper we investigate how “self-awareness”, through on-line self-monitoring and measurement, coupled with intelligent adaptive behaviour in response to observe...
Erol Gelenbe, Arturo Núñez
STOC
2006
ACM
15 years 10 months ago
Fast convergence to Wardrop equilibria by adaptive sampling methods
We study rerouting policies in a dynamic round-based variant of a well known game theoretic traffic model due to Wardrop. Previous analyses (mostly in the context of selfish routi...
Simon Fischer, Harald Räcke, Berthold Vö...
COLT
2010
Springer
14 years 7 months ago
Nonparametric Bandits with Covariates
We consider a bandit problem which involves sequential sampling from two populations (arms). Each arm produces a noisy reward realization which depends on an observable random cov...
Philippe Rigollet, Assaf Zeevi