Sciweavers

1950 search results - page 106 / 390
» Informative sampling for large unbalanced data sets
Sort
View
115
Voted
ICPR
2010
IEEE
15 years 6 months ago
Underwater Mine Classification with Imperfect Labels
A new algorithm for performing classification with imperfectly labeled data is presented. The proposed approach is motivated by the insight that the average prediction of a group ...
David Williams
133
Voted
ICML
2004
IEEE
15 years 8 months ago
Active learning using pre-clustering
The paper is concerned with two-class active learning. While the common approach for collecting data in active learning is to select samples close to the classification boundary,...
Hieu Tat Nguyen, Arnold W. M. Smeulders
159
Voted
ICAIL
2005
ACM
15 years 9 months ago
Effective Document Clustering for Large Heterogeneous Law Firm Collections
Computational resources for research in legal environments have historically implied remote access to large databases of legal documents such as case law, statutes, law reviews an...
Jack G. Conrad, Khalid Al-Kofahi, Ying Zhao, Georg...
123
Voted
ACSW
2004
15 years 4 months ago
Experiences in Building a Tool for Navigating Association Rule Result Sets
Practical knowledge discovery is an iterative process. First, the experiences gained from one mining run are used to inform the parameter setting and the dataset and attribute sel...
Peter Fule, John F. Roddick
BMCBI
2006
171views more  BMCBI 2006»
15 years 3 months ago
The effect of oligonucleotide microarray data pre-processing on the analysis of patient-cohort studies
Background: Intensity values measured by Affymetrix microarrays have to be both normalized, to be able to compare different microarrays by removing non-biological variation, and s...
Roel G. W. Verhaak, Frank J. T. Staal, Peter J. M....