Sciweavers

1950 search results - page 128 / 390
» Informative sampling for large unbalanced data sets
SIGMOD
1999
ACM
15 years 7 months ago
Self-tuning Histograms: Building Histograms Without Looking at Data
In this paper, we introduce self-tuning histograms. Although similar in structure to traditional histograms, these histograms infer data distributions not by examining the data or...
Ashraf Aboulnaga, Surajit Chaudhuri
OTM
2009
Springer
15 years 10 months ago
LinksB2N: Automatic Data Integration for the Semantic Web
The ongoing trend towards open data embraced by the Semantic Web has started to produce a large number of data sources. These data sources are published using RDF vocabul...
Manuel Salvadores, Gianluca Correndo, Bene Rodrigu...
KDD
2004
ACM
16 years 3 months ago
Redundancy based feature selection for microarray data
In gene expression microarray data analysis, selecting a small number of discriminative genes from thousands of genes is an important problem for accurate classification of diseas...
Lei Yu, Huan Liu
CVPR
2009
IEEE
16 years 10 months ago
Isometric Registration of Ambiguous and Partial Data
This paper introduces a new shape matching algorithm for computing correspondences between 3D surfaces that have undergone (approximately) isometric deformations. The new approach ...
Art Tevs (Max Planck Institute Informatik), Martin...
CORR
1999
Springer
15 years 3 months ago
ZBroker: A Query Routing Broker for Z39.50 Databases
A query routing broker is a software agent that determines from a large set of accessing information sources the ones most relevant to a user's information need. As the numbe...
Yong Lin, Jian Xu, Ee-Peng Lim, Wee Keong Ng