Sciweavers

1413 search results - page 183 / 283
» Mining Multiple Large Databases
Sort
View
KDD
2010
ACM
197views Data Mining» more  KDD 2010»
14 years 11 months ago
Semi-supervised feature selection for graph classification
The problem of graph classification has attracted great interest in the last decade. Current research on graph classification assumes the existence of large amounts of labeled tra...
Xiangnan Kong, Philip S. Yu
CLUSTER
2005
IEEE
15 years 7 months ago
A pipelined data-parallel algorithm for ILP
The amount of data collected and stored in databases is growing considerably for almost all areas of human activity. Processing this amount of data is very expensive, both humanly...
Nuno A. Fonseca, Fernando M. A. Silva, Víto...
ICDE
2003
IEEE
159views Database» more  ICDE 2003»
16 years 3 months ago
Scaling up the ALIAS Duplicate Elimination System
Duplicate elimination is an important stage in integrating data from multiple sources. The challenges involved are finding a robust deduplication function that can identify when t...
Sunita Sarawagi, Alok Kirpal
SIGMOD
1991
ACM
81views Database» more  SIGMOD 1991»
15 years 5 months ago
Multi-Disk B-trees
In this paper, Dept. of Computer Science, University of Waterloo Waterloo, Ontario, Canada, N2L 3G1 we consider how to exploit multiple disks to improve the performance of B-tree ...
Bernhard Seeger, Per-Åke Larson
BMCBI
2010
162views more  BMCBI 2010»
15 years 2 months ago
Moara: a Java library for extracting and normalizing gene and protein mentions
Background: Gene/protein recognition and normalization are important preliminary steps for many biological text mining tasks, such as information retrieval, protein-protein intera...
Mariana L. Neves, José María Carazo,...