Sciweavers

1390 search results - page 198 / 278
» Self-Sizing of Clustered Databases
WWW
2004
ACM
15 years 10 months ago
Similarity spreading: a unified framework for similarity calculation of interrelated objects
In many Web search applications, similarities between objects of one type (say, queries) can be affected by the similarities between their interrelated objects of another type (sa...
Gui-Rong Xue, Hua-Jun Zeng, Zheng Chen, Wei-Ying M...
KDD
2008
ACM
156 views, Data Mining
15 years 10 months ago
Unsupervised deduplication using cross-field dependencies
Recent work in deduplication has shown that collective deduplication of different attribute types can improve performance. But although these techniques cluster the attributes col...
Robert Hall, Charles A. Sutton, Andrew McCallum
KDD
2005
ACM
139 views, Data Mining
15 years 10 months ago
Reasoning about sets using redescription mining
Redescription mining is a newly introduced data mining problem that seeks to find subsets of data that afford multiple definitions. It can be viewed as a generalization of associa...
Mohammed Javeed Zaki, Naren Ramakrishnan
VLDB
2007
ACM
174 views, Database
15 years 10 months ago
An adaptive and dynamic dimensionality reduction method for high-dimensional indexing
The notorious "dimensionality curse" is a well-known phenomenon for any multi-dimensional index attempting to scale up to high dimensions. One well-known approa...
Heng Tao Shen, Xiaofang Zhou, Aoying Zhou
EDBT
2008
ACM
146 views, Database
15 years 10 months ago
Attribute selection in multivariate microaggregation
Microaggregation is one of the most employed microdata protection methods. The idea is to build clusters of at least k original records, and then replace them with the centroid of...
Javier Herranz, Jordi Nin, Vicenç Torra