Sciweavers

1246 search results - page 5 / 250
» High Performance Clustering Based on the Similarity Join
Sort
View
ICDM
2003
IEEE
125views Data Mining» more  ICDM 2003»
15 years 2 months ago
Clustering Item Data Sets with Association-Taxonomy Similarity
We explore in this paper the efficient clustering of item data. Different from those of the traditional data, the features of item data are known to be of high dimensionality and...
Ching-Huang Yun, Kun-Ta Chuang, Ming-Syan Chen
VLDB
2007
ACM
169views Database» more  VLDB 2007»
15 years 9 months ago
Peer-to-Peer Similarity Search in Metric Spaces
This paper addresses the efficient processing of similarity queries in metric spaces, where data is horizontally distributed across a P2P network. The proposed approach does not r...
Christos Doulkeridis, Akrivi Vlachou, Yannis Kotid...
87
Voted
SIGMOD
2004
ACM
182views Database» more  SIGMOD 2004»
15 years 9 months ago
Efficient set joins on similarity predicates
In this paper we present an efficient, scalable and general algorithm for performing set joins on predicates involving various similarity measures like intersect size, Jaccard-coe...
Sunita Sarawagi, Alok Kirpal
LREC
2008
120views Education» more  LREC 2008»
14 years 11 months ago
Division of Example Sentences Based on the Meaning of a Target Word Using Semi-Supervised Clustering
In this paper, we describe a system that divides example sentences (data set) into clusters, based on the meaning of the target word, using a semi-supervised clustering technique....
Hiroyuki Shinnou, Minoru Sasaki
86
Voted
PVLDB
2010
126views more  PVLDB 2010»
14 years 8 months ago
Set Similarity Join on Probabilistic Data
Set similarity join has played an important role in many real-world applications such as data cleaning, near duplication detection, data integration, and so on. In these applicati...
Xiang Lian, Lei Chen 0002