Sciweavers

CIKM
2000
Springer
High Performance Clustering Based on the Similarity Join
Christian Böhm, Bernhard Braunmüller, Ma...
ICDM
2002
IEEE
High Performance Data Mining Using the Nearest Neighbor Join
The similarity join has become an important database primitive to support similarity search and data mining. A similarity join combines two sets of complex objects such that the r...
Christian Böhm, Florian Krebs
ICDE
1998
IEEE
High Dimensional Similarity Joins: Algorithms and Performance Evaluation
Current data repositories include a variety of data types, including audio, images and time series. State of the art techniques for indexing such data and doing query processing r...
Nick Koudas, Kenneth C. Sevcik
PVLDB
2010
Trie-Join: Efficient Trie-based String Similarity Joins with Edit-Distance Constraints
A string similarity join finds similar pairs between two collections of strings. It is an essential operation in many applications, such as data integration and cleaning, and has ...
Jiannan Wang, Guoliang Li, Jianhua Feng
SIGMOD
2012
ACM
Exploiting MapReduce-based similarity joins
Cloud enabled systems have become a crucial component to efficiently process and analyze massive amounts of data. One of the key data processing and analysis operations is the Sim...
Yasin N. Silva, Jason M. Reed