Sciweavers

54 search results - page 1 / 11
» Efficient parallel set-similarity joins using MapReduce
SIGMOD
2010
ACM
208 views, Database
13 years 9 months ago
Efficient parallel set-similarity joins using MapReduce
Rares Vernica, Michael J. Carey, Chen Li
SIGMOD
2011
ACM
299 views, Database
12 years 7 months ago
Processing theta-joins using MapReduce
Joins are essential for many data analysis tasks, but are not supported directly by the MapReduce paradigm. While there has been progress on equi-joins, implementation of join alg...
Alper Okcan, Mirek Riedewald
CORR
2010
Springer
205 views, Education
13 years 5 months ago
Behavioral Simulations in MapReduce
In many scientific domains, researchers are turning to large-scale behavioral simulations to better understand real-world phenomena. While there has been a great deal of work on s...
Guozhang Wang, Marcos Antonio Vaz Salles, Benjamin...
ADBIS
2009
Springer
162 views, Database
13 years 9 months ago
Efficient Set Similarity Joins Using Min-prefixes
Identification of all objects in a dataset whose similarity is not less than a specified threshold is of major importance for management, search, and analysis of data. Set similari...
Leonardo Ribeiro, Theo Härder
ICDE
2009
IEEE
194 views, Database
14 years 6 months ago
Top-k Set Similarity Joins
Similarity join is a useful primitive operation underlying many applications, such as near duplicate Web page detection, data integration, and pattern recognition. Tradi...
Chuan Xiao, Wei Wang 0011, Xuemin Lin, Haichuan Sh...