Search Sciweavers | Sciweavers

SIGMOD
2012
ACM

Exploiting MapReduce-based similarity joins

11 years 7 months ago

Cloud enabled systems have become a crucial component to efficiently process and analyze massive amounts of data. One of the key data processing and analysis operations is the Sim...

Yasin N. Silva, Jason M. Reed

PVLDB
2008

Ed-Join: an efficient algorithm for similarity joins with edit distance constraints

13 years 4 months ago

Download www.cse.unsw.edu.au

There has been considerable interest in similarity join in the research community recently. Similarity join is a fundamental operation in many application areas, such as data inte...

Chuan Xiao, Wei Wang 0011, Xuemin Lin

DASFAA
2006
IEEE

Probabilistic Similarity Join on Uncertain Data

13 years 10 months ago

Download www.dbs.informatik.uni-muenchen.de

An important database primitive for commonly used feature databases is the similarity join. It combines two datasets based on some similarity predicate into one set such that the n...

Hans-Peter Kriegel, Peter Kunath, Martin Pfeifle, ...

WWW
2008
ACM

Efficient similarity joins for near duplicate detection

14 years 5 months ago

Download www2008.org

With the increasing amount of data and the need to integrate data from multiple data sources, a challenging issue is to find near duplicate records efficiently. In this paper, we ...

Chuan Xiao, Wei Wang 0011, Xuemin Lin, Jeffrey Xu ...

IDEAS
2009
IEEE

A cluster-based approach to XML similarity joins

13 years 11 months ago

Download wwwlgis.informatik.uni-kl.de

A natural consequence of the widespread adoption of XML as standard for information representation and exchange is the redundant storage of large amounts of persistent XML documen...

Leonardo Ribeiro, Theo Härder, Fernanda S. Pi...
