Search Sciweavers | Sciweavers

20

Publication

344views

12 years 3 months ago

Top-k Similarity Join over Multi-valued Objects

The top-k similarity joins have been extensively studied and used in a wide spectrum of applications such as information retrieval, decision making, spatial data analysis and dat...

Wenjie Zhang, Jing Xu, Xin Liang, Ying Zhang, Xuem...

claim paper

Read More »

23

click to vote

PVLDB
2010

195views more PVLDB 2010»

Trie-Join: Efficient Trie-based String Similarity Joins with Edit-Distance Constraints

13 years 12 days ago

Download www.comp.nus.edu.sg

A string similarity join finds similar pairs between two collections of strings. It is an essential operation in many applications, such as data integration and cleaning, and has ...

Jiannan Wang, Guoliang Li, Jianhua Feng

claim paper

Read More »

21

click to vote

ICDM
2002
IEEE

163views Data Mining» more ICDM 2002»

High Performance Data Mining Using the Nearest Neighbor Join

13 years 10 months ago

Download www.dbs.informatik.uni-muenchen.de

The similarity join has become an important database primitive to support similarity search and data mining. A similarity join combines two sets of complex objects such that the r...

Christian Böhm, Florian Krebs

claim paper

Read More »

15

click to vote

SIGMOD
2004
ACM

182views Database» more SIGMOD 2004»

Efficient set joins on similarity predicates

14 years 5 months ago

Download www.it.iitb.ac.in

In this paper we present an efficient, scalable and general algorithm for performing set joins on predicates involving various similarity measures like intersect size, Jaccard-coe...

Sunita Sarawagi, Alok Kirpal

claim paper

Read More »

10

click to vote

ADBIS
2009
Springer

162views Database» more ADBIS 2009»

Efficient Set Similarity Joins Using Min-prefixes

13 years 9 months ago

Download wwwlgis.informatik.uni-kl.de

Identification of all objects in a dataset whose similarity is not less than a specified threshold is of major importance for management, search, and analysis of data. Set similari...

Leonardo Ribeiro, Theo Härder

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers