Search Sciweavers | Sciweavers

1246 search results - page 4 / 250

» High Performance Clustering Based on the Similarity Join

CORR
2011
Springer

14 years 9 months ago

Similarity Join Size Estimation using Locality Sensitive Hashing

Download www.vldb.org

Similarity joins are important operations with a broad range of applications. In this paper, we study the problem of vector similarity join size estimation (VSJ). It is a generali...

Hongrae Lee, Raymond T. Ng, Kyuseok Shim

WWW
2008
ACM

16 years 2 months ago

Efficient similarity joins for near duplicate detection

Download www2008.org

With the increasing amount of data and the need to integrate data from multiple data sources, a challenging issue is to find near duplicate records efficiently. In this paper, we ...

Chuan Xiao, Wei Wang 0011, Xuemin Lin, Jeffrey Xu ...

CLUSTER
2002
IEEE

Cluster Based Hybrid Hash Join: Analysis and Evaluation

15 years 6 months ago

Download www.pri.univie.ac.at

The join is the most important, but also the most time consuming operation in relational database systems. We implemented the parallel Hybrid Hash Join algorithm on a PC-cluster a...

Erich Schikuta, Peter Kirkovits

SIGMOD
2011
ACM

Llama: leveraging columnar storage for scalable join processing in the MapReduce framework

14 years 4 months ago

Download www.comp.nus.edu.sg

To achieve high reliability and scalability, most large-scale data warehouse systems have adopted the cluster-based architecture. In this paper, we propose the design of a new clu...

Yuting Lin, Divyakant Agrawal, Chun Chen, Beng Chi...

WIRN
2005
Springer

Ensembles Based on Random Projections to Improve the Accuracy of Clustering Algorithms

15 years 7 months ago

Download homes.dsi.unimi.it

We present an algorithmic scheme for unsupervised cluster ensembles, based on randomized projections between metric spaces, by which a substantial dimensionality reduction is obtai...

Alberto Bertoni, Giorgio Valentini
