Search Sciweavers | Sciweavers

80

ICDE
2003
IEEE

116views Database» more ICDE 2003»

Joining Massive High-Dimensional Datasets

14 years 10 months ago

We consider the problem of joining massive datasets. We propose two techniques for minimizing disk I/O cost of join operations for both spatial and sequence data. Our techniques o...

Tamer Kahveci, Christian A. Lang, Ambuj K. Singh

claim paper

Read More »

27

click to vote

SIGMOD
2001
ACM

193views Database» more SIGMOD 2001»

Epsilon Grid Order: An Algorithm for the Similarity Join on Massive High-Dimensional Data

14 years 9 months ago

Download www.dbs.informatik.uni-muenchen.de

The similarity join is an important database primitive which has been successfully applied to speed up applications such as similarity search, data analysis and data mining. The s...

Christian Böhm, Bernhard Braunmüller, Fl...

claim paper

Read More »

81

click to vote

ICDE
1997
IEEE

130views Database» more ICDE 1997»

High-Dimensional Similarity Joins

14 years 10 months ago

Download rakesh.agrawal-family.com

Many emerging data mining applications require a similarity join between points in a high-dimensional domain. We present a new algorithm that utilizes a new index structure, calle...

Kyuseok Shim, Ramakrishnan Srikant, Rakesh Agrawal

claim paper

Read More »

29

click to vote

IDEAL
2004
Springer

122views Intelligent Agents» more IDEAL 2004»

Visualisation of Distributions and Clusters Using ViSOMs on Gene Expression Data

14 years 2 months ago

Download personalpages.manchester.ac.uk

Microarray datasets are often too large to visualise due to the high dimensionality. The self-organising map has been found useful to analyse massive complex datasets. It can be us...

Swapna Sarvesvaran, Hujun Yin

claim paper

Read More »

22

click to vote

ICDM
2002
IEEE

122views Data Mining» more ICDM 2002»

Using Category-Based Adherence to Cluster Market-Basket Data

14 years 2 months ago

Download arbor.ee.ntu.edu.tw

In this paper, we devise an efﬁcient algorithm for clustering market-basket data. Different from those of the traditional data, the features of market-basket data are known to b...

Ching-Huang Yun, Kun-Ta Chuang, Ming-Syan Chen

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers