Search Sciweavers | Sciweavers

1314 search results - page 33 / 263

» Approximate data mining in very large relational data

KDD
2004
ACM

Improved robustness of signature-based near-replica detection via lexicon randomization

16 years 5 months ago

Download ir.iit.edu

Detection of near duplicate documents is an important problem in many data mining and information filtering applications. When faced with massive quantities of data, traditional d...

Aleksander Kolcz, Abdur Chowdhury, Joshua Alspecto...

INFOCOM
2010
IEEE

Tracking Quantiles of Network Data Streams with Dynamic Operations

15 years 3 months ago

Download www.bell-labs.com

Quantiles are very useful in characterizing the data distribution of an evolving dataset in the process of data mining or network monitoring. The method of Stochastic Approxima...

Jin Cao, Li (Erran) Li, Aiyou Chen, Tian Bu

CCGRID
2010
IEEE

High Performance Dimension Reduction and Visualization for Large High-Dimensional Data Analysis

15 years 6 months ago

Download grids.ucs.indiana.edu

Large high dimension datasets are of growing importance in many fields and it is important to be able to visualize them for understanding the results of data mining appro...

Jong Youl Choi, Seung-Hee Bae, Xiaohong Qiu, Geoff...

ICDE
2006
IEEE

CLAN: An Algorithm for Mining Closed Cliques from Large Dense Graph Databases

16 years 6 months ago

Download making.csie.ndhu.edu.tw

Most previously proposed frequent graph mining algorithms are intended to find the complete set of all frequent, closed subgraphs. However, in many cases only a subset of the freq...

Jianyong Wang, Zhiping Zeng, Lizhu Zhou

ICDE
2002
IEEE

Data Mining Meets Performance Evaluation: Fast Algorithms for Modeling Bursty Traffic

16 years 6 months ago

Download www.pdl.cmu.edu

Network, web, and disk I/O traffic are usually bursty, self-similar [9, 3, 5, 6] and therefore cannot be modeled adequately with Poisson arrivals [9]. However, we do want to model...

Mengzhi Wang, Ngai Hang Chan, Spiros Papadimitriou...
