Search Sciweavers | Sciweavers

25 search results - page 2 / 5

» ScalParC: A New Scalable and Efficient Parallel Classificati...

click to vote

DPD
2002

125views more DPD 2002»

Parallel Mining of Outliers in Large Database

13 years 5 months ago

Download www4.comp.polyu.edu.hk

Data mining is a new, important and fast growing database application. Outlier (exception) detection is one kind of data mining, which can be applied in a variety of areas like mon...

Edward Hung, David Wai-Lok Cheung

claim paper

Read More »

click to vote

IPPS
2006
IEEE

121views Distributed And Parallel Com...» more IPPS 2006»

Design and analysis of a multi-dimensional data sampling service for large scale data analysis applications

13 years 11 months ago

Download www.cecs.uci.edu

Sampling is a widely used technique to increase efﬁciency in database and data mining applications operating on large dataset. In this paper we present a scalable sampling imple...

Xi Zhang, Tahsin M. Kurç, Joel H. Saltz, Sr...

claim paper

Read More »

click to vote

KDD
1998
ACM

99views Data Mining» more KDD 1998»

On the Efficient Gathering of Sufficient Statistics for Classification from Large SQL Databases

13 years 10 months ago

Download research.microsoft.com

For a wide variety of classification algorithms, scalability to large databases can be achieved by observing that most algorithms are driven by a set of sufficient statistics that...

Goetz Graefe, Usama M. Fayyad, Surajit Chaudhuri

claim paper

Read More »

click to vote

IFIP12
2008

183views Information Technology» more IFIP12 2008»

P-Prism: A Computationally Efficient Approach to Scaling up Classification Rule Induction

13 years 7 months ago

Download www.maxbramer.org.uk

Top Down Induction of Decision Trees (TDIDT) is the most commonly used method of constructing a model from a dataset in the form of classification rules to classify previously unse...

Frederic T. Stahl, Max A. Bramer, Mo Adda

claim paper

Read More »

click to vote

DMKD
1997
ACM

308views Data Mining» more DMKD 1997»

A Fast Clustering Algorithm to Cluster Very Large Categorical Data Sets in Data Mining

13 years 10 months ago

Download www.cs.gsu.edu

Partitioning a large set of objects into homogeneous clusters is a fundamental operation in data mining. The k-means algorithm is best suited for implementing this operation becau...

Zhexue Huang

claim paper

Read More »

« Prev « First page 2 / 5 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers