Search Sciweavers | Sciweavers

25 search results - page 2 / 5

» ScalParC: A New Scalable and Efficient Parallel Classificati...

click to vote

DPD
2002

125views more DPD 2002»

Parallel Mining of Outliers in Large Database

13 years 5 months ago

Download www4.comp.polyu.edu.hk

Data mining is a new, important and fast growing database application. Outlier (exception) detection is one kind of data mining, which can be applied in a variety of areas like mon...

Edward Hung, David Wai-Lok Cheung

claim paper

Read More »

click to vote

IPPS
2006
IEEE

121views Distributed And Parallel Com...» more IPPS 2006»

Design and analysis of a multi-dimensional data sampling service for large scale data analysis applications

13 years 11 months ago

Download www.cecs.uci.edu

Sampling is a widely used technique to increase efﬁciency in database and data mining applications operating on large dataset. In this paper we present a scalable sampling imple...

Xi Zhang, Tahsin M. Kurç, Joel H. Saltz, Sr...

claim paper

Read More »

click to vote

KDD
1998
ACM

99views Data Mining» more KDD 1998»

On the Efficient Gathering of Sufficient Statistics for Classification from Large SQL Databases

13 years 9 months ago

Download research.microsoft.com

For a wide variety of classification algorithms, scalability to large databases can be achieved by observing that most algorithms are driven by a set of sufficient statistics that...

Goetz Graefe, Usama M. Fayyad, Surajit Chaudhuri

claim paper

Read More »

click to vote

IFIP12
2008

183views Information Technology» more IFIP12 2008»

P-Prism: A Computationally Efficient Approach to Scaling up Classification Rule Induction

13 years 6 months ago

Download www.maxbramer.org.uk

Top Down Induction of Decision Trees (TDIDT) is the most commonly used method of constructing a model from a dataset in the form of classification rules to classify previously unse...

Frederic T. Stahl, Max A. Bramer, Mo Adda

claim paper

Read More »

click to vote

DMKD
1997
ACM

308views Data Mining» more DMKD 1997»

A Fast Clustering Algorithm to Cluster Very Large Categorical Data Sets in Data Mining

13 years 9 months ago

Download www.cs.gsu.edu

Partitioning a large set of objects into homogeneous clusters is a fundamental operation in data mining. The k-means algorithm is best suited for implementing this operation becau...

Zhexue Huang

claim paper

Read More »

« Prev « First page 2 / 5 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers