Search Sciweavers | Sciweavers

319 search results - page 8 / 64

» Algorithms for Mining Distance-Based Outliers in Large Datas...

246

click to vote

IJIT
2004

226views Artificial Intelligence» more IJIT 2004»

IMDC: An Image-Mapped Data Clustering Technique for Large Datasets

15 years 8 months ago

Download www.waset.org

In this paper, we present a new algorithm for clustering data in large datasets using image processing approaches. First the dataset is mapped into a binary image plane. The synthe...

Faruq A. Al-Omari, Nabeel I. Al-Fayoumi

claim paper

Read More »

211

click to vote

ICDE
2010
IEEE

750views Database» more ICDE 2010»

Efficient and accurate discovery of patterns in sequence datasets

15 years 11 months ago

Download making.csie.ndhu.edu.tw

Existing sequence mining algorithms mostly focus on mining for subsequences. However, a large class of applications, such as biological DNA and protein motif mining, require effici...

Avrilia Floratou, Sandeep Tata, Jignesh M. Patel

claim paper

Read More »

199

click to vote

AUSAI
2003
Springer

141views Artificial Intelligence» more AUSAI 2003»

Efficiently Mining Frequent Patterns from Dense Datasets Using a Cluster of Computers

16 years 21 days ago

Download www.computing.edu.au

Efficient mining of frequent patterns from large databases has been an active area of research since it is the most expensive step in association rules mining. In this paper, we pr...

Yudho Giri Sucahyo, Raj P. Gopalan, Amit Rudra

claim paper

Read More »

186

click to vote

KDD
2007
ACM

141views Data Mining» more KDD 2007»

Detecting anomalous records in categorical datasets

16 years 7 months ago

Download www.cs.cmu.edu

We consider the problem of detecting anomalies in high arity categorical datasets. In most applications, anomalies are defined as data points that are 'abnormal'. Quite ...

Kaustav Das, Jeff G. Schneider

claim paper

Read More »

313

click to vote

SIGMOD
2008
ACM

157views Database» more SIGMOD 2008»

CRD: fast co-clustering on large datasets utilizing sampling-based matrix decomposition

16 years 7 months ago

Download compgen.unc.edu

The problem of simultaneously clustering columns and rows (coclustering) arises in important applications, such as text data mining, microarray analysis, and recommendation system...

Feng Pan, Xiang Zhang, Wei Wang 0010

claim paper

Read More »

« Prev « First page 8 / 64 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers