Sciweavers

AUSAI
2007
Springer
15 years 4 months ago
DBSC: A Dependency-Based Subspace Clustering Algorithm for High Dimensional Numerical Datasets
Abstract. We present a novel algorithm called DBSC, which finds subspace clusters in numerical datasets based on the concept of "dependency". This algorithm employs a depth-...
Xufei Wang, Chunping Li
ECML
2005
Springer
15 years 3 months ago
A Distance-Based Approach for Action Recommendation
Abstract. Rule induction has attracted a great deal of attention in Machine Learning and Data Mining. However, generating rules is not an end in itself because their applicability ...
Ronan Trepos, Ansaf Salleb, Marie-Odile Cordier, V...
KDD
1995
ACM
15 years 1 months ago
Robust Decision Trees: Removing Outliers from Databases
Finding and removing outliers is an important problem in data mining. Errors in large databases can be extremely common, so an important property of a data mining algorithm is robus...
George H. John
ICDCS
2006
IEEE
15 years 4 months ago
ParRescue: Scalable Parallel Algorithm and Implementation for Biclustering over Large Distributed Datasets
Biclustering refers to simultaneously capturing correlations present among subsets of attributes (columns) and records (rows). It is widely used in data mining applications includ...
Jianhong Zhou, Ashfaq A. Khokhar
ICDM
2006
IEEE
15 years 4 months ago
Hierarchical Density Shaving: A clustering and visualization framework for large biological datasets
In many clustering applications for bioinformatics, only part of the data clusters into one or more groups while the rest needs to be pruned. For such situations, we present Hiera...
Gunjan Gupta, Alexander Liu, Joydeep Ghosh