Sciweavers

77 search results - page 2 / 16
» On the Anonymization of Sparse High-Dimensional Data
Sort
View
PAKDD
2009
ACM
186views Data Mining» more  PAKDD 2009»
14 years 1 days ago
Pairwise Constrained Clustering for Sparse and High Dimensional Feature Spaces
Abstract. Clustering high dimensional data with sparse features is challenging because pairwise distances between data items are not informative in high dimensional space. To addre...
Su Yan, Hai Wang, Dongwon Lee, C. Lee Giles
DEXA
2009
Springer
151views Database» more  DEXA 2009»
13 years 12 months ago
Detecting Projected Outliers in High-Dimensional Data Streams
Abstract. In this paper, we study the problem of projected outlier detection in high dimensional data streams and propose a new technique, called Stream Projected Ouliter deTector ...
Ji Zhang, Qigang Gao, Hai H. Wang, Qing Liu, Kai X...
ICANN
2009
Springer
13 years 10 months ago
Empirical Study of the Universum SVM Learning for High-Dimensional Data
Abstract. Many applications of machine learning involve sparse highdimensional data, where the number of input features is (much) larger than the number of data samples, d n. Predi...
Vladimir Cherkassky, Wuyang Dai
SIGMOD
2001
ACM
142views Database» more  SIGMOD 2001»
14 years 5 months ago
Outlier Detection for High Dimensional Data
The outlier detection problem has important applications in the eld of fraud detection, network robustness analysis, and intrusion detection. Most such applications are high dimen...
Charu C. Aggarwal, Philip S. Yu
ICDM
2002
IEEE
191views Data Mining» more  ICDM 2002»
13 years 10 months ago
Iterative Clustering of High Dimensional Text Data Augmented by Local Search
The k-means algorithm with cosine similarity, also known as the spherical k-means algorithm, is a popular method for clustering document collections. However, spherical k-means ca...
Inderjit S. Dhillon, Yuqiang Guan, J. Kogan