Sciweavers

6388 search results - page 187 / 1278
» High Performance Data Mining
Sort
View
ICDM
2009
IEEE
155views Data Mining» more  ICDM 2009»
15 years 11 months ago
A Contrast Pattern Based Clustering Quality Index for Categorical Data
Since clustering is unsupervised and highly explorative, clustering validation (i.e. assessing the quality of clustering solutions) has been an important and long standing researc...
Qingbao Liu, Guozhu Dong
KDD
2005
ACM
125views Data Mining» more  KDD 2005»
16 years 4 months ago
Email data cleaning
Addressed in this paper is the issue of `email data cleaning' for text mining. Many text mining applications need take emails as input. Email data is usually noisy and thus i...
Jie Tang, Hang Li, Yunbo Cao, ZhaoHui Tang
ICPR
2010
IEEE
15 years 11 months ago
A Bound on the Performance of LDA in Randomly Projected Data Spaces
We consider the problem of classification in nonadaptive dimensionality reduction. Specifically, we bound the increase in classification error of Fisher’s Linear Discriminant...
Robert John Durrant, Ata Kaban
APBC
2006
181views Bioinformatics» more  APBC 2006»
15 years 5 months ago
Analyzing Inconsistency Toward Enhancing Integration of Biological Molecular Databases
: The rapid growth of biological databases not only provides biologists with abundant data but also presents a big challenge in relation to the analysis of data. Many data analysis...
Yi-Ping Phoebe Chen, Qingfeng Chen
ICDE
2012
IEEE
246views Database» more  ICDE 2012»
13 years 7 months ago
HiCS: High Contrast Subspaces for Density-Based Outlier Ranking
—Outlier mining is a major task in data analysis. Outliers are objects that highly deviate from regular objects in their local neighborhood. Density-based outlier ranking methods...
Fabian Keller, Emmanuel Müller, Klemens B&oum...