Sciweavers

1836 search results - page 138 / 368
» Mining Clustering Dimensions
Sort
View
ICDM
2005
IEEE
136views Data Mining» more  ICDM 2005»
15 years 7 months ago
Pairwise Symmetry Decomposition Method for Generalized Covariance Analysis
We propose a new theoretical framework for generalizing the traditional notion of covariance. First, we discuss the role of pairwise cross-cumulants by introducing a cluster expan...
Tsuyoshi Idé
ICDM
2005
IEEE
188views Data Mining» more  ICDM 2005»
15 years 7 months ago
CLUMP: A Scalable and Robust Framework for Structure Discovery
We introduce a robust and efficient framework called CLUMP (CLustering Using Multiple Prototypes) for unsupervised discovery of structure in data. CLUMP relies on finding multip...
Kunal Punera, Joydeep Ghosh
SDM
2007
SIAM
98views Data Mining» more  SDM 2007»
15 years 3 months ago
An incremental data-stream sketch using sparse random projections
We propose the use of random projections with a sparse matrix to maintain a sketch of a collection of high-dimensional data-streams that are updated asynchronously. This sketch al...
Aditya Krishna Menon, Gia Vinh Anh Pham, Sanjay Ch...
128
Voted
SAC
2009
ACM
15 years 8 months ago
Combining statistics and semantics via ensemble model for document clustering
Incorporating background knowledge into data mining algorithms is an important but challenging problem. Current approaches in semi-supervised learning require explicit knowledge p...
Samah Jamal Fodeh, William F. Punch, Pang-Ning Tan
PAKDD
2009
ACM
127views Data Mining» more  PAKDD 2009»
15 years 8 months ago
Clustering Documents Using a Wikipedia-Based Concept Representation
Abstract. This paper shows how Wikipedia and the semantic knowledge it contains can be exploited for document clustering. We first create a concept-based document representation b...
Anna Huang, David N. Milne, Eibe Frank, Ian H. Wit...