Sciweavers

445 search results - page 3 / 89
Search query: Distributed hierarchical document clustering
SDM
2003
SIAM
13 years 7 months ago
Hierarchical Document Clustering using Frequent Itemsets
A major challenge in document clustering is the extremely high dimensionality. For example, the vocabulary for a document set can easily be thousands of words. On the other hand, ...
Benjamin C. M. Fung, Ke Wang, Martin Ester
CIS
2005
Springer
13 years 11 months ago
Concept Chain Based Text Clustering
Different from familiar clustering objects, text documents have sparse data spaces. A common way of representing a document is as a bag of its component words, but the semantic re...
Shaoxu Song, Jian Zhang, Chunping Li
ICDT
2007
ACM
13 years 9 months ago
Hierarchical Summarizing and Evaluating for Web Pages
In this investigation we propose a novel summarization method of Web pages using hierarchical expression. We discuss close relationship between summarization and hierarchical clust...
Kou Takahashi, Takao Miura, Isamu Shioya
ICDM
2003
IEEE
13 years 11 months ago
A Dynamic Adaptive Self-Organising Hybrid Model for Text Clustering
Clustering by document concepts is a powerful way of retrieving information from a large number of documents. This task in general does not make any assumption on the data distrib...
Chihli Hung, Stefan Wermter
KDD
2002
ACM
14 years 6 months ago
Enhanced word clustering for hierarchical text classification
In this paper we propose a new information-theoretic divisive algorithm for word clustering applied to text classification. In previous work, such "distributional clustering"...
Inderjit S. Dhillon, Subramanyam Mallela, Rahul Ku...