Sciweavers

110 search results - page 2 / 22
» A Comparison of Two Document Clustering Approaches for Clust...
Sort
View
SIGIR
2008
ACM
13 years 4 months ago
Pagerank based clustering of hypertext document collections
Clustering hypertext document collection is an important task in Information Retrieval. Most clustering methods are based on document content and do not take into account the hype...
Konstantin Avrachenkov, Vladimir Dobrynin, Danil N...
NLDB
2007
Springer
13 years 11 months ago
Selecting Labels for News Document Clusters
This work deals with determination of meaningful and terse cluster labels for News document clusters. We analyze a number of alternatives for selecting headlines and/or sentences o...
Krishnaprasad Thirunarayan, Trivikram Immaneni, Ma...
KDD
2010
ACM
326views Data Mining» more  KDD 2010»
13 years 2 months ago
Document clustering via dirichlet process mixture model with feature selection
One essential issue of document clustering is to estimate the appropriate number of clusters for a document collection to which documents should be partitioned. In this paper, we ...
Guan Yu, Ruizhang Huang, Zhaojun Wang
KDD
2002
ACM
166views Data Mining» more  KDD 2002»
14 years 5 months ago
Frequent term-based text clustering
Text clustering methods can be used to structure large sets of text or hypertext documents. The well-known methods of text clustering, however, do not really address the special p...
Florian Beil, Martin Ester, Xiaowei Xu
GRC
2005
IEEE
13 years 10 months ago
Semantic based clustering of Web documents
Abstract. A new methodology that structures the semantics of a collection of documents into the geometry of a simplicial complex is developed. A simplicial complex is topologically...
Tsau Young Lin, I-Jen Chiang