Sciweavers

» Document clustering with committees
WEBI
2005
Springer
15 years 5 months ago
A Semi-Supervised Document Clustering Algorithm Based on EM
Document clustering is a very hard task in Automatic Text Processing since it requires extracting regular patterns from a document collection without a priori knowledge on the cat...
Leonardo Rigutini, Marco Maggini
ACMSE
2007
ACM
15 years 3 months ago
Enhancing clustering blog documents by utilizing author/reader comments
Blogs are a new form of internet phenomenon and a vast ever-increasing information resource. Mining blog files for information is a very new research direction in data mining. We p...
Beibei Li, Shuting Xu, Jun Zhang
WWW
2004
ACM
16 years 12 days ago
A hierarchical monothetic document clustering algorithm for summarization and browsing search results
Organizing Web search results into a hierarchy of topics and subtopics facilitates browsing the collection and locating results of interest. In this paper, we propose a new hierar...
Krishna Kummamuru, Rohit Lotlikar, Shourya Roy, Ka...
ACL
2009
14 years 9 months ago
Creating a Gold Standard for Sentence Clustering in Multi-Document Summarization
Sentence Clustering is often used as a first step in Multi-Document Summarization (MDS) to find redundant information. All the same there is no gold standard available. This paper...
Johanna Geiss
SIGIR
2002
ACM
14 years 11 months ago
Unsupervised document classification using sequential information maximization
We present a novel sequential clustering algorithm which is motivated by the Information Bottleneck (IB) method. In contrast to the agglomerative IB algorithm, the new sequential ...
Noam Slonim, Nir Friedman, Naftali Tishby