Sciweavers

55 search results - page 2 / 11
» A hybrid unsupervised approach for document clustering
Sort
View
SIGIR
2002
ACM
13 years 4 months ago
Unsupervised document classification using sequential information maximization
We present a novel sequential clustering algorithm which is motivated by the Information Bottleneck (IB) method. In contrast to the agglomerative IB algorithm, the new sequential ...
Noam Slonim, Nir Friedman, Naftali Tishby
CIKM
2000
Springer
13 years 9 months ago
A Semi-Supervised Document Clustering Technique for Information Organization
This paper discusses a new type of semi-supervised document clustering that uses partial supervision to partition a large set of documents. Most clustering methods organizes docum...
Han-joon Kim, Sang-goo Lee
AI
2009
Springer
13 years 11 months ago
An Iterative Hybrid Filter-Wrapper Approach to Feature Selection for Document Clustering
The manipulation of large-scale document data sets often involves the processing of a wealth of features that correspond with the available terms in the document space. The employm...
Mohammad-Amin Jashki, Majid Makki, Ebrahim Bagheri...
EMNLP
2007
13 years 6 months ago
Topic Segmentation with Hybrid Document Indexing
We present a domain-independent unsupervised topic segmentation approach based on hybrid document indexing. Lexical chains have been successfully employed to evaluate lexical cohe...
Irina Matveeva, Gina-Anne Levow
ICDAR
2009
IEEE
13 years 11 months ago
Unsupervised HMM Adaptation Using Page Style Clustering
In this paper we present an innovative two-stage adaptation approach for handwriting recognition that is based on clustering of similar pages in the training data. In our approach...
Huaigu Cao, Rohit Prasad, Shirin Saleem, Premkumar...