Sciweavers

1820 search results - page 38 / 364
» Hierarchical Clustering of Words
Sort
View
DAS
2006
Springer
15 years 2 months ago
Efficient Word Retrieval by Means of SOM Clustering and PCA
Abstract. We propose an approach for efficient word retrieval from printed documents belonging to Digital Libraries. The approach combines word image clustering (based on Self Orga...
Simone Marinai, Stefano Faini, Emanuele Marino, Gi...
93
Voted
ICDAR
2009
IEEE
15 years 5 months ago
Robust Recognition of Documents by Fusing Results of Word Clusters
The word error rate of any optical character recognition system (OCR) is usually substantially below its component or character error rate. This is especially true of Indic langua...
Venkat Rasagna, Anand Kumar 0002, C. V. Jawahar, R...
ACL
2001
15 years 16 days ago
Multi-Class Composite N-gram Language Model for Spoken Language Processing Using Multiple Word Clusters
In this paper, a new language model, the Multi-Class Composite N-gram, is proposed to avoid a data sparseness problem for spoken language in that it is difficult to collect traini...
Hirofumi Yamamoto, Shuntaro Isogai, Yoshinori Sagi...
66
Voted
ICPR
2002
IEEE
16 years 6 days ago
Word Segmentation of Printed Text Lines Based on Gap Clustering and Special Symbol Detection
This paper proposes a word segmentation method for machine-printed text lines. It utilizes gaps and special symbols as delimiters between words. A gap clustering technique is used...
Soo-Hyung Kim, Chang Bu Jeong, Hee K. Kwag, Ching ...
EACL
2006
ACL Anthology
15 years 16 days ago
Word Sense Induction: Triplet-Based Clustering and Automatic Evaluation
In this paper a novel solution to automatic and unsupervised word sense induction (WSI) is introduced. It represents an instantiation of the `one sense per collocation' obser...
Stefan Bordag