Sciweavers

31 search results - page 1 / 7
ICDAR
2009
IEEE
13 years 11 months ago
Robust Recognition of Documents by Fusing Results of Word Clusters
The word error rate of any optical character recognition system (OCR) is usually substantially below its component or character error rate. This is especially true of Indic langua...
Venkat Rasagna, Anand Kumar 0002, C. V. Jawahar, R...
ICDAR
2009
IEEE
13 years 11 months ago
Word-Based Adaptive OCR for Historical Books
The aim of this work is to propose a new approach to the recognition of historical texts by providing an adaptive mechanism that automatically tunes itself to a specific book. Th...
Vladimir Kluzner, Asaf Tzadok, Yuval Shimony, Euge...
KDD
2010
ACM
13 years 2 months ago
Document clustering via Dirichlet process mixture model with feature selection
One essential issue of document clustering is to estimate the appropriate number of clusters for a document collection to which documents should be partitioned. In this paper, we ...
Guan Yu, Ruizhang Huang, Zhaojun Wang
ICDAR
1995
IEEE
13 years 8 months ago
Visual inter-word relations and their use in OCR postprocessing
A technique is presented that uses visual relationships between word images in a document to improve the recognition of the text it contains. This technique takes advantage of the...
Tao Hong, Jonathan J. Hull
ICASSP
2011
IEEE
12 years 8 months ago
Named entity recognition from Conversational Telephone Speech leveraging Word Confusion Networks for training and recognition
Named Entity (NE) recognition from the results of Automatic Speech Recognition (ASR) is challenging because of ASR errors. To detect NEs, one of the options is to use a statistica...
Gakuto Kurata, Nobuyasu Itoh, Masafumi Nishimura, ...