Sciweavers

SIGIR
2008
ACM
13 years 4 months ago
Optical character recognition errors and their effects on natural language processing
Errors are unavoidable in advanced computer vision applications such as optical character recognition, and the noise induced by these errors presents a serious challenge to downstr...
Daniel P. Lopresti
EMNLP
2010
13 years 2 months ago
Evaluating Models of Latent Document Semantics in the Presence of OCR Errors
Models of latent document semantics such as the mixture of multinomials model and Latent Dirichlet Allocation have received substantial attention for their ability to discover top...
Daniel David Walker, William B. Lund, Eric K. Ring...
ICDAR
2009
IEEE
13 years 11 months ago
Error-Correcting Output Coding for the Convolutional Neural Network for Optical Character Recognition
It is known that convolutional neural networks (CNNs) are efficient for optical character recognition (OCR) and many other visual classification tasks. This paper applies error-co...
Huiqun Deng, George Stathopoulos, Ching Y. Suen
JMLR
2012
11 years 7 months ago
Bounding the Probability of Error for High Precision Optical Character Recognition
We consider a model for which it is important, early in processing, to estimate some variables with high precision, but perhaps at relatively low recall. If some variables can be ...
Gary B. Huang, Andrew Kae, Carl Doersch, Erik G. L...
NLPRS
2001
Springer
13 years 9 months ago
Automatic Segmentation of Words using Syllable Bigram Statistics
We present a syllable bigram model for segmenting a Korean sentence into words and correcting word-spacing errors in the spelling checker. We evaluated the system’s performance ...
Seung-Shik Kang, Chong-Woo Woo