Sciweavers

566 search results - page 1 / 114
» OCR with No Shape Training
Sort
View
ICPR
2000
IEEE
14 years 5 months ago
OCR with No Shape Training
We present a document-specific OCR system and apply it to a corpus of faxed business letters. Unsupervised classification of the segmented character bitmaps on each page, using a ...
Tin Kam Ho, George Nagy
DIAL
2004
IEEE
170views Image Analysis» more  DIAL 2004»
13 years 8 months ago
Document Style Census for OCR
Four methods of converting paper documents to computer-readable form are compared with regard to hypothetical labor cost: keyboarding, omnifont OCR, stylespecific OCR, and style-c...
George Nagy, Prateek Sarkar
LREC
2010
141views Education» more  LREC 2010»
13 years 6 months ago
A Game-based Approach to Transcribing Images of Text
We present a methodology that takes as input scanned documents of typed or hand-written text, and produces transcriptions of the text as output. Instead of using OCR technology, t...
Khalil Dahab, Anja Belz
ACL
1998
13 years 5 months ago
Japanese OCR Error Correction using Character Shape Similarity and Statistical Language Model
We present a novel OCR error correction method for languages without word delimiters that have a large character set, such as Japanese and Chinese. It consists of a statistical OC...
Masaaki Nagata
ICDAR
2005
IEEE
13 years 10 months ago
Text Degradations and OCR Training
Printing and scanning of text documents introduces degradations to the characters which can be modeled. Interestingly, certain combinations of the parameters that govern the degra...
Elisa H. Barney Smith, Tim L. Andersen