Sciweavers

23 search results - page 2 / 5
» A word shape coding method for camera-based document images
Sort
View
AAAI
2006
13 years 6 months ago
Script and Language Identification in Degraded and Distorted Document Images
This paper reports a statistical identification technique that differentiates scripts and languages in degraded and distorted document images. We identify scripts and languages th...
Shijian Lu, Chew Lim Tan
ICDAR
2007
IEEE
13 years 9 months ago
Identification of Latin-Based Languages through Character Stroke Categorization
This paper presents a language identification technique that detects Latin-based languages of imaged documents without OCR. The proposed technique detects languages through the wo...
S. J. Lu, L. Li, Chew Lim Tan
ANLP
1994
105views more  ANLP 1994»
13 years 6 months ago
Modeling Content Identification from Document Images
A new technique to locate content-representing words for a given document image using representation of character shapes is described. A character shape code representation define...
Takehiro Nakayama
ANLP
1994
104views more  ANLP 1994»
13 years 6 months ago
Language Determination: Natural Language Processing from Scanned Document Images
Many documents are available to a computer only as images from paper. However, most natural language processing systems expect their input as character-coded text, which may be di...
Penelope Sibun, A. Lawrence Spitz
DRR
2009
13 years 3 months ago
Retrieval of historical documents by word spotting
The implementation of word spotting is not an easy procedure and it gets even worse in the case of historical documents since it requires character recognition and indexing of the...
Nikoleta Doulgeri, Ergina Kavallieratou