Sciweavers

7495 search results - page 104 / 1499
» Intelligent Document Processing
Sort
View
ICDAR
2007
IEEE
15 years 1 months ago
Identification of Latin-Based Languages through Character Stroke Categorization
This paper presents a language identification technique that detects Latin-based languages of imaged documents without OCR. The proposed technique detects languages through the wo...
S. J. Lu, L. Li, Chew Lim Tan
ICDAR
2009
IEEE
14 years 7 months ago
Learning on the Fly: Font-Free Approaches to Difficult OCR Problems
Despite ubiquitous claims that optical character recognition (OCR) is a "solved problem," many categories of documents continue to break modern OCR software such as docu...
Andrew Kae, Erik G. Learned-Miller
IJCNLP
2005
Springer
15 years 3 months ago
Aligning Needles in a Haystack: Paraphrase Acquisition Across the Web
This paper presents a lightweight method for unsupervised extraction of paraphrases from arbitrary textual Web documents. The method differs from previous approaches to paraphrase...
Marius Pasca, Péter Dienes
ICDAR
2011
IEEE
13 years 9 months ago
Digit/Symbol Pruning and Verification for Arabic Handwritten Digit/Symbol Spotting
—In order to spot the digits in a handwritten document, each component is sent to a classifier. This is a time consuming process because a document usually contains several hundr...
Nicola Nobile, Chun Lei He, Malik Waqas Sagheer, L...
KI
2002
Springer
14 years 9 months ago
The Fraunhofer IESE Experience Management System
: Experience Management (EM) is an area that is increasingly gaining importance. Its roots lie in Experimental Software Engineering ("Experience Factory"), in Artificial ...
Andreas Jedlitschka, Klaus-Dieter Althoff, Bjö...