Sciweavers

1012 search results - page 22 / 203
» Testing documentation with
Sort
View
57
Voted
CORR
2006
Springer
71views Education» more  CORR 2006»
14 years 9 months ago
Using NLP to build the hypertextuel network of a back-of-the-book index
Relying on the idea that back-of-the-book indexes are traditional devices for navigation through large documents, we have developed a method to build a hypertextual network that h...
Touria Aït El Mekki, Adeline Nazarenko
DRR
2011
13 years 9 months ago
Improved document image segmentation algorithm using multiresolution morphology
Page segmentation into text and non-text components is an essential preprocessing step before OCR operation. If this is not done properly, an OCR classification engine produces g...
Syed Saqib Bukhari, Faisal Shafait, Thomas M. Breu...
LREC
2008
141views Education» more  LREC 2008»
14 years 11 months ago
New Resources for Document Classification, Analysis and Translation Technologies
The goal of the DARPA MADCAT (Multilingual Automatic Document Classification Analysis and Translation) Program is to automatically convert foreign language text images into Englis...
Stephanie Strassel, Lauren Friedman, Safa Ismael, ...
ERCIMDL
2007
Springer
91views Education» more  ERCIMDL 2007»
15 years 3 months ago
Using XML Logical Structure to Retrieve (Multimedia) Objects
This paper investigates the use of the logical structure in XML documents for the retrieval of XML multimedia objects. We study different logical levels and their combinations. Our...
Zhigang Kong, Mounia Lalmas
ICCTA
2007
IEEE
15 years 4 months ago
A Study of Different Kinds of Degradation in Printed Gurmukhi Script
The performance of any OCR system heavily depends upon printing quality of the input document. Many OCRs have been designed which correctly identify fine printed documents both in...
Manish Kumar Jindal, Rajendra Kumar Sharma, Gurpre...