Sciweavers

50 search results - page 4 / 10
» The Multistage Approach to Information Extraction in Degrade...
Sort
View
ICDAR
2009
IEEE
14 years 5 days ago
Recognition of Degraded Handwritten Characters Using Local Features
The main problems of Optical Character Recognition (OCR) systems are solved if printed latin text is considered. Since OCR systems are based upon binary images, their results are ...
Markus Diem, Robert Sablatnig
ICPR
2008
IEEE
14 years 6 months ago
Improved document image binarization by using a combination of multiple binarization techniques and adapted edge information
This paper presents a new adaptive approach for document image binarization. The proposed method is mainly based on the combination of several stateof-the-art binarization methodo...
Basilios Gatos, Ioannis Pratikakis, Stavros J. Per...
DOCENG
2009
ACM
13 years 12 months ago
Web document text and images extraction using DOM analysis and natural language processing
: © Web Document Text and Images Extraction using DOM Analysis and Natural Language Processing Parag Mulendra Joshi, Sam Liu HP Laboratories HPL-2009-187 Web page text extraction,...
Parag Mulendra Joshi, Sam Liu
ICDAR
2009
IEEE
14 years 5 days ago
A Modified Adaptive Logical Level Binarization Technique for Historical Document Images
In this paper, a new document image binarization technique is presented, as an improved version of the state-of-the-art adaptive logical level technique (ALLT). The original ALLT ...
Konstantinos Ntirogiannis, Basilios Gatos, Ioannis...
CIKM
2005
Springer
13 years 11 months ago
Structure-based query-specific document summarization
Summarization of text documents is increasingly important with the amount of data available on the Internet. The large majority of current approaches view documents as linear sequ...
Ramakrishna Varadarajan, Vagelis Hristidis