Sciweavers

» Document Analysis
CEAS
2007
Springer
16 years 3 days ago
Hardening Fingerprinting by Context
Near-duplicate detection is not only an important pre and post processing task in Information Retrieval but also an effective spam-detection technique. Among different approache...
Aleksander Kolcz, Abdur Chowdhury
GECCO
2006
Springer
15 years 9 months ago
Characterizing large text corpora using a maximum variation sampling genetic algorithm
An enormous amount of information available via the Internet exists. Much of this data is in the form of text-based documents. These documents cover a variety of topics that are v...
Robert M. Patton, Thomas E. Potok
CAIP
2001
Springer
15 years 10 months ago
A Technique for Segmentation of Gurmukhi Text
This paper describes a technique for text segmentation of machine printed Gurmukhi script documents. Research in the field of segmentation of Gurmukhi script faces major problems m...
G. S. Lehal, Chandan Singh
DAS
2010
Springer
15 years 10 months ago
Analysis of whole-book recognition
Whole-book recognition is a document image analysis strategy that operates on the complete set of a book’s page images, attempting to improve accuracy by automatic unsupervised ...
Pingping Xiu, Henry S. Baird
EXTREME
2004
ACM
15 years 9 months ago
Interpretation Beyond Markup
The meaning conveyed by documents and their markup often goes well beyond what can be inferred from the markup alone. It often depends on context, so that to interpret document ma...
David Dubin, David J. Birnbaum