Sciweavers

180 search results - page 3 / 36
» Iterated Document Content Classification
Sort
View
DL
1994
Springer
191views Digital Library» more  DL 1994»
13 years 9 months ago
Corpus Linguistics for Establishing The Natural Language Content of Digital Library Documents
Digital Libraries will hold huge amounts of text and other forms of information. For the collections to be maximally useful, they must be highly organized with useful indexes and ...
Robert P. Futrelle, Xiaolan Zhang 0002, Yumiko Sek...
SKG
2006
IEEE
13 years 11 months ago
A Computing Model for Concept Fusing and Document Classification
Effective document classification is a long-pursued goal in knowledge management. This paper proposes a novel hybrid approach of semantic representation and statistical measuremen...
Nan Zhang, Chao He
ICDAR
2003
IEEE
13 years 11 months ago
Document page similarity based on layout visual saliency: Application to query by example and document classification
In this paper we propose to define a measure of visual similarity to compare different pages in a corpus. This measure is based on the analysis of the visual layout saliency of th...
Véronique Eglin, Stéphane Bres
ICDAR
2009
IEEE
13 years 3 months ago
Graph b-Coloring for Automatic Recognition of Documents
In order to reduce the rejection rate of our automatic reading system, we propose to pre-classify the business documents by introducing an Automatic Recognition of Documents stage...
Djamel Gaceb, Véronique Eglin, Frank Lebour...
DRR
2008
13 years 7 months ago
Segmentation-based retrieval of document images from diverse collections
We describe a methodology for retrieving document images from large extremely diverse collections. First we perform content extraction, that is the location and measurement of reg...
Michael A. Moll, Henry S. Baird