Sciweavers

3705 search results - page 179 / 741
» Building Documentation Generators
Sort
View
164
Voted
ICDAR
1995
IEEE
15 years 7 months ago
Ground-truthing and benchmarking document page segmentation
We describe a new approach for evaluating page segmentation algorithms. Unlike techniques that rely on OCR output, our method is region-based: the segmentation output, described a...
Berrin A. Yanikoglu, Luc Vincent
121
Voted
ANLP
1994
105views more  ANLP 1994»
15 years 5 months ago
Modeling Content Identification from Document Images
A new technique to locate content-representing words for a given document image using representation of character shapes is described. A character shape code representation define...
Takehiro Nakayama
131
Voted
DKE
2007
132views more  DKE 2007»
15 years 3 months ago
Automated ontology construction for unstructured text documents
Ontology is playing an increasingly important role in knowledge management and the Semantic Web. This study presents a novel episode-based ontology construction mechanism to extra...
Chang-Shing Lee, Yuan-Fang Kao, Yau-Hwang Kuo, Mei...
163
Voted
JIIS
2002
168views more  JIIS 2002»
15 years 3 months ago
Hidden Markov Models for Text Categorization in Multi-Page Documents
In the traditional setting, text categorization is formulated as a concept learning problem where each instance is a single isolated document. However, this perspective is not appr...
Paolo Frasconi, Giovanni Soda, Alessandro Vullo
123
Voted
AND
2009
15 years 1 months ago
Edge noise in document images
A degradation model that describes many image degradations produced by desktop scanning is used to study the edge noise that is present in bilevel document images. The standard de...
Craig McGillivary, Chris Hale, Elisa H. Barney Smi...