Sciweavers

Search results for "Marking Text Documents" (2827 results)
ISIWI
2000
15 years 7 months ago
Automatic Document Classification - A thorough Evaluation of various Methods
(Automatic) document classification is generally defined as content-based assignment of one or more predefined categories to documents. Usually, machine learning, statistical patt...
Christoph Goller, J. Löning, T. Will, W. Wolf...
MVA
1992
15 years 7 months ago
An OCR System for Printed Documents
This paper describes the general structure of a fully automated document analysis system for printed documents. The system is based on a character preclassification stage which red...
Frank Lebourgeois, Jean-Luc Henry, Hubert Emptoz
IS
2006
15 years 6 months ago
Negations and document length in logical retrieval
Terms that are not explicitly mentioned in the text of a document often receive a minor role in current retrieval systems. In this work we connect the management of such...
David E. Losada, Alvaro Barreiro
WWW
2006
ACM
16 years 6 months ago
Using symbolic objects to cluster web documents
Web clustering is useful for several activities in the WWW, from automatically building web directories to improving retrieval performance. Nevertheless, due to the huge size of the...
Esteban Meneses, Oldemar Rodríguez-Rojas
ICDAR
2009
IEEE
16 years 24 days ago
Learning Rich Hidden Markov Models in Document Analysis: Table Location
Hidden Markov Models (HMM) are probabilistic graphical models for interdependent classification. In this paper we experiment with different ways of combining the components of an ...
Ana Costa e Silva