Sciweavers

341 search results - page 38 / 69
» Improving Annotations in Digital Documents
Sort
View
JIIS
2002
168views more  JIIS 2002»
14 years 9 months ago
Hidden Markov Models for Text Categorization in Multi-Page Documents
In the traditional setting, text categorization is formulated as a concept learning problem where each instance is a single isolated document. However, this perspective is not appr...
Paolo Frasconi, Giovanni Soda, Alessandro Vullo
PREMI
2007
Springer
15 years 3 months ago
Self Adaptable Recognizer for Document Image Collections
Abstract. This paper presents an architecture that enables the recognizer to learn incrementally and, thereby adapt to document image collections for performance improvement. We ar...
Million Meshesha, C. V. Jawahar
82
Voted
WWW
2007
ACM
15 years 10 months ago
Automatic searching of tables in digital libraries
Tables are ubiquitous. Unfortunately, no search engine supports table search. In this paper, we propose a novel table specific searching engine, TableSeer, to facilitate the table...
Ying Liu, Kun Bai, Prasenjit Mitra, C. Lee Giles
JCDL
2005
ACM
100views Education» more  JCDL 2005»
15 years 3 months ago
Automatic extraction of titles from general documents using machine learning
In this paper, we propose a machine learning approach to title extraction from general documents. By general documents, we mean documents that can belong to any one of a number of...
Yunhua Hu, Hang Li, Yunbo Cao, Dmitriy Meyerzon, Q...
DAS
2010
Springer
15 years 2 months ago
Document analysis issues in reading optical scan ballots
Optical scan voting is considered by many to be the most trustworthy option for conducting elections because it provides an independently verifiable record of each voter’s inte...
Daniel P. Lopresti, George Nagy, Elisa H. Barney S...