Sciweavers

42 search results - page 6 / 9
» Unsupervised style classification of document page images
Sort
View
ITCC
2005
IEEE
15 years 3 months ago
Elimination of Redundant Information for Web Data Mining
These days, billions of Web pages are created with HTML or other markup languages. They only have a few uniform structures and contain various authoring styles compared to traditi...
Shakirah Mohd Taib, Soon-ja Yeom, Byeong Ho Kang
PREMI
2007
Springer
15 years 3 months ago
Self Adaptable Recognizer for Document Image Collections
Abstract. This paper presents an architecture that enables the recognizer to learn incrementally and, thereby adapt to document image collections for performance improvement. We ar...
Million Meshesha, C. V. Jawahar
ICMCS
2006
IEEE
189views Multimedia» more  ICMCS 2006»
15 years 3 months ago
Multiscale Edge-Based Text Extraction from Complex Images
Text that appears in images contains important and useful information. Detection and extraction of text in images have been used in many applications. In this paper, we propose a ...
Xiaoqing Liu, Jagath Samarabandu
ICML
2009
IEEE
15 years 10 months ago
Learning non-redundant codebooks for classifying complex objects
Codebook-based representations are widely employed in the classification of complex objects such as images and documents. Most previous codebook-based methods construct a single c...
Wei Zhang, Akshat Surve, Xiaoli Fern, Thomas G. Di...
ICDAR
2005
IEEE
15 years 3 months ago
Text/Graphic labelling of Ancient Printed Documents
This paper presents a text/graphic labelling for ancient printed documents. Our approach is based on the extraction and the quantification of the various orientations that are pre...
Nicholas Journet, Véronique Eglin, Jean-Yve...