Document Layout Substructure Discovery

Abstract. In this paper we present DoLSuD (Document Layout Substructure Discovery), a system for the automatic discovery of relevant substructures in a document layout. DoLSuD extracts, analyzes, and describes the visual content of structured digital documents, such as catalogs, in order to discover repeating and distinctive substructures in the layout and to establish relations between textual and image content. Establishing meaningful links between images and text paragraphs from the catalog structure allows us to exploit the semantic annotation of the text to annotate the images, and to integrate multimedia processing with Semantic Web technologies. The paper presents the system along with experimental results and the web-based service that uses the analysis results.
Claudio Andreatta
Added 09 Jun 2010
Updated 09 Jun 2010
Type Conference
Year 2007
Where SAMT
Authors Claudio Andreatta