This paper shows an approach for converting bitmap images of text glyphs into a vector format which is suitable for being embedded in XML representations of digitized documents. T...
Stefan Pletschacher, Marcel Eckert, Arved C. H&uum...
With an aim to extract the structural information from the table of contents (TOC) to help develop digital document library the requirement of identifying/segmenting the TOC page ...
S. Mandal, S. P. Chowdhury, Amit Kumar Das, Bhabat...
The Informedia Digital Video Library contains over a thousand hours of video, consuming over a of disk space. This paper summarizes the multimedia abstractions used to represent th...
In order to evaluate the performance of information retrieval and extraction algorithms, we need test collections. A test collection consists of a set of documents, a clearly form...
The initiative of standardization of MPEG Query Format (MPQF) has refueled the research around the definition of a unified query language for digital content. The goal is to provi...