In this paper, we present a multi-font OCR system for document processing that simultaneously performs both character recognition and font-style detect...
Serena La Manna, Anna Maria Colla, Alessandro Sper...
Due to many unique characteristics of forum data, forum post retrieval is different from traditional document retrieval and web search, raising interesting research questions abou...
The alignment of text line images with text transcript is a crucial step of handwritten document annotation. Handwritten text alignment is prone to errors due to the difficulty of...
In this paper, we present a fast and scalable Bayesian model for improving weakly annotated data, which is typically generated by a (semi-)automated information extraction (IE) ...
In this work, we propose an intuitive graphical framework for the effective visualization of MPEG-7 low-level features, in the context of classification and annotation of audio-visu...
Marco Campanella, Riccardo Leonardi, Pierangelo Mi...