Multi-label learning is useful in visual object recognition when several objects are present in an image. Conventional approaches implement multi-label learning as a set of binary...
A working face recognition system requires the ability to represent facial images in such a way that permits efficient and accurate processing. The human visual system effectively...
In this paper we describe integrated multimedia processing for Video Scout, a system that segments and indexes TV programs according to their audio, visual, and transcript informa...
Radu S. Jasinschi, Nevenka Dimitrova, Thomas McGee...
In this work we propose an intuitive graphic framework for the effective visualization of MPEG-7 low-level features, in the context of classification and annotation of audio-visu...
Marco Campanella, Riccardo Leonardi, Pierangelo Mi...
Sparse coding which encodes the original signal in a sparse signal space, has shown its state-of-the-art performance in the visual codebook generation and feature quantization pro...