Near-duplicate keyframes (NDK) play a unique role in large-scale video search, news topic detection and tracking. In this paper, we propose a novel NDK retrieval approach by explo...
We describe a scheme to combine the results of audio and face identification for multimedia indexing and retrieval. Audio analysis consists of speech and speaker recognition deri...
Mahesh Viswanathan, Homayoon S. M. Beigi, Alain Tr...
As part of the general growth and diversification of media in different modalities, the presence of information in the form of human speech in the world-wide body of digital conte...
Combining different and complementary object models promises to increase the robustness and generality of today’s computer vision algorithms. This paper introduces a new method ...
Combining retrieval results from multiple modalities plays a crucial role in video retrieval systems, especially for automatic video retrieval systems without any user feedback a...