Many distributed real-time systems face the challenge of dynamically maximizing system utility and meeting stringent resource constraints in response to fluctuations in system wo...
In this paper, we describe a fully automatic video retrieval prototype system that uses an image or a video sequence of an identity of interest as the probe. The system is based on fac...
Automatic speaker localization is an important task in several applications such as acoustic scene analysis, hands-free videoconferencing or speech enhancement. Tracking speakers in mult...
Carlos Segura, Alberto Abad, Javier Hernando, Clim...
This paper presents the algorithm and evaluation results of a face detection and tracking system. A tree-structured multi-view face detector trained by Vector Boosting is used as t...
Abstract. In this paper, we present the ICSI speaker diarization system. This system was used in the 2007 National Institute of Standards and Technology (NIST) Rich Transcription e...