This paper presents a bottom-up approach that combines audio and video to simultaneously locate individual speakers in the video (2-D source localization) and segment their speech ...
The LIMSI RT-07S speaker diarization system for the conference and lecture meetings is presented in this paper. This system builds upon the RT06S diarization system designed for le...
Xuan Zhu, Claude Barras, Lori Lamel, Jean-Luc Gauv...
Speaker diarization of meetings recorded with Multiple Distant Microphones makes extensive use of multiple feature streams like MFCC and Time Delay of Arrivals (TDOA). Typically t...
Deepu Vijayasenan, Fabio Valente, Petr Motlí...
The following article shows how a state-of-the-art speaker diarization system can be improved by combining traditional short-term features (MFCCs) with prosodic and other longterm...
Gerald Friedland, Oriol Vinyals, C. Yan Huang, Chr...
Abstract. This paper presents the LIA submission to the speaker diarization task of the 2007 NIST Rich Transcription (RT’07) evaluation campaign. We report a system optimised for...