Sciweavers

ICASSP
2011
IEEE
12 years 8 months ago
Fast speaker diarization based on binary keys
Splitting a speech signal into speakers is the main goal of a speaker diarization system, which has become an important building block in many speech processing algorithms. Curren...
Xavier Anguera Miró, Jean-François B...
ICASSP
2011
IEEE
12 years 8 months ago
Speaker diarization of heterogeneous web video files: A preliminary study
In the last ten years, internet as well as its applications changed significantly, mainly thanks to the raising of available personal resources. Concerning multimedia, the most i...
Pierre Clément, Thierry Bazillon, Corinne F...
TASLP
2011
12 years 11 months ago
Estimating Dominance in Multi-Party Meetings Using Speaker Diarization
—With the increase in cheap commercially available sensors, recording meetings is becoming an increasingly practical option. With this trend comes the need to summarize the recor...
Hayley Hung, Yan Huang, Gerald Friedland, Daniel G...
TCSV
2008
125views more  TCSV 2008»
13 years 3 months ago
Exploring Co-Occurence Between Speech and Body Movement for Audio-Guided Video Localization
This paper presents a bottom-up approach that combines audio and video to simultaneously locate individual speakers in the video (2-D source localization) and segment their speech ...
H. Vajaria, S. Sarkar, R. Kasturi
CLEAR
2007
Springer
175views Biometrics» more  CLEAR 2007»
13 years 10 months ago
The IBM RT07 Evaluation Systems for Speaker Diarization on Lecture Meetings
We present the IBM systems for the Rich Transcription 2007 (RT07) speaker diarization evaluation task on lecture meeting data. We first overview our baseline system that was devel...
Jing Huang, Etienne Marcheret, Karthik Visweswaria...
ICASSP
2008
IEEE
13 years 11 months ago
Speaker diarization of French broadcast news
We report results on speaker diarization of French broadcast news and talk shows on current affairs. This speaker diarization process is a multistage segmentation and clustering s...
Vishwa Gupta, Gilles Boulianne, Patrick Kenny, Pie...
MM
2009
ACM
169views Multimedia» more  MM 2009»
13 years 11 months ago
Visual speaker localization aided by acoustic models
The following paper presents a novel audio-visual approach for unsupervised speaker locationing. Using recordings from a single, low-resolution room overview camera and a single f...
Gerald Friedland, Chuohao Yeo, Hayley Hung
ICASSP
2009
IEEE
13 years 11 months ago
Fusing short term and long term features for improved speaker diarization
The following article shows how a state-of-the-art speaker diarization system can be improved by combining traditional short-term features (MFCCs) with prosodic and other longterm...
Gerald Friedland, Oriol Vinyals, C. Yan Huang, Chr...
ICASSP
2009
IEEE
13 years 11 months ago
Multi-modal speaker diarization of real-world meetings using compressed-domain video features
Speaker diarization is originally defined as the task of determining “who spoke when” given an audio track and no other prior knowledge of any kind. The following article sho...
Gerald Friedland, Hayley Hung, Chuohao Yeo