Sciweavers

INTERSPEECH
2010
12 years 11 months ago
Phonetic subspace mixture model for speaker diarization
This paper presents an improved distance measure for speaker clustering in speaker diarization systems. The proposed phonetic subspace mixture (PSM) model introduces phonetic info...
I-Fan Chen, Shih-Sian Cheng, Hsin-Min Wang
ICASSP
2011
IEEE
12 years 8 months ago
An investigation of subspace modeling for phonetic and speaker variability in automatic speech recognition
This paper investigates the impact of subspace-based techniques for acoustic modeling in automatic speech recognition (ASR). There are many well-known approaches to subspace based...
Richard C. Rose, Shou-Chun Yin, Yun Tang
TCSV
2008
13 years 4 months ago
Exploring Co-Occurence Between Speech and Body Movement for Audio-Guided Video Localization
This paper presents a bottom-up approach that combines audio and video to simultaneously locate individual speakers in the video (2-D source localization) and segment their speech ...
H. Vajaria, S. Sarkar, R. Kasturi
CLEAR
2007
Springer
13 years 11 months ago
The IBM RT07 Evaluation Systems for Speaker Diarization on Lecture Meetings
We present the IBM systems for the Rich Transcription 2007 (RT07) speaker diarization evaluation task on lecture meeting data. We first overview our baseline system that was devel...
Jing Huang, Etienne Marcheret, Karthik Visweswaria...
MLMI
2005
Springer
13 years 10 months ago
The TNO Speaker Diarization System for NIST RT05s Meeting Data
The TNO speaker diarization system is based on a standard BIC segmentation and clustering algorithm. Since for the NIST Rich Transcription speaker diarization evaluation m...
David van Leeuwen