CLEAR
2007
Springer

Progress in the AMIDA Speaker Diarization System for Meeting Data

In this paper we describe the AMIDA speaker diarization system as it was submitted to the NIST Rich Transcription evaluation 2007 for conference room data. This is done in the context of the history of this system and other speaker diarization systems. One of the goals of our system is to have as few tunable parameters as possible while maintaining performance. The system consists of a BIC segmentation/clustering initialization, followed by a combined re-segmentation/cluster merging algorithm. The Diarization Error Rate (DER) of our best system is 17.0%, accounting for overlapping speech. However, we find that slightly altering the Speech Activity Detection models has a large impact on the speaker DER.
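For readers unfamiliar with the metric, the DER quoted above is conventionally the sum of missed speech, false-alarm speech, and speaker-confusion time, divided by the total scored speaker time. The sketch below illustrates that arithmetic only. The function name, argument names, and input durations are hypothetical and are not taken from the paper, whose exact scoring configuration is not given in the abstract.

```python
def diarization_error_rate(missed, false_alarm, speaker_error, total_speech):
    """Return DER as a fraction of total scored speaker time.

    All arguments are durations in seconds. When overlapping speech is
    scored, total_speech counts each simultaneous speaker separately.
    """
    if total_speech <= 0:
        raise ValueError("total_speech must be positive")
    return (missed + false_alarm + speaker_error) / total_speech


# Hypothetical durations for illustration only; not results from the paper.
der = diarization_error_rate(
    missed=12.0, false_alarm=8.0, speaker_error=30.0, total_speech=400.0
)
print(f"DER = {der:.1%}")
```

Because missed speech and false alarms both come from Speech Activity Detection, a change in those models shifts two of the three terms directly, which is consistent with the sensitivity the abstract reports.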
David A. van Leeuwen, Matej Konecný
Added 07 Jun 2010
Updated 07 Jun 2010
Type Conference
Year 2007
Where CLEAR
Authors David A. van Leeuwen, Matej Konecný