Sciweavers

7 search results - page 1 / 2
» Fusing short term and long term features for improved speake...
Sort
View
ICASSP
2009
IEEE
13 years 11 months ago
Fusing short term and long term features for improved speaker diarization
The following article shows how a state-of-the-art speaker diarization system can be improved by combining traditional short-term features (MFCCs) with prosodic and other longterm...
Gerald Friedland, Oriol Vinyals, C. Yan Huang, Chr...
ICASSP
2011
IEEE
12 years 8 months ago
Audiovisual classification of vocal outbursts in human conversation using Long-Short-Term Memory networks
We investigate classification of non-linguistic vocalisations with a novel audiovisual approach and Long Short-Term Memory (LSTM) Recurrent Neural Networks as highly successful d...
Florian Eyben, Stavros Petridis, Björn Schull...
TASLP
2008
143views more  TASLP 2008»
13 years 4 months ago
Strategies to Improve the Robustness of Agglomerative Hierarchical Clustering Under Data Source Variation for Speaker Diarizatio
Many current state-of-the-art speaker diarization systems exploit agglomerative hierarchical clustering (AHC) as their speaker clustering strategy, due to its simple processing str...
K. J. Han, S. Kim, S. S. Narayanan
TCSV
2008
125views more  TCSV 2008»
13 years 3 months ago
Exploring Co-Occurence Between Speech and Body Movement for Audio-Guided Video Localization
This paper presents a bottom-up approach that combines audio and video to simultaneously locate individual speakers in the video (2-D source localization) and segment their speech ...
H. Vajaria, S. Sarkar, R. Kasturi
NIPS
2007
13 years 5 months ago
Discriminative Keyword Selection Using Support Vector Machines
Many tasks in speech processing involve classification of long term characteristics of a speech segment such as language, speaker, dialect, or topic. A natural technique for dete...
William M. Campbell, Fred S. Richardson