Sciweavers

344 search results - page 48 / 69
» speech 2011
ICASSP
2011
IEEE
14 years 1 month ago
Correlogram template matching for time-delay estimation
We propose a correlogram-based time delay estimation method using signals modeled as the output of the cochlea, where the low-level signal processing happens in the human auditory...
Bowon Lee, Ton Kalker, Ronald W. Schafer
ICASSP
2011
IEEE
14 years 1 month ago
Automatic Language Identification in music videos with low level audio and visual features
Automatic Language Identification (LID) in music has received significantly less attention than LID in speech. Here, we study the problem of LID in music videos uploaded on YouT...
Vijay Chandrasekhar, Mehmet Emre Sargin, David A. ...
TASLP
2011
14 years 4 months ago
Underdetermined Convolutive Blind Source Separation via Frequency Bin-Wise Clustering and Permutation Alignment
This paper presents a blind source separation method for convolutive mixtures of speech/audio sources. The method can even be applied to an underdetermined case where there are ...
Hiroshi Sawada, Shoko Araki, Shoji Makino
ICASSP
2011
IEEE
14 years 1 month ago
Feature normalization for speaker verification in room reverberation
The performance of a typical speaker verification system degrades significantly in reverberant environments. This degradation is partly due to the conventional feature extractio...
Sriram Ganapathy, Jason W. Pelecanos, Mohamed Kama...
ICASSP
2011
IEEE
14 years 1 month ago
Sentence level emotion recognition based on decisions from subsentence segments
Emotion recognition from speech plays an important role in developing affective and intelligent systems. This study investigates sentence-level emotion recognition. We propose to ...
Je Hun Jeon, Rui Xia, Yang Liu