Neural networks are a useful alternative to Gaussian mixture models for acoustic modeling; however, training multilayer networks involves a difficult, nonconvex optimization that...
In this paper, we explore the use of a Gaussian posteriorgram based representation for unsupervised discovery of speech patterns. Compared with our previous work, the new approach...
We propose the use of the line spectral frequency (LSF) features for emotion recognition from speech, which have not been been previously employed for emotion recognition to the b...
The knowledge of the target speech presence probability in a mixture of signals captured by a speech communication system is of paramount importance in several applications includi...
Mehrez Souden, Jingdong Chen, Jacob Benesty, Sofi&...
A new mapping algorithm for speech recognition relates the features of simultaneous recordings of clean and noisy speech. The model is a piecewise nonfinear transformation appfied...