The most expressive way humans display emotions is through facial expressions. In this work we report on several advances we have made in building a system for classification of f...
Ira Cohen, Nicu Sebe, Yafei Sun, Michael S. Lew, T...
Discriminative training has been a leading factor in improving automatic speech recognition (ASR) performance over the last decade. The traditional discriminative training, howev...
This paper reports a comparison of user performance (time and accuracy) when controlling the popular arcade game Tetris using speech recognition or non-speech (humming) input tec...
Adam J. Sporka, Sri Hastuti Kurniawan, Murni Mahmu...
In this paper, we propose a joint optimal method for automatic speech recognition (ASR) and ideal binary mask (IBM) estimation in transformed into the cepstral domain through a ne...
Lae-Hoon Kim, Kyung-Tae Kim, Mark Hasegawa-Johnson
In this paper, we propose a novel feature space adaptation technique to improve the robustness of speech recognition in noisy environments. Histogram equalization (HEQ) is an effe...