Music consists of both local and long-term temporal information. However, for a genre classification task, most text-categorization-based approaches capture only local temp...
This paper proposes and compares four cross-lingual and bilingual automatic speech recognition techniques under the constraints of limited memory size and CPU speed. The first thr...
To improve Spoken Language Understanding (SLU) performance, a combination of different automatic speech recognition (ASR) systems is proposed. State a-posteriori...
This paper investigates the use of Gaussian Mixture Model (GMM)-based vowel duration features for automated assessment of non-native speech. Two different types of models were compared...
While much work has been dedicated to exploring how best to incorporate the Ideal Binary Mask (IBM) in automatic speech recognition (ASR) for noisy signals, we demonstrate that th...