Orthogonal information present in the video signal associated with the audio helps in improving the accuracy of a speech recognition system. Audio-visual speech recognition involv...
Tanveer A. Faruquie, Abhik Majumdar, Nitendra Rajp...
Speech recognition in many morphologically rich languages suffers from a very high out-of-vocabulary (OOV) ratio. Earlier work has shown that vocabulary decomposition methods can ...
In addressing the problem of achieving high-accuracy real-time speech recognition systems, we focus on recognizing speech from ARPA's20,000-word Wall Street Journal (WSJ) tas...
Hy Murveit, Peter Monaco, Vassilios Digalakis, Joh...
Improved acoustic modeling can significantly decrease the error rate in large-vocabulary speech recognition. Our approach to the problem is twofold. We first propose a scheme that...
We study key issues related to multilingual acoustic modeling for automatic speech recognition (ASR) through a series of large-scale ASR experiments. Our study explores shared str...
Hui Lin, Li Deng, Dong Yu, Yifan Gong, Alex Acero,...