In this paper, we explore statistical language modelling for a speech-enabled MP3 player application by generating a corpus from the interpretation grammar written for the applica...
Data sparseness is an ever dominating problem in automatic emotion recognition. Using artificially generated speech for training or adapting models could potentially ease this: t...
Several studies have been dedicated to the analysis and modeling of AM–FM modulations in speech and different algorithms have been proposed for the exploitation of modulations i...
This paper describes a new toolkit - SCARF - for doing speech recognition with segmental conditional random fields. It is designed to allow for the integration of numerous, possib...
Although research has previously been done on multilingual speech recognition, it has been found to be very difficult to improve over separately trained systems. The usual approa...
Lukas Burget, Petr Schwarz, Mohit Agarwal, Pinar A...