Sciweavers

» Polyphase speech recognition
ACII
2007
Springer
15 years 9 months ago
Frame vs. Turn-Level: Emotion Recognition from Speech Considering Static and Dynamic Processing
Opposing the predominant turn-wise statistics of acoustic Low-Level Descriptors followed by static classification, we re-investigate dynamic modeling directly on the frame...
Bogdan Vlasenko, Björn Schuller, Andreas Wend...
ICMI
2005
Springer
15 years 9 months ago
Inferring body pose using speech content
Untethered multimodal interfaces are more attractive than tethered ones because they are more natural and expressive for interaction. Such interfaces usually require robust vision...
Sy Bor Wang, David Demirdjian
ICASSP
2009
IEEE
15 years 10 months ago
Data-driven lexicon expansion for Mandarin broadcast news and conversation speech recognition
We present a data-driven framework for expanding the lexicon to improve Mandarin broadcast news and conversation speech recognition. The lexicon expansion includes the generation ...
Xin Lei, Wen Wang, Andreas Stolcke
ICASSP
2009
IEEE
15 years 10 months ago
Training and adapting MLP features for Arabic speech recognition
Features derived from Multi-Layer Perceptrons (MLPs) are becoming increasingly popular for speech recognition. This paper describes various schemes for applying these features to ...
J. Park, Frank Diehl, M. J. F. Gales, Marcus Tomal...
ICASSP
2009
IEEE
15 years 10 months ago
Audio segmentation for speech recognition using segment features
Audio segmentation is an essential preprocessing step in several audio processing applications, with a significant impact on, for example, speech recognition performance. We introduce a no...
David Rybach, Christian Gollan, Ralf Schlüter...