Sciweavers

1423 search results - page 71 / 285
» Polyphase speech recognition
Sort
View
108
Voted
ICASSP
2009
IEEE
15 years 10 months ago
A study on multilingual acoustic modeling for large vocabulary ASR
We study key issues related to multilingual acoustic modeling for automatic speech recognition (ASR) through a series of large-scale ASR experiments. Our study explores shared str...
Hui Lin, Li Deng, Dong Yu, Yifan Gong, Alex Acero,...
187
Voted
ICASSP
2008
IEEE
15 years 10 months ago
Unsupervised learning of auditory filter banks using non-negative matrix factorisation
Non-negative matrix factorisation (NMF) is an unsupervised learning technique that decomposes a non-negative data matrix into a product of two lower rank non-negative matrices. Th...
Alexander Bertrand, Kris Demuynck, Veronique Stout...
125
Voted
MIR
2003
ACM
161views Multimedia» more  MIR 2003»
15 years 9 months ago
Highlight scene extraction in real time from baseball live video
This paper proposes a method to automatically extract highlight scenes from sports (baseball) live video in real time and to allow users to retrieve them. For this purpose, sophis...
Yasuo Ariki, Masahito Kumano, Kiyoshi Tsukada
138
Voted
AAAI
2008
15 years 6 months ago
Speech-enabled Card Games for Language Learners
This paper debuts a novel application of speech recognition to foreign language learning. We present a generic framework for developing user-customizable card games designed to ai...
Ian McGraw, Stephanie Seneff
134
Voted
ICASSP
2009
IEEE
15 years 10 months ago
Speech emotion recognition via a max-margin framework incorporating a loss function based on the Watson and Tellegen's emotion m
This paper considers a method for speech emotion recognition by a max-margin framework incorporating a loss function based on a well-known model called the Watson and Tellegen’s...
Sungrack Yun, Chang D. Yoo