Sciweavers

45 search results - page 3 / 9
» WAPUSK20 - A Database for Robust Audiovisual Speech Recognit...
Sort
View
NOLISP
2005
Springer
15 years 3 months ago
Third-Order Moments of Filtered Speech Signals for Robust Speech Recognition
Novel speech features calculated from third-order statistics of subband-filtered speech signals are introduced and studied for robust speech recognition. These features have the p...
Kevin M. Indrebo, Richard J. Povinelli, Michael T....
77
Voted
ICIP
2003
IEEE
15 years 11 months ago
On automatic annotation of meeting databases
In this paper, we discuss meetings as an application domain for multimedia content analysis. Meeting databases are a rich data source suitable for a variety of audio, visual and m...
Daniel Gatica-Perez, Hervé Bourlard, Iain M...
ACII
2005
Springer
15 years 3 months ago
Pronunciation Learning and Foreign Accent Reduction by an Audiovisual Feedback System
Abstract. Global integration and migration force people to learn additional languages. With respect to major languages, the acquisition is already initiated at primary school but a...
Oliver Jokisch, Uwe Koloska, Diane Hirschfeld, R&u...
CVPR
2012
IEEE
13 years 1 days ago
Robust Boltzmann Machines for recognition and denoising
While Boltzmann Machines have been successful at unsupervised learning and density modeling of images and speech data, they can be very sensitive to noise in the data. In this pap...
Yichuan Tang, Ruslan Salakhutdinov, Geoffrey E. Hi...
NOLISP
2007
Springer
15 years 3 months ago
A Hybrid Genetic-Neural Front-End Extension for Robust Speech Recognition over Telephone Lines
This paper presents a hybrid technique combining the Karhonen-Loeve Transform (KLT), the Multilayer Perceptron (MLP) and Genetic Algorithms (GAs) to obtain less-variant Mel-freque...
Sid-Ahmed Selouani, Habib Hamam, Douglas D. O'Shau...