Sciweavers

300 search results - page 35 / 60
» The COST-277 Speech Database
Sort
View
ICIP
2003
IEEE
16 years 3 months ago
On automatic annotation of meeting databases
In this paper, we discuss meetings as an application domain for multimedia content analysis. Meeting databases are a rich data source suitable for a variety of audio, visual and m...
Daniel Gatica-Perez, Hervé Bourlard, Iain M...
87
Voted
ICIP
2000
IEEE
16 years 3 months ago
Normalized Training for HMM-Based Visual Speech Recognition
This paper presents an approach to estimating the parameters of continuous density HMMs for visual speech recognition. One of the key issues of image-based visual speech recogniti...
Yoshihiko Nankaku, Keiichi Tokuda, Tadashi Kitamur...
117
Voted
ICASSP
2009
IEEE
15 years 8 months ago
Speech emotion recognition via a max-margin framework incorporating a loss function based on the Watson and Tellegen's emotion m
This paper considers a method for speech emotion recognition by a max-margin framework incorporating a loss function based on a well-known model called the Watson and Tellegen’s...
Sungrack Yun, Chang D. Yoo
NOLISP
2007
Springer
15 years 8 months ago
A Hybrid Genetic-Neural Front-End Extension for Robust Speech Recognition over Telephone Lines
This paper presents a hybrid technique combining the Karhonen-Loeve Transform (KLT), the Multilayer Perceptron (MLP) and Genetic Algorithms (GAs) to obtain less-variant Mel-freque...
Sid-Ahmed Selouani, Habib Hamam, Douglas D. O'Shau...
TSD
2007
Springer
15 years 8 months ago
Festival-si: A Sinhala Text-to-Speech System
Abstract. This paper brings together the development of the first Text-toSpeech (TTS) system for Sinhala using the Festival framework and practical applications of it. Construction...
Ruvan Weerasinghe, Asanka Wasala, Viraj Welgama, K...