Sciweavers

78 search results - page 14 / 16
» The Analysis of Voice Quality in Speech Processing
Sort
View
LREC
2010
159views Education» more  LREC 2010»
13 years 7 months ago
Towards Optimal TTS Corpora
Unit selection text-to-speech systems currently produce very natural synthesized phrases by concatenating speech segments from a large database. Recently, increasing demand for de...
Didier Cadic, Cédric Boidin, Christophe d'A...
ICASSP
2011
IEEE
12 years 9 months ago
Continuous F0 in the source-excitation generation for HMM-based TTS: Do we need voiced/unvoiced classification?
Most HMM-based TTS systems use a hard voiced/unvoiced classification to produce a discontinuous F0 signal which is used for the generation of the source-excitation. When a mixed ...
Javier Latorre, Mark J. F. Gales, Sabine Buchholz,...
TASLP
2010
137views more  TASLP 2010»
13 years 4 months ago
High-Pitch Formant Estimation by Exploiting Temporal Change of Pitch
—This paper considers the problem of obtaining an accurate spectral representation of speech formant structure when the voicing source exhibits a high fundamental frequency. Our ...
Tianyu T. Wang, Thomas F. Quatieri
ICASSP
2010
IEEE
13 years 6 months ago
Word confidence calibration using a maximum entropy model with constraints on confidence and word distributions
It is widely known that the quality of confidence measure is critical for speech applications. In this paper, we present our recent work on improving word confidence scores by cal...
Dong Yu, Shizhen Wang, Jinyu Li, Li Deng
LREC
2010
189views Education» more  LREC 2010»
13 years 7 months ago
CASIA-CASSIL: a Chinese Telephone Conversation Corpus in Real Scenarios with Multi-leveled Annotation
CASIA-CASSIL is a large-scale corpus base of Chinese human-human naturally-occurring telephone conversations in restricted domains. The first edition consists of 792 90-second con...
Keyan Zhou, Aijun Li, Zhigang Yin, Chengqing Zong