TSD
15 years 11 months ago
2004 Springer
In this paper we report on the acquisition and content of a new database intended for developing audio-visual speech recognition systems. This database supports a speaker dependen...
TSD
15 years 11 months ago
2004 Springer
This paper presents a new unit selection process for Very Low Bit Rate speech encoding around 500 bits/sec. The encoding is based on speech recognition and speech synthesis technol...
TSD
15 years 11 months ago
2004 Springer
A formal prosody model is proposed together with its application in a text-to-speech system. The model is based on a generative of abstract prosodic functionally involved...
TSD
15 years 11 months ago
2004 Springer
Two sets of linguistic features are developed: the first to estimate whether a single step in a dialogue between a human being and a machine is successful. The second set to...
TSD
15 years 11 months ago
2004 Springer
In this paper a speaker adaptation methodology is proposed, which first automatically determines a number of speaker clusters in the training material, then estimates the paramete...