We describe an automated method to assess the expressiveness of children’s oral reading by measuring how well its prosodic contours correlate in pitch, intensity, pauses, and wor...
The reverberation time is one of the most prominent acoustic characteristics of an enclosure. Its value can be used to predict speech intelligibility, and is used by speech enhanc...
Jimi Y. C. Wen, Emanuel A. P. Habets, Patrick A. N...
In this study, a system that discriminates laughter from speech by modelling the relationship between audio and visual features is presented. The underlying assumption is that thi...
Spelling speech recognition can be applied for several purposes including enhancement of speech recognition systems and implementation of name retrieval systems. This paper presen...
Higher quality synthesized speech is required for widespread use of text-to-speech (TTS) technology, and prosodic pattern is the key feature that makes synthetic speech sound unna...