Speech recognition technology suffers from a lack of robustness which limits its usability for fully automated speech-to-text transcription, and manual correction is generally req...
This paper investigates whether high-quality annotations for tasks involving semantic disambiguation can be obtained without a major investment in time or expense. We examine the ...
Sara Rosenthal, William Lipovsky, Kathleen McKeown...
This paper presents the large audiovisual laughter database recorded as part of the AVLaughterCycle project held during the eNTERFACE'09 Workshop in Genova. 24 subjects parti...
This paper presented an overview of Chinese bi-character words' morphological types, and proposed a set of features for machine learning approaches to predict these types bas...
This paper describes an open source voice creation toolkit that supports the creation of unit selection and HMM-based voices, for the MARY (Modular Architecture for Research on sp...