We present tensor scale descriptor (TSD)— a shape descriptor for content-based image retrieval, registration, and analysis. TSD exploits the notion of local structure thickness,...
Paulo A. V. Miranda, Ricardo da Silva Torres, Alex...
Human listeners use lexical stress for word segmentation and disambiguation. We look into using lexical stress for speech recognition by examining a Dutch-language corpus. We propo...
Abstract. We report work on the mapping between the speech signal and articulatory trajectories from the MOCHA database. Contrasting previous works that used Neural Networks for th...
Abstract. A formal prosody description framework is introduced together with its relation to language semantics and NLP. The framework incorporates deep prosodic structures based o...
The aim of the work described in this paper is to extend the EPFL dialogue platform with multimodal capabilities. Based on our experience with the EPFL Rapid Dialogue Prototyping M...