Abstract. Natural audio-visual interface between human user and machine requires understanding of user’s audio-visual commands. This does not necessarily require full speech and ...
The interaction between human beings and computers will be more natural if computers are able to perceive and respond to human non-verbal communication such as emotions. Although ...
Carlos Busso, Zhigang Deng, Serdar Yildirim, Murta...
A method that exploits an information theoretic framework to extract optimized audio features using video information is presented. A simple measure of mutual information (MI) betw...
This paper presents the validation of the expressive content of an acted corpus produced for its use in speech synthesis. Firstly, objective techniques have been carried out by me...
Ignasi Iriondo Sanz, Santiago Planet, Joan Claudi ...