In an attempt to improve models of human perception, the recognition of phonemes in nonsense utterances was predicted with automatic speech recognition (ASR) in order to analyze i...
This paper presents ongoing research leveraging forensic methods for automatic speaker recognition. Some of the methods forensic scientists employ include identifying speaker dist...
Kyu J. Han, Mohamed Kamal Omar, Jason W. Pelecanos...
This paper presents an efficient algorithm for gesture detection in lecture videos by combining visual cues, speech, and electronic slides. Besides accuracy, response time is also cons...
This paper proposes a new framework of speech synthesis based on the Bayesian approach. The Bayesian method is a statistical technique for estimating reliable predictive distribut...
Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Takas...
We present a generic framework for enhanced active multi-sensing. We propose a coopetitive interaction approach, which combines the salient features of cooperation and competition ...
Vivek K. Singh, Pradeep K. Atrey, Mohan S. Kankanh...