In this work, we model speech samples with the generalized Gamma distribution and evaluate the efficiency of such modelling for voice activity detection. Using a computationally i...
Over the past decade, the notion of multi-modal access to technology has moved from the realm of science fiction to reality. It is no longer unthinkable to communicate with a machi...
The ability to determine what day-to-day activity (such as cooking pasta, taking a pill, or watching a video) a person is performing is of interest in many application domains. A ...
Mike Perkowitz, Matthai Philipose, Kenneth P. Fish...
We present an implemented model for speech recognition in natural environments which relies on contextual information about salient entities to prime utterance recognition. The hyp...
We present a proposal for an Automatic Speech Recognizer based on a “multigranular” model. The leading hypothesis is that the speech signal contains information distributed on more...
Francesco Cutugno, Gianpaolo Coro, Massimo Petrill...