In addressing the problem of achieving high-accuracy real-time speech recognition systems, we focus on recognizing speech from ARPA's20,000-word Wall Street Journal (WSJ) tas...
Hy Murveit, Peter Monaco, Vassilios Digalakis, Joh...
Recent studies have shown that the perception of natural movements--in the sense of being "humanlike"--depends on both joint and task space characteristics of the movemen...
The new model reduces the impact of local spectral and temporal variability by estimating a finite set of spectral and temporal warping factors which are applied to speech at the f...
Antonio Miguel, Eduardo Lleida, Richard Rose, Luis...
It is well known that frame independence assumption is a fundamental limitation of current HMM based speech recognition systems. By treating each speech frame independently, HMMs ...
Speech recognition is usually based on Hidden Markov Models (HMMs), which represent the temporal dynamics of speech very efficiently, and Gaussian mixture models, which do non-opt...