In modern automatic speech recognition systems, it is standard practice to cluster several logical hidden Markov model states into one physical, clustered state. Typically, the cl...
Random projection has been suggested as a means of dimensionality reduction, where the original data are projected onto a subspace using a random matrix. It represents a computati...
Tetsuya Takiguchi, Jeff Bilmes, Mariko Yoshii, Yas...
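As a rough illustration of the random projection idea described in the abstract above, the following sketch projects two vectors onto a lower-dimensional subspace with a random matrix and compares their distance before and after. The Gaussian matrix entries, the dimensions `d` and `k`, and all function names are assumptions made for illustration; the abstract does not say which construction the authors use.

```python
import math
import random


def random_projection_matrix(d, k, seed=0):
    """Return a k x d matrix with i.i.d. Gaussian entries of variance 1/k.

    The Gaussian construction is an assumption; other random matrices
    (for example sparse +/-1 entries) are also used in practice.
    """
    rng = random.Random(seed)
    scale = 1.0 / math.sqrt(k)
    return [[rng.gauss(0.0, 1.0) * scale for _ in range(d)] for _ in range(k)]


def project(R, x):
    """Multiply the k x d matrix R by the length-d vector x."""
    return [sum(r * v for r, v in zip(row, x)) for row in R]


def distance(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))


# Hypothetical sizes: 200-dimensional features reduced to 50 dimensions.
d, k = 200, 50
data_rng = random.Random(1)
x = [data_rng.gauss(0.0, 1.0) for _ in range(d)]
y = [data_rng.gauss(0.0, 1.0) for _ in range(d)]

R = random_projection_matrix(d, k)
px, py = project(R, x), project(R, y)

# With this scaling, pairwise distances are preserved in expectation,
# so the ratio should be close to 1 (it varies with the random draw).
ratio = distance(px, py) / distance(x, y)
print(len(px), round(ratio, 2))
```

The projection costs only a single matrix-vector product per feature vector and needs no training data, which is what makes it computationally cheap compared with data-dependent methods.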
This paper introduces an HMM-based speech synthesis system which uses a new method for the Separation of Vocal-tract and Liljencrants-Fant model plus Noise (SVLN). The glottal sourc...
In this paper, we propose a multi-microphone joint optimal estimation of the direction of arrival (DOA) and the source speech signal through newly introduced EM beamforming. This ...
Lae-Hoon Kim, Mark Hasegawa-Johnson, Gerasimos Pot...
Monaural speech segregation is a very challenging problem which has been studied by many researchers. In this paper, we focus on voiced speech segregation. Different strategies ar...