Traditional MDCT-based perceptual audio coding schemes employ mid/side and intensity stereo techniques to allow efficient joint coding of the two channels of a stereophonic signal...
Christian R. Helmrich, Pontus Carlsson, Sascha Dis...
In supervector UBM/GMM paradigm, each acoustic file is represented by the mean parameters of a GMM model. This supervector space is used as a data representation space, which has...
The use of visual information derived from accurate lip extraction, can provide features invariant to noise perturbation for speech recognition systems and can be also used in a w...
The maximum a posteriori (MAP) criterion is broadly used in the statistical model-based voice activity detection (VAD) approaches. In the conventional MAP criterion, however, the ...
This paper presents a method for mitigating the impact of reverberation upon speaker identification. In particular, two reverberation mitigation techniques were studied: one that ...
Catherine M. Vannicola, Brett Y. Smolenski, Brando...