Sciweavers

ICASSP
2010
IEEE

Model-based dereverberation in the logmelspec domain for robust distant-talking speech recognition

13 years 2 months ago
Model-based dereverberation in the logmelspec domain for robust distant-talking speech recognition
The REMOS (REverberation MOdeling for Speech recognition) concept for reverberation-robust distant-talking speech recognition, introduced in [1] for melspectral features, is extended in this contribution to logarithmic melspectral (logmelspec) features. Based on a combined acoustic model consisting of a hidden Markov model network and a reverberation model, REMOS determines clean-speech and reverberation estimates during recognition by an inner optimization operation. A reformulation of this inner optimization problem for logmelspec features, allowing an efficient solution by nonlinear optimization algorithms, is derived in this paper so that an efficient implementation of REMOS for logmelspec features becomes possible. Connected digit recognition experiments show that the proposed REMOS implementation significantly outperforms reverberantlytrained HMMs in highly reverberant environments.
Armin Sehr, Roland Maas, Walter Kellermann
Added 11 Feb 2011
Updated 11 Feb 2011
Type Journal
Year 2010
Where ICASSP
Authors Armin Sehr, Roland Maas, Walter Kellermann
Comments (0)