Acoustic model adaptation via Linear Spline Interpolation for robust speech recognition

We recently proposed Linear Spline Interpolation (LSI), a new algorithm for adapting acoustic models to noisy environments. In this method, the nonlinear relationship between clean and noisy speech features is modeled using linear spline regression. Linear spline parameters that minimize the error between the predicted noisy features and the actual noisy features are learned from training data. A variance associated with each spline segment captures the uncertainty in the assumed model. In this work, we extend the LSI algorithm in two ways. First, the adaptation scheme is extended to compensate for the presence of linear channel distortion. Second, we show how the noise and channel parameters can be updated during decoding in an unsupervised manner within the LSI framework. Using LSI, we obtain an average relative improvement in word error rate of 10.8% over vector Taylor series (VTS) adaptation on the Aurora 2 task, with improvements of 15-18% at SNRs between 10 and 15 dB.
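
To make the regression idea in the abstract concrete, here is a minimal sketch of fitting a linear spline to a nonlinear relationship and attaching a residual variance to each segment. It is an illustration only, not the authors' implementation: the softplus curve used as the toy nonlinearity, the synthetic data, the knot locations, and the function names fit_linear_spline, predict, and segment_variances are all assumptions made for this example, and it does not cover channel compensation or the unsupervised parameter updates.

```python
import numpy as np

def design_matrix(x, knots):
    # Columns: intercept, slope, and one hinge term max(0, x - k) per knot.
    # This basis yields a continuous piecewise-linear function (a linear spline).
    return np.column_stack(
        [np.ones_like(x), x] + [np.maximum(0.0, x - k) for k in knots]
    )

def fit_linear_spline(x, y, knots):
    # Least-squares estimate of the spline coefficients for fixed knots.
    coef, *_ = np.linalg.lstsq(design_matrix(x, knots), y, rcond=None)
    return coef

def predict(x, knots, coef):
    return design_matrix(x, knots) @ coef

def segment_variances(x, y, knots, coef):
    # Residual variance within each spline segment; a rough per-segment
    # measure of how uncertain the piecewise-linear model is there.
    resid = y - predict(x, knots, coef)
    seg = np.digitize(x, knots)  # segment index in 0..len(knots)
    return np.array([resid[seg == s].var() for s in range(len(knots) + 1)])

# Synthetic toy data (assumption): a smooth nonlinearity plus Gaussian noise.
rng = np.random.default_rng(0)
x = rng.uniform(-10.0, 10.0, 5000)
y = np.log1p(np.exp(x)) + rng.normal(0.0, 0.3, x.size)

knots = np.array([-5.0, -2.5, 0.0, 2.5, 5.0])  # hypothetical knot placement
coef = fit_linear_spline(x, y, knots)
var = segment_variances(x, y, knots, coef)
rmse = np.sqrt(np.mean((y - predict(x, knots, coef)) ** 2))
```

Fixed knots keep the fit a single linear least-squares problem; choosing or learning knot positions would be a separate design decision that this sketch does not address.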

Michael L. Seltzer, Alex Acero, Kaustubh Kalgaonkar

ICASSP 2010 | Linear Spline | Linear Spline Interpolation | Noisy Features | Signal Processing

Added	06 Dec 2010
Updated	06 Dec 2010
Type	Conference
Year	2010
Where	ICASSP
Authors	Michael L. Seltzer, Alex Acero, Kaustubh Kalgaonkar
