Rapid feature space MLLR speaker adaptation with bilinear models

14 years 8 months ago

Download mirlab.org

In this paper, we propose a novel method for rapid feature space Maximum Likelihood Linear Regression (FMLLR) speaker adaptation based on bilinear models. When the amount of adaptation data is limited, the conventional FMLLR transforms can be easily over-trained and can even degrade the performance. In such cases, usually by introducing structural constraints on the FMLLR transformation, the original FMLLR adaptation method can be modified for rapid adaptation. The objective of our bilinear model is to introduce a prior knowledge analysis on the training speakers based on Singular Vector Decomposition (SVD), and to incorporate it in the decoding process. This can effectively reduce the number of free parameters of FMLLR transformation and achieve performance improvements even with limited adaptation data. The efficiency of the proposed algorithm is demonstrated with experiments on the Mandarin digital dataset and the Mandarin voice search dataset respectively.

Shilei Zhang, Peder A. Olsen, Yong Qin

Real-time Traffic

Adaptation Data | Bilinear Model | FMLLR Transformation | ICASSP 2011 | Signal Processing |

claim paper

» A new method for speaker adaptation using bilinear model

» Subspace constrained LU decomposition of FMLLR for rapid adaptation

» Constrained discriminative mapping transforms for unsupervised speaker adaptation

Post Info
More Details (n/a)

Added	21 Aug 2011
Updated	21 Aug 2011
Type	Journal
Year	2011
Where	ICASSP
Authors	Shilei Zhang, Peder A. Olsen, Yong Qin

Comments (0)

Sciweavers

Rapid feature space MLLR speaker adaptation with bilinear models

Adaptation Data | Bilinear Model | FMLLR Transformation | ICASSP 2011 | Signal Processing |

Explore & Download

Productivity Tools

Sciweavers