Irrelevant variability normalization based HMM training using MAP estimation of feature transforms for robust speech recognition

Over the past several years, we have studied feature transformation (FT) approaches to robust automatic speech recognition (ASR) that compensate for possible “distortions” caused by factors irrelevant to phonetic classification, in both the training and recognition stages. Several FT functions with different degrees of flexibility have been studied, and the corresponding maximum likelihood (ML) training techniques developed. In this paper, we study a new FT function that takes the most flexible form of frame-dependent linear transformation. Maximum a posteriori (MAP) estimation is used to estimate the FT function parameters, to deal with the possible problem of insufficient training data caused by the increased number of model parameters. The effectiveness of the proposed approach is confirmed by evaluation experiments on the Finnish Aurora3 database.
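
To make the two ideas in the abstract concrete, here is a minimal Python sketch. It is not the authors' formulation: the abstract does not say how a transform is selected for each frame or which prior is used, so the per-frame labels, the Gaussian prior on a bias vector, and the weight `tau` are all assumptions made for illustration, as are the function names.

```python
def apply_frame_dependent_transform(frames, labels, A, b):
    """Apply y_t = A[k_t] x_t + b[k_t] to every frame.

    frames: list of feature vectors x_t (lists of floats)
    labels: hypothetical per-frame transform index k_t (an assumption;
            the abstract does not say how the transform is chosen)
    A, b:   one matrix and one bias vector per transform index
    """
    out = []
    for x, k in zip(frames, labels):
        y = [sum(a_ij * x_j for a_ij, x_j in zip(row, x)) + b_i
             for row, b_i in zip(A[k], b[k])]
        out.append(y)
    return out


def map_bias(residuals, prior_mean, tau):
    """MAP estimate of a bias vector under an assumed Gaussian prior.

    residuals:  observed per-frame offsets assigned to this transform
    prior_mean: mean of the assumed prior on the bias
    tau:        ratio of observation variance to prior variance;
                tau = 0 reduces to the ML estimate (the sample mean)
    """
    n = len(residuals)
    dim = len(prior_mean)
    sums = [sum(r[i] for r in residuals) for i in range(dim)]
    # With few frames the estimate stays near the prior mean;
    # with many frames it approaches the sample mean.
    return [(sums[i] + tau * prior_mean[i]) / (n + tau) for i in range(dim)]


frames = [[1.0, 2.0], [3.0, 4.0]]
labels = [0, 1]
A = [[[1.0, 0.0], [0.0, 1.0]], [[2.0, 0.0], [0.0, 2.0]]]
b = [[0.5, 0.5], [0.0, 0.0]]
transformed = apply_frame_dependent_transform(frames, labels, A, b)
bias = map_bias([[2.0], [4.0]], [0.0], 2.0)
```

The shrinkage in `map_bias` illustrates why a prior helps when the number of parameters grows: a transform that sees only a handful of frames falls back toward the prior instead of overfitting those frames.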

Donglai Zhu, Qiang Huo