Front-end feature transforms with context filtering for speaker adaptation

12 years 8 months ago

Download mirlab.org

Feature-space transforms such as feature-space maximum likelihood linear regression (FMLLR) are very effective speaker adaptation technique, especially on mismatched test data. In this study, we extend the full-rank square matrix of FMLLR to a non-square matrix that uses neighboring feature vectors in estimating the adapted central feature vector. Through optimizing an appropriate objective function we aim to ﬁlter out and transform features through the correlation of the feature context. We compare to FMLLR that just consider the current feature vector only. Our experiments are conducted on the automobile data with different speed conditions. Results show that context ﬁltering improves 23% on word error rate over conventional FMLLR on noisy 60mph data with adapted ML model, and 7%/9% improvement over the discriminatively trained FMMI/BMMI models.

Jing Huang, Karthik Visweswariah, Peder A. Olsen,

Real-time Traffic

Central Feature Vector | Feature Vector | Feature-space Maximum Likelihood | ICASSP 2011 | Signal Processing |

claim paper

Post Info
More Details (n/a)

Added	21 Aug 2011
Updated	21 Aug 2011
Type	Journal
Year	2011
Where	ICASSP
Authors	Jing Huang, Karthik Visweswariah, Peder A. Olsen, Vaibhava Goel

Comments (0)

Sciweavers

Front-end feature transforms with context filtering for speaker adaptation

Central Feature Vector | Feature Vector | Feature-space Maximum Likelihood | ICASSP 2011 | Signal Processing |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers