Sciweavers

ICASSP
2011
IEEE

Cross-language bootstrapping based on completely unsupervised training using multilingual A-stabil

12 years 8 months ago
Cross-language bootstrapping based on completely unsupervised training using multilingual A-stabil
This paper presents our work on rapid language adaptation of acoustic models based on multilingual cross-language bootstrapping and unsupervised training. We used Automatic Speech Recognition (ASR) systems in English, French, German, and Spanish to build a Czech ASR system from scratch. System building was performed without using any transcribed audio data by applying three consecutive steps, i.e. cross-language transfer, unsupervised training based on the “multilingual A-stabil“ confidence score [1], and bootstrapping. Based on the confidence score we selected 72% (16.6 hours) of the available audio data with a transcription WER of less than 14.5%. The cross-language bootstrap achieves a word error rate of 23.3% on the Czech development set and 22.4% on the evaluation set. These results are very promising as the performance compares favorably to the Czech ASR system which was trained on 23 hours of
Ngoc Thang Vu, Franziska Kraus, Tanja Schultz
Added 21 Aug 2011
Updated 21 Aug 2011
Type Journal
Year 2011
Where ICASSP
Authors Ngoc Thang Vu, Franziska Kraus, Tanja Schultz
Comments (0)