Sciweavers

ICMCS
2009
IEEE

Acoustic modeling using an extended phone set considering cross-lingual pronunciation variations

13 years 2 months ago
Acoustic modeling using an extended phone set considering cross-lingual pronunciation variations
To deal with the issue of data unbalanced condition among a task of multilingual speech recognition and a phenomenon of pronunciation variations across languages, we propose an approach to clustering context dependent phones from an extended phone set in an acoustic model trained on a data unbalanced bilingual corpus. First, we generate an extended phone set using pronunciation modeling by a confidence measure between Mandarin and Taiwanese. Second, we use a two-step agglomerative hierarchical clustering with delta Bayesian information criteria to automatically generate a merged extended phone set (MEPS). Third, we choose a parametric modeling technique, model complexity selection, to increase the final number of Gaussian components dependent on the available training data in a data unbalanced condition. The experimental results show that the proposed automatic extending phone clustering approach reduced relative syllable error rate by 8.3% over the best result of the decision tree ba...
Dau-Cheng Lyu, Ren-Yuan Lyu, Ming-Tat Ko
Added 19 Feb 2011
Updated 19 Feb 2011
Type Journal
Year 2009
Where ICMCS
Authors Dau-Cheng Lyu, Ren-Yuan Lyu, Ming-Tat Ko
Comments (0)