
INTERSPEECH
2010

Integration of multilayer regression analysis with structure-based pronunciation assessment

Automatic pronunciation assessment faces several difficulties. Adequacy in controlling the vocal organs is often estimated from the spectral envelopes of input utterances, but the envelope patterns are also affected by other factors such as speaker identity. Recently, a new method of speech representation was proposed in which these non-linguistic variations are effectively removed by modeling only the contrastive aspects of speech features. This representation is called speech structure. However, the often excessively high dimensionality of the speech structure can degrade the performance of structure-based pronunciation assessment. To deal with this problem, we integrate multilayer regression analysis with the structure-based assessment. The results show higher correlation between human and machine scores, and much higher robustness to speaker differences, than the widely used GOP-based analysis.
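
The idea of keeping only contrastive aspects can be illustrated with a small sketch. It is not the authors' implementation: it assumes each speech unit is modeled as a Gaussian, that contrast is measured with the Bhattacharyya distance, and that speaker differences act as an invertible affine transform of the feature space. None of these choices is stated in the abstract, and the helper names (`bhattacharyya`, `structure_vector`, `random_units`) are hypothetical. Under those assumptions the vector of pairwise distances is unchanged by the transform, and its length grows as N(N-1)/2 for N units, which hints at the dimensionality problem mentioned above.

```python
import numpy as np


def bhattacharyya(mu1, cov1, mu2, cov2):
    """Bhattacharyya distance between two Gaussian distributions."""
    cov = (cov1 + cov2) / 2.0
    diff = mu1 - mu2
    # Mean term: Mahalanobis-like distance under the averaged covariance.
    term_mean = diff @ np.linalg.solve(cov, diff) / 8.0
    # Covariance term, computed with log-determinants for stability.
    _, logdet = np.linalg.slogdet(cov)
    _, logdet1 = np.linalg.slogdet(cov1)
    _, logdet2 = np.linalg.slogdet(cov2)
    term_cov = 0.5 * (logdet - 0.5 * (logdet1 + logdet2))
    return term_mean + term_cov


def structure_vector(means, covs):
    """Upper triangle of the pairwise distance matrix, as a flat vector."""
    n = len(means)
    return np.array([
        bhattacharyya(means[i], covs[i], means[j], covs[j])
        for i in range(n)
        for j in range(i + 1, n)
    ])


def random_units(n_units, dim, rng):
    """Toy Gaussians standing in for speech units (assumption: synthetic data)."""
    means = rng.normal(size=(n_units, dim))
    covs = []
    for _ in range(n_units):
        a = rng.normal(size=(dim, dim))
        covs.append(a @ a.T + dim * np.eye(dim))  # symmetric positive definite
    return means, np.array(covs)


rng = np.random.default_rng(0)
means, covs = random_units(n_units=5, dim=3, rng=rng)

# Hypothetical "speaker difference": an invertible affine map x -> A x + b.
A = rng.normal(size=(3, 3)) + 3.0 * np.eye(3)
b = rng.normal(size=3)
means_t = means @ A.T + b
covs_t = A @ covs @ A.T

original = structure_vector(means, covs)
shifted = structure_vector(means_t, covs_t)

print(len(original))                   # number of unit pairs
print(np.allclose(original, shifted))  # distances survive the transform
```

With real data the units would be estimated from an utterance rather than sampled at random, and the resulting vector would be the input to whatever regression stage is used for scoring.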
Masayuki Suzuki, Yu Qiao, Nobuaki Minematsu, Keikichi Hirose
Added 19 May 2011
Updated 19 May 2011
Type Journal
Year 2010
Where INTERSPEECH
Authors Masayuki Suzuki, Yu Qiao, Nobuaki Minematsu, Keikichi Hirose