Enriching Mandarin speech recognition by incorporating a hierarchical prosody model

12 years 8 months ago

Download mirlab.org

This paper presents a new probabilistic framework of Mandarin speech recognition by incorporating a sophisticated hierarchical prosody model into the conventional HMM-based system. The prosody model describes the relations of linguistic cues of various levels, break types and prosodic states which represent the prosody hierarchical structure, and prosody-related acoustic features. Aside from producing the recognized word sequences, the system also decodes other information including word’s part-of-speech, punctuation marks, inter-syllable break types, and prosodic states of syllables. Experimental results on the TCC300 corpus, which consists of paragraphic utterances, showed that the proposed system significantly outperformed the baseline system. The word and character error rates decreased from 24.4% and 18.1% to 20.7% and 14.4% (or 15.2% and 20.4% relative improvements), respectively.

Jyh-Her Yang, Ming-Chieh Liu, Hao-Hsiang Chang, Ch

Real-time Traffic

Break Types | ICASSP 2011 | Prosodic States | Prosody Model | Signal Processing |

claim paper

Post Info
More Details (n/a)

Added	20 Aug 2011
Updated	20 Aug 2011
Type	Journal
Year	2011
Where	ICASSP
Authors	Jyh-Her Yang, Ming-Chieh Liu, Hao-Hsiang Chang, Chen-Yu Chiang, Yih-Ru Wang, Sin-Horng Chen

Comments (0)

Sciweavers

Enriching Mandarin speech recognition by incorporating a hierarchical prosody model

Break Types | ICASSP 2011 | Prosodic States | Prosody Model | Signal Processing |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers