A set of corpus-based text-to-speech synthesis technologies for Mandarin Chinese

This paper presents a set of corpus-based text-to-speech synthesis technologies for Mandarin Chinese. A large speech corpus produced by a single speaker is used, and the speech output is synthesized from waveform units of variable lengths, with desired linguistic properties, retrieved from this corpus. Detailed methodologies were developed for designing "phonetically rich" and "prosodically rich" corpora by automatically selecting sentences from a large text corpus to include as many desired phonetic combinations and prosodic features as possible. Automatic phonetic labeling with iterative correction rules and automatic prosodic labeling with a multi-pass top-down procedure were also developed such that the labeling process for the corpora can be completely automatic. Hierarchical prosodic structure for an arbitrary desired text sentence is then generated based on the identification of different levels of break indices, and the prosodic feature sets and appropriate ...
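The abstract says only that sentences are selected automatically from a large text corpus to cover as many desired phonetic combinations and prosodic features as possible; it does not name the selection algorithm. As a rough illustration, the sketch below assumes a greedy set-cover strategy, which is one common way to build such a corpus and is not confirmed as the authors' method. The function name `greedy_select`, the `budget` parameter, and the toy unit labels are all hypothetical.

```python
from typing import Dict, List, Set


def greedy_select(sentence_units: Dict[str, Set[str]], budget: int) -> List[str]:
    """Pick up to `budget` sentences, each time taking the sentence that adds
    the most units not yet covered (greedy set cover; an assumed strategy)."""
    covered: Set[str] = set()
    chosen: List[str] = []
    remaining = dict(sentence_units)  # copy so the caller's dict is untouched
    while remaining and len(chosen) < budget:
        # Iterate in sorted order so ties resolve deterministically
        # (max returns the first maximal item it sees).
        best = max(sorted(remaining), key=lambda s: len(remaining[s] - covered))
        gain = remaining[best] - covered
        if not gain:
            break  # no remaining sentence contributes a new unit
        chosen.append(best)
        covered |= gain
        del remaining[best]
    return chosen


# Toy candidates: sentence id -> set of unit labels (hypothetical placeholders
# standing in for phonetic combinations or prosodic features).
candidates = {
    "s1": {"ni-hao", "hao-ma"},
    "s2": {"ni-hao"},
    "s3": {"hao-ma", "ma-fan", "fan-ni"},
    "s4": {"xie-xie"},
}

selected = greedy_select(candidates, budget=2)
```

In this toy run `s3` is taken first because it covers three new units, and the remaining picks each add one. A real corpus design would likely weight units by rarity or desirability rather than counting them equally; that refinement is left out here.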

Fu-Chiang Chou, Chiu-yu Tseng, Lin-Shan Lee

Mandarin Chinese | Prosodic Feature | TASLP 2002 | Waveform Units

Added	23 Dec 2010
Updated	23 Dec 2010
Type	Journal
Year	2002
Where	TASLP
Authors	Fu-Chiang Chou, Chiu-yu Tseng, Lin-Shan Lee
