Modeling Prosodic Structures in Linguistically Enriched Environments

A significant challenge in Text-to-Speech (TtS) synthesis is the formulation of the prosodic structures (phrase breaks, pitch accents, phrase accents and boundary tones) of utterances. Robust prediction of these elements relies on the accuracy and quality of error-prone linguistic procedures, such as the identification of the part of speech and the syntactic tree. Additional linguistic factors, such as rhetorical relations, improve the naturalness of the prosody but are hard to extract from plain text. In this work, we propose a method to generate enhanced prosodic events for TtS by utilizing accurate, error-free, high-level linguistic information. We also present an XML annotation scheme to encode syntax, grammar, new or given information, phrase subject/object information, as well as rhetorical elements. This linguistically enriched information has been utilized to build realistic machine learning models for the prediction of the prosodic structure...
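As a rough illustration of how linguistically enriched markup might feed a prosody model, the sketch below parses a small annotated utterance into per-word feature records. The element and attribute names (`utterance`, `phrase`, `word`, `pos`, `info`, `role`), the sample sentence, and the `extract_features` helper are all hypothetical; they are not the annotation scheme or the models described in the paper.

```python
import xml.etree.ElementTree as ET

# Hypothetical markup: element and attribute names are illustrative only,
# not the annotation scheme defined in the paper.
SAMPLE = """
<utterance>
  <phrase type="NP" role="subject">
    <word pos="DT" info="given">the</word>
    <word pos="NN" info="new">museum</word>
  </phrase>
  <phrase type="VP" role="predicate">
    <word pos="VBZ" info="given">opens</word>
    <word pos="RB" info="new">today</word>
  </phrase>
</utterance>
"""


def extract_features(xml_text):
    """Flatten the annotated utterance into one feature dict per word."""
    root = ET.fromstring(xml_text)
    rows = []
    for phrase in root.iter("phrase"):
        words = phrase.findall("word")
        for i, word in enumerate(words):
            rows.append({
                "token": word.text,
                "pos": word.get("pos"),            # part of speech
                "info": word.get("info"),          # new vs. given information
                "phrase_type": phrase.get("type"),
                "role": phrase.get("role"),        # e.g. subject
                # Last word of a phrase: a plausible phrase-break candidate.
                "phrase_final": i == len(words) - 1,
            })
    return rows


features = extract_features(SAMPLE)
for row in features:
    print(row)
```

Records of this kind could serve as input to a classifier that predicts phrase breaks or pitch accents; which features and learners the authors actually used is not stated in this listing.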

Gerasimos Xydas, Dimitris Spiliotopoulos, Georgios Kouroupetroglou

Error-prone Linguistic Procedures | Plain Text | Prosodic Structures | Signal Processing | TSD 2004 |

» HMM-based prosodic structure model using rich linguistic context

» Integrating Linguistic and Performance-Based Constraints for Assigning Phrase Breaks

» Appropriately Handled Prosodic Breaks Help PCFG Parsing

» ProSynth: an integrated prosodic approach to device-independent, natural-sounding speech synth...

» Querying Linguistic Trees

» An Evolving eScience Environment for Research Data in Linguistics

» Incorporating Linguistic Information to Statistical Word-Level Alignment

» The Bell Labs German text-to-speech system

Post Info

Added	02 Jul 2010
Updated	02 Jul 2010
Type	Conference
Year	2004
Where	TSD
Authors	Gerasimos Xydas, Dimitris Spiliotopoulos, Georgios Kouroupetroglou
