Sciweavers

TSD
2005
Springer

Robust Rule-Based Method for Automatic Break Assignment in Russian Texts

This paper discusses a new rule-based approach to break assignment for Russian. It is a flexible and robust method for segmenting Russian texts into prosodic units. We implemented it in the recent “Orator” text-to-speech (TTS) system. The model was developed for inflectional languages as an alternative to both statistical and strict rule-based algorithms. It is designed so that all potentially tunable context dependencies are exposed in the interface grammar, where linguists can easily modify them. The algorithm performs well on different kinds of texts thanks to this simple and intuitive grammar, which is built upon an elaborate mechanism of morphogrammatical analysis. The juncture correct rate ranges from more than 98% for simple literary texts to 85% for raw transcripts of spontaneous speech.
Ilya Oparin
Added 28 Jun 2010
Updated 28 Jun 2010
Type Conference
Year 2005
Where TSD
Authors Ilya Oparin