Towards Optimal TTS Corpora



Unit selection text-to-speech systems currently produce very natural synthesized phrases by concatenating speech segments from a large database. Recently, increasing demand for high-quality voices designed with less data has created a need for further optimization of the textual corpus recorded by the speaker. This corpus is traditionally the result of a condensation process: sentences are selected from a reference corpus using an optimization algorithm (generally greedy) guided by the coverage rate of classic units (diphones, triphones, words...). Such an approach is, however, strongly constrained by the finite content of the reference corpus, which limits the range of language it can offer. To gain flexibility in the optimization process, in this paper we introduce a new corpus-building procedure based on sentence construction rather than sentence selection. Sentences are generated using Finite State Transducers, assisted by a human operator and guided by a new frequency-weighted coverage...
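The traditional condensation process mentioned in the abstract (greedy sentence selection guided by unit coverage) can be illustrated with a minimal sketch. This is not the authors' implementation, and it does not show the paper's sentence-construction method. It assumes character bigrams inside each word as a crude stand-in for diphones, and the names `units`, `greedy_select`, and `coverage_rate` are hypothetical.

```python
from typing import Dict, List, Set


def units(sentence: str) -> Set[str]:
    """Assumed stand-in for diphones: character bigrams inside each word."""
    found: Set[str] = set()
    for word in sentence.lower().split():
        for i in range(len(word) - 1):
            found.add(word[i:i + 2])
    return found


def greedy_select(reference: List[str], max_sentences: int) -> List[str]:
    """Repeatedly pick the sentence that adds the most uncovered units."""
    unit_sets: Dict[str, Set[str]] = {s: units(s) for s in reference}
    covered: Set[str] = set()
    selected: List[str] = []
    while len(selected) < max_sentences:
        best = max(
            (s for s in reference if s not in selected),
            key=lambda s: len(unit_sets[s] - covered),
            default=None,
        )
        # Stop when nothing is left or no sentence adds a new unit.
        if best is None or not (unit_sets[best] - covered):
            break
        selected.append(best)
        covered |= unit_sets[best]
    return selected


def coverage_rate(selected: List[str], reference: List[str]) -> float:
    """Fraction of the reference corpus's distinct units that are covered."""
    all_units: Set[str] = set()
    for s in reference:
        all_units |= units(s)
    if not all_units:
        return 0.0
    got: Set[str] = set()
    for s in selected:
        got |= units(s)
    return len(got & all_units) / len(all_units)


if __name__ == "__main__":
    corpus = ["the cat sat", "the cat", "a dog ran", "sat"]
    chosen = greedy_select(corpus, max_sentences=2)
    print(chosen, coverage_rate(chosen, corpus))
```

The sketch also shows the limitation the abstract points out: the selection can only ever cover units that already occur in the reference corpus.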

Didier Cadic, Cédric Boidin, Christophe d'Alessandro


Coverage Rate | Education | LREC 2010 | Reference Corpus | Textual Corpus |


Post Info

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2010
Where	LREC
Authors	Didier Cadic, Cédric Boidin, Christophe d'Alessandro
