Probablistic modelling of F0 in unvoiced regions in HMM based speech synthesis

15 years 10 months ago

Download mi.eng.cam.ac.uk

HMM based synthesis has attracted great interest due to its compact and ﬂexible modelling of spectral and prosodic parameters. In this approach, short term spectra, fundamental frequency (F0) and duration are simultaneously modelled by multi-stream HMMs. However, since F0 values in unvoiced regions are normally considered as undeﬁned, it is difﬁcult to use standard HMMs for F0 modelling. The currently preferred solution to this is to use a multi-space distribution HMM (MSDHMM) in which discrete distributions are used for modelling the voiced/unvoiced decision and continuous Gaussian distributions are used for modelling the F0 values within the voiced regions. However, the assumption of undeﬁned unvoiced F0 regions and the special structure of the MSDHMM lead to limitations in the accurate modelling of F0 patterns. In this paper an alternative is explored whereby unvoiced F0 values are assumed to exist and are modelled within the standard HMM framework using a globally tied dis...

Kai Yu, Tomoki Toda, Milica Gasic, Simon Keizer, F

Real-time Traffic

HMM Based Synthesis | ICASSP 2009 | Modelling | Signal Processing | Standard Hmm |

claim paper

» Continuous F0 in the sourceexcitation generation for HMMbased TTS Do we need voicedunvoice...

Post Info
More Details (n/a)

Added	21 May 2010
Updated	21 May 2010
Type	Conference
Year	2009
Where	ICASSP
Authors	Kai Yu, Tomoki Toda, Milica Gasic, Simon Keizer, François Mairesse, Blaise Thomson, Steve Young

Comments (0)

Sciweavers

Probablistic modelling of F0 in unvoiced regions in HMM based speech synthesis

HMM Based Synthesis | ICASSP 2009 | Modelling | Signal Processing | Standard Hmm |

Explore & Download

Productivity Tools

Sciweavers