Sciweavers

ICASSP
2011
IEEE

One sentence voice adaptation using GMM-based frequency-warping and shift with a sub-band basis spectrum model

12 years 8 months ago
One sentence voice adaptation using GMM-based frequency-warping and shift with a sub-band basis spectrum model
This paper presents a rapid voice adaptation algorithm using GMM-based frequency warping and shift with parameters of a subband basis spectrum model (SBM)[1]. The SBM parameter represents a shape of a spectrum of speech. It is calculated by fitting a sub-band basis to the log-spectrum. Since the parameter is the frequency domain representation, frequency warping can be directly applied to the SBM parameter. A frequency warping function that minimize the distance between source and target SBM parameter pairs in each mixture component of a GMM is derived using a DP (Dynamic programming) algorithm. The proposed method is evaluated in an unit-selection based voice adaptation framework applied to a unit-fusion based text-to-speech synthesizer. The experimental results show that the proposed adaptation method is effective for rapid voice adaptation using just one sentence, compared to the conventional GMM.-based linear transformation of mel-cepstra.
Masatsune Tamura, Masahiro Morita, Takehiko Kagosh
Added 20 Aug 2011
Updated 20 Aug 2011
Type Journal
Year 2011
Where ICASSP
Authors Masatsune Tamura, Masahiro Morita, Takehiko Kagoshima, Masami Akamine
Comments (0)