
ICASSP 2010, IEEE

Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis

In the EMIME project, we are developing a mobile device that performs personalized speech-to-speech translation, such that a user’s spoken input in one language is used to produce spoken output in another language while continuing to sound like the user’s voice. We integrate two techniques, unsupervised adaptation for HMM-based TTS using a word-based large-vocabulary continuous speech recognizer and cross-lingual speaker adaptation for HMM-based TTS, into a single architecture, yielding an unsupervised cross-lingual speaker adaptation system. Listening tests show very promising results, demonstrating that adapted voices sound similar to the target speaker and that differences between supervised and unsupervised cross-lingual speaker adaptation are small.
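The abstract describes a pipeline rather than an API. The sketch below is only a schematic restatement of that flow in Python; the function names (recognize, estimate_adaptation, map_cross_lingual, synthesize) are placeholders introduced for illustration and are not code from the paper, the EMIME project, or any specific toolkit.

```python
# Schematic sketch of the described pipeline, under assumed placeholder names.
# Real systems would use an LVCSR engine and HMM-based TTS adaptation tools.

def recognize(utterances):
    """Word-based LVCSR hypothesizes transcriptions for the adaptation data,
    so no manual labels are needed (this is the unsupervised step)."""
    return [f"hypothesized transcription of {u}" for u in utterances]

def estimate_adaptation(utterances, transcripts):
    """Estimate speaker-adaptation transforms for the input-language
    HMM-based TTS average-voice model from the recognized transcripts."""
    return {"transforms": list(zip(utterances, transcripts))}

def map_cross_lingual(adaptation):
    """Carry the speaker transforms over to the output-language TTS model
    (the cross-lingual speaker adaptation step)."""
    return {"output_language_transforms": adaptation["transforms"]}

def synthesize(text, adapted_model):
    """Generate output-language speech with the adapted voice."""
    return f"waveform for '{text}' in the adapted voice"

if __name__ == "__main__":
    user_speech = ["utt_001.wav", "utt_002.wav"]
    transcripts = recognize(user_speech)                      # unsupervised labels
    adaptation = estimate_adaptation(user_speech, transcripts)
    adapted_tts = map_cross_lingual(adaptation)               # cross-lingual step
    print(synthesize("translated sentence", adapted_tts))
```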
Added 06 Dec 2010
Updated 06 Dec 2010
Type Conference
Year 2010
Where ICASSP
Authors Keiichiro Oura, Keiichi Tokuda, Junichi Yamagishi, Simon King, Mirjam Wester