Statistical approach to enhancing esophageal speech based on Gaussian mixture models

15 years 5 months ago

Download spalab.naist.jp

This paper presents a novel method of enhancing esophageal speech using statistical voice conversion. Esophageal speech is one of the alternative speaking methods for laryngectomees. Although it doesn’t require any external devices, generated voices sound unnatural. To improve the intelligibility and naturalness of esophageal speech, we propose a voice conversion method from esophageal speech into normal speech. A spectral parameter and excitation parameters of target normal speech are separately estimated from a spectral parameter of the esophageal speech based on Gaussian mixture models. The experimental results demonstrate that the proposed method yields signiﬁcant improvements in intelligibility and naturalness. We also apply one-to-many eigenvoice conversion to esophageal speech enhancement for ﬂexibly controlling enhanced voice quality.

Hironori Doi, Keigo Nakamura, Tomoki Toda, Hiroshi

Real-time Traffic

Esophageal Speech | ICASSP 2010 | Normal Speech | Signal Processing | Voice Conversion |

claim paper

» An evaluation of alaryngeal speech enhancement methods based on voice conversion technique...

» Gaussian ModelBased Multichannel Speech Presence Probability

» Speech Enhancement Using Gaussian Scale Mixture Models

» Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian ...

» Mixture of Support Vector Machines for HMM based Speech Recognition

» Information Theoretic Expectation Maximization Based Gaussian Mixture Modeling for Speaker...

» Maximum likelihood approach to speech enhancement for noisy reverberant signals

» Multilingual acoustic modeling for speech recognition based on subspace Gaussian Mixture M...

Post Info
More Details (n/a)

Added	06 Dec 2010
Updated	06 Dec 2010
Type	Conference
Year	2010
Where	ICASSP
Authors	Hironori Doi, Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano

Comments (0)

Sciweavers

Statistical approach to enhancing esophageal speech based on Gaussian mixture models

Esophageal Speech | ICASSP 2010 | Normal Speech | Signal Processing | Voice Conversion |

Explore & Download

Productivity Tools

Sciweavers