A Harmonic Excitation State-Space Approach to Blind Separation of Speech

We discuss an identification framework for noisy speech mixtures. A block-based generative model is formulated that explicitly incorporates the time-varying harmonic plus noise (H+N) model for a number of latent sources observed through noisy convolutive mixtures. All parameters, including the pitches, amplitudes and phases of the source signals, the mixing filters, and the noise statistics, are estimated by maximum likelihood using an EM algorithm. Exact averaging over the hidden sources is obtained using the Kalman smoother. We show that pitch estimation and source separation can be performed simultaneously. The pitch estimates are compared to laryngograph (EGG) measurements. Artificial and real room mixtures are used to demonstrate the viability of the approach. Intelligible speech signals are re-synthesized from the estimated H+N models.
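To make the harmonic plus noise (H+N) idea concrete, the following is a minimal sketch of synthesizing one signal block as a sum of harmonics of a pitch plus white Gaussian noise. It is an illustration only and not the authors' implementation: the function name `harmonic_plus_noise`, its parameters, and the choice of a fixed pitch and fixed amplitudes within the block are assumptions, and the paper's mixing filters, state-space formulation, and EM estimation are not shown.

```python
import math
import random


def harmonic_plus_noise(f0, amplitudes, phases, n_samples, sample_rate,
                        noise_std=0.0, seed=0):
    """Synthesize one block of a harmonic-plus-noise signal.

    Hypothetical helper for illustration: harmonic k (1-based) has
    frequency k * f0, amplitude amplitudes[k-1] and phase phases[k-1].
    Pitch and amplitudes are held constant within the block.
    """
    rng = random.Random(seed)  # seeded so the noise part is reproducible
    block = []
    for n in range(n_samples):
        t = n / sample_rate
        # Harmonic part: sum of cosines at integer multiples of f0.
        harmonic = sum(
            a * math.cos(2.0 * math.pi * (k + 1) * f0 * t + p)
            for k, (a, p) in enumerate(zip(amplitudes, phases))
        )
        # Noise part: white Gaussian noise with standard deviation noise_std.
        block.append(harmonic + rng.gauss(0.0, noise_std))
    return block


# Example: a 20 ms block at 8 kHz with a 100 Hz pitch and three harmonics.
clean = harmonic_plus_noise(100.0, [1.0, 0.5, 0.25], [0.0, 0.0, 0.0],
                            n_samples=160, sample_rate=8000.0)
noisy = harmonic_plus_noise(100.0, [1.0, 0.5, 0.25], [0.0, 0.0, 0.0],
                            n_samples=160, sample_rate=8000.0, noise_std=0.1)
```

With `noise_std=0.0` the block is purely periodic with a period of `sample_rate / f0` samples, which is the structure a pitch estimator can exploit. Re-synthesis from estimated H+N parameters would amount to calling such a function block by block with the estimated pitch, amplitudes and phases.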

Rasmus Kongsgaard Olsson, Lars Kai Hansen

Block-based Generative Model | NIPS 2004 | NIPS 2007 | Noisy Convolutive Mixtures | Noisy Speech Mixtures

Post Info

Added	31 Oct 2010
Updated	31 Oct 2010
Type	Conference
Year	2004
Where	NIPS
Authors	Rasmus Kongsgaard Olsson, Lars Kai Hansen
