Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

10

ICASSP
2008
IEEE

favoriteEmaildiscussreport

101views Signal Processing» more ICASSP 2008»

Single-channel speech separation based on modulation frequency

13 years 10 months ago

Single-channel speech separation based on modulation frequency

Download www.cs.cmu.edu

This paper describes an algorithm that performs a simple form of computational auditory scene analysis to separate multiple speech signals from one another on the basis of the modulation frequencies of the components. The most novel aspect of the algorithm is the use of the cross-correlation of the instantaneous frequencies of the components of a signal to identify and separate those components that are likely have been produced by a common sound source. The putative desired target speech signal is reconstructed by choosing those components that have the greatest mutual correlation, and then using extrinsic information such as fundamental frequency or speaker identiﬁcation to determine which component clusters belong to which speaker. The system was evaluated by comparing speech recognition accuracy of a target speech signal that was extracted from a mixture of two speakers. It was found that recognition accuracy obtained when the separation was based on cross-correlation of changes...

Lingyun Gu, Richard M. Stern

Real-time Traffic

ICASSP 2008 | Recognition Accuracy | Signal Processing | Speech Signal | Target Speech Signal |

claim paper

Related Content

» SourceFilterBased SingleChannel Speech Separation Using Pitch Information

» Discrimination of speech from nonspeeech in broadcast news based on modulation frequency f...

» WideBand Audio Coding Based on FrequencyDomain Linear Prediction

» Recognizing Reverberant Speech Based on Amplitude and Frequency Modulation

» Speech separation using speakeradapted eigenvoice speech models

» Joint frequency spectral lag representation for crossfrequency modulation analysis in the ...

» Semiblind SpeechMusic Separation Using Sparsity and Continuity Priors

» Integrating binaural cues and blind source separation method for separating reverberant sp...

» Automatic Detection of Prominent Words in Russian Speech

Post Info
More Details (n/a)

Added	30 May 2010
Updated	30 May 2010
Type	Conference
Year	2008
Where	ICASSP
Authors	Lingyun Gu, Richard M. Stern

Comments (0)