Gaussian Mixture Modeling Using Short Time Fourier Transform Features for Audio Fingerprinting

15 years 6 months ago

Download www.cecs.uci.edu

In audio ﬁngerprinting, an audio clip must be recognized by matching an extracted ﬁngerprint to a database of previously computed ﬁngerprints. The ﬁngerprints should reduce the dimensionality of the input signiﬁcantly, provide discrimination among different audio clips, and at the same time, invariant to the distorted versions of the same audio clip. In this paper, we design ﬁngerprints addressing the above issues by modeling an audio clip by Gaussian mixture models (GMM) using a wide range of easy-to-compute short time Fourier transform features such as Shannon entropy, Renyi entropy, spectral centroid, spectral bandwidth, spectral ﬂatness measure, spectral crest factor, and Mel-frequency cepstral coefﬁcients. We test the robustness of the ﬁngerprints under a large number of distortions. To make the system robust, we use some of the distorted versions of the audio for training. However, we show that the audio ﬁngerprints modeled using GMM are not only robust to th...

Arunan Ramalingam, Sridhar Krishnan

Real-time Traffic

Audio Clip | Distorted Versions | ICMCS 2005 | Spectral Centroid |

claim paper

Added	24 Jun 2010
Updated	24 Jun 2010
Type	Conference
Year	2005
Where	ICMCS
Authors	Arunan Ramalingam, Sridhar Krishnan

Sciweavers

Gaussian Mixture Modeling Using Short Time Fourier Transform Features for Audio Fingerprinting

Audio Clip | Distorted Versions | ICMCS 2005 | Spectral Centroid |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers